cuDNN
About
NVIDIA cuDNN
The NVIDIA CUDA® Deep Neural Network library (cuDNN) is a GPU-accelerated library of primitives for deep neural networks. cuDNN provides highly tuned implementations for standard routines such as forward and backward convolution, pooling, normalization, and activation layers.
Deep learning researchers and framework developers worldwide rely on cuDNN for high-performance GPU acceleration. It allows them to focus on training neural networks and developing software applications rather than spending time on low-level GPU performance tuning. cuDNN accelerates widely used deep learning frameworks, including Caffe2, Chainer, Keras, MATLAB, MxNet, PyTorch, and TensorFlow. For access to NVIDIA optimized deep learning framework containers that have cuDNN integrated into frameworks, visit NVIDIA GPU CLOUD to learn more and get started.
Key Features
- Tensor Core acceleration for all popular convolutions including 2D, 3D, Grouped, Depth-wise separable, and Dilated with NHWC and NCHW inputs and outputs
- Optimized kernels for computer vision and speech models including ResNet, ResNext, SSD, MaskRCNN, Unet, VNet, BERT, GPT-2, Tacotron2 and WaveGlow
- Supports FP32, FP16, and TF32 floating point formats and INT8, and UINT8 integer formats
- Arbitrary dimension ordering, striding, and sub-regions for 4d tensors means easy integration into any neural net implementation
- Speed up fused operations on any CNN architecture
cuDNN is supported on Windows and Linux with Ampere, Turing, Volta, Pascal, Maxwell, and Kepler GPU architectures in data center and mobile GPUs.
Specification
You May Also Like
Related products
-
Pytorch
SKU: N/AProduction Ready Transition seamlessly between eager and graph modes with TorchScript, and accelerate the path to production with TorchServe. Distributed Training Scalable distributed training and performance optimization in research and production is enabled by the torch.distributed backend. Robust Ecosystem A rich ecosystem of tools and libraries extends PyTorch and supports development in computer vision, NLP ...More Information -
Omniverse Marbles RTX
SKU: N/ANVIDIA Marbles RTX is a playable physics-based mini game that was first showcased during the GeForce Ampere launch. It is entirely ray traced, obeys the laws of physics and runs using NVIDIA DLSS integrated for optimal visual sharpening.More Information -
Omniverse Audio2Face
SKU: N/ANVIDIA’s Audio2Face is an Omniverse application that uses a combination of AI technologies to generate facial animation and dialogue lip-sync from an audio source input. The application provides an array of pre- and post-process parameters to fine-tune the animation performance before exporting the result as a geometry cache. Audio2Face requires Windows 64-bit 1909 or above.More Information
Our Customers

























