About
CUDA is a parallel computing platform and programming model developed by NVIDIA for general-purpose computing on its own GPUs (graphics processing units). CUDA enables developers to speed up compute-intensive applications by running the parallelizable part of the computation on the GPU.
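How much an application gains from accelerating only its parallelizable part can be roughly estimated with Amdahl's law. This is a standard result that the text above does not state. The symbols p (fraction of runtime that can be parallelized) and s (speedup of that fraction on the GPU) are introduced here for illustration, and the values p = 0.9 and s = 20 are assumptions rather than measurements.

```latex
% Amdahl's law: overall speedup S when a fraction p of the runtime
% is accelerated by a factor s (p and s are illustrative symbols).
S = \frac{1}{(1 - p) + \dfrac{p}{s}}

% Assumed example: p = 0.9, s = 20
S = \frac{1}{(1 - 0.9) + \dfrac{0.9}{20}} = \frac{1}{0.1 + 0.045} = \frac{1}{0.145} \approx 6.9

% Upper bound as s grows without limit, for p = 0.9
\lim_{s \to \infty} S = \frac{1}{1 - p} = 10
```

Under these assumptions the serial 10% of the work caps the overall gain at 10x, however fast the GPU runs the parallel part.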
This post is a super simple introduction to CUDA, the popular parallel computing platform and programming model from NVIDIA. I wrote a previous “Easy Introduction” to CUDA in 2013 that has been very popular over the years. But CUDA programming has gotten easier, and GPUs have gotten much faster, so it’s time for an updated (and even easier) introduction.
CUDA C++ is just one of the ways you can create massively parallel applications with CUDA. It lets you use the powerful C++ programming language to develop high performance algorithms accelerated by thousands of parallel threads running on GPUs. Many developers have accelerated their computation- and bandwidth-hungry applications this way, including the libraries and frameworks that underpin the ongoing revolution in artificial intelligence known as Deep Learning.
Specification
You May Also Like
Related products
NVIDIA QUADRO RTX8000 PASSIVE
SKU: 900-2G150-0050-000
- GPU Memory: 48 GB GDDR6 with ECC
- CUDA Cores: 4608
- NVIDIA Tensor Cores: 576
- NVIDIA RT Cores: 72

NVIDIA TESLA V100-32GB
SKU: 900-2G500-0010-000
- GPU Memory: 32 GB HBM2
- CUDA Cores: 5120
- NVIDIA Tensor Cores: 640
- Single-Precision Performance: 14 TeraFLOPS

NVIDIA DGX A100
SKU: DGXA-2530A+P2CMI00
- 8x NVIDIA A100 GPUs with 320 GB total GPU memory
- 6x NVIDIA NVSwitches
- 9x Mellanox ConnectX-6 200 Gb/s network interface
- Dual 64-core AMD CPUs and 1 TB system memory
- 15 TB Gen4 NVMe SSD