NVIDIA H200 NVL
About
The GPU for Generative AI and HPC
The NVIDIA H200 Tensor Core GPU supercharges generative AI and high-performance computing (HPC) workloads with game-changing performance and memory capabilities. As the first GPU with HBM3e, the H200’s larger and faster memory fuels the acceleration of generative AI and large language models (LLMs) while advancing scientific computing for HPC workloads.
Specification
H200 NVL
FP64
34 TFLOPS
FP64 Tensor Core
67 TFLOPS
FP32
67 TFLOPS
TF32 Tensor Core
989 TFLOPS2²
BFLOAT16 Tensor Core
1,979 TFLOPS²
FP16 Tensor Core
1,979 TFLOPS²
FP8 Tensor Core
3,958 TFLOPS²
INT8 Tensor Core
3,958 TFLOPS²
GPU memory
141GB
GPU memory bandwidth
4.8TB/s
Decoders
7 NVDEC
7 JPEG
Confidential Computing
Supported
Max thermal design power (TDP)
Up to 600W (configurable)
Multi-Instance GPUs
Up to 7 MIGs @16.5GB each
Form factor
PCIe
Interconnect
2- or 4-way NVIDIA NVLink bridge: 900GB/s PCIe Gen5: 128GB/s
Server options
NVIDIA MGX™ H200 NVL partner and NVIDIA-Certified Systems with up to 8 GPUs
NVIDIA AI Enterprise
Included
You May Also Like
Related products
-
NVIDIA QUADRO GV100
SKU: N/A- GPU Memory: 32GB HBM2
- CUDA Cores: 5120
- Single Precision: 14.8 TFLOPs
- Double Precision: 7.4 TFLOPs
-
NVIDIA L40
SKU: 900-2G133-0010-000The NVIDIA L40 brings the highest level of power and performance for visual computing workloads in the data center. Third-generation RT Cores and industry-leading 48 GB of GDDR6 memory deliver up to twice the real-time ray-tracing performance of the previous generation to accelerate high-fidelity creative workflows, including real-time, full-fidelity, interactive rendering, 3D design, video streaming, and virtual production. -
NVIDIA QUADRO RTX8000 PASSIVE
SKU: 900-2G150-0050-000- GPU Memory: 48 GB GDDR6 with ECC
- CUDA Cores: 4608
- NVIDIA Tensor Cores: 576
- NVIDIA RT Cores: 72
Our Partners
Previous
Next