NVIDIA HGX H200
- 141GB of HBM3e GPU memory
- 4.8TB/s of memory bandwidth
- 4 petaFLOPS of FP8 performance
- 2X LLM inference performance
- 110X HPC performance
About
The GPU for Generative AI and HPC
The NVIDIA H200 Tensor Core GPU supercharges generative AI and high-performance computing (HPC) workloads with game-changing performance and memory capabilities. As the first GPU with HBM3e, the H200’s larger and faster memory fuels the acceleration of generative AI and large language models (LLMs) while advancing scientific computing for HPC workloads.
Specification
HGX H200
FP64
34 TFLOPS
FP64 Tensor Core
67 TFLOPS
FP32
67 TFLOPS
TF32 Tensor Core
989 TFLOPS2²
BFLOAT16 Tensor Core
1,979 TFLOPS²
FP16 Tensor Core
1,979 TFLOPS²
FP8 Tensor Core
3,958 TFLOPS²
INT8 Tensor Core
3,958 TFLOPS²
GPU memory
141GB
GPU memory bandwidth
4.8TB/s
Decoders
7 NVDEC
7 JPEG
Confidential Computing
Supported
Max thermal design power (TDP)
Up to 700W (configurable)
Multi-Instance GPUs
Up to 7 MIGs @16.5GB each
Form factor
SXM
Interconnect
NVIDIA NVLink®: 900GB/s
PCIe Gen5: 128GB/s
Server options
NVIDIA HGX™ H200 partner and NVIDIA-Certified Systems™ with 4 or 8 GPUs
NVIDIA AI Enterprise
Add-on
You May Also Like
Related products
-
NVIDIA RTX A5000
SKU: 900-5G132-2500-000- GPU Memory: 24GB GDDR6 with error-correcting code (ECC)
- 4x DisplayPort 1.4
- PCI Express Gen 4 x 16
-
NVIDIA QUADRO RTX8000
SKU: N/A- GPU Memory: 48 GB GDDR6 with ECC
- CUDA Cores: 4608
- NVIDIA Tensor Cores: 576
- NVIDIA RT Cores: 72
-
NVIDIA H100
SKU: 900-21010-0000-000Take an order-of-magnitude leap inaccelerated computing. The NVIDIA H100 Tensor Core GPU delivers unprecedented performance,scalability, and security for every workload. With NVIDIA® NVLink® SwitchSystem, up to 256 H100 GPUs can be connected to accelerate exascaleworkloads, while the dedicated Transformer Engine supports trillion-parameter language models. H100 uses breakthrough innovations in theNVIDIA Hopper™ architecture to deliver industry-leading ...More Information
Our Partners
Previous
Next