NVIDIA Tensor Cores
Unprecedented Acceleration for Generative AI
Tensor Cores enable mixed-precision computing, dynamically adapting calculations to accelerate throughput while preserving accuracy and providing enhanced security. The latest generation of Tensor Cores are faster than ever on a broad array of AI and high-performance computing (HPC) tasks. From 4X speedups in training trillion-parameter generative AI models to a 30X increase in inference performance, NVIDIA Tensor Cores accelerate all workloads for modern AI factories.

Revolutionary AI Training
Training multi-trillion-parameter generative AI models in 16-bit floating point (FP16) precision can take months. NVIDIA Tensor Cores provide an order-of-magnitude higher performance with reduced precisions like FP8 in the Transformer Engine. With direct support in native frameworks via CUDA-X™ libraries, implementation is automatic, which dramatically slashes training-to-convergence times while maintaining accuracy.

Breakthrough Inference
Breakthrough Inference Breakthrough Inference Achieving low latency at high throughput while maximizing utilization is the most important performance requirement of deploying inference reliably. The NVIDIA Blackwell architecture’s second-generation Transformer Engine delivers exceptional performance and also has the versatility to accelerate diverse multi-trillion-parameter generative AI models. Tensor Cores has enabled NVIDIA to win MLPerf industry-wide benchmarks for inference.

Advanced HPC
HPC is a fundamental pillar of modern science. To unlock next-generation discoveries, scientists use simulations to better understand complex molecules for drug discovery, physics for potential sources of energy, and atmospheric data to better predict and prepare for extreme weather patterns. NVIDIA Tensor Cores offer a full range of precisions, including FP64, to accelerate scientific computing with the highest accuracy needed. The HPC SDK provides the essential compilers, libraries, and tools for developing HPC applications for the NVIDIA platform.
NVIDIA Blackwell Tensor Cores
Fifth Generation
The Blackwell architecture delivers a 30X speedup compared to the previous NVIDIA Hopper™ generation for massive models such as GPT-MoE-1.8T. This performance boost is made possible with the fifth-generation of Tensor Cores. Blackwell Tensor Cores add new precisions, including community-defined microscaling formats, giving better accuracy and ease of replacement for higher precisions.
New Precision Formats
As generative AI models explode in size and complexity, it’s critical to improve training and inference performance. To meet these compute needs, Blackwell Tensor Cores support new quantization formats and precisions, including community-defined microscaling formats.

Second-Generation Transformer Engine
The second-generation Transformer Engine uses custom Blackwell Tensor Core technology combined with NVIDIA® TensorRT™-LLM and NeMo™ Framework innovations to accelerate inference and training for large language models (LLMs) and mixture-of-experts (MoE) models. The Transformer Engine is fueled by the Tensor Cores’ FP4 precision, doubling performance and efficiency while maintaining high accuracy for current and next-generation MoE models. The Transformer Engine works to democratize today’s LLMs with real-time performance. Enterprises can optimize business processes by deploying state-of-the-art generative AI models with affordable economics.
NVIDIA Hopper Architecture Tensor Cores
Fourth Generation
Since the introduction of Tensor Core technology, NVIDIA Hopper GPUs have increased their peak performance by 60X, fueling the democratization of computing for AI and HPC. The NVIDIA Hopper architecture advances fourth-generation Tensor Cores with the Transformer Engine, using FP8 to deliver 6X higher performance over FP16 for trillion-parameter-model training. Combined with 3X more performance using TF32, FP64, FP16, and INT8 precisions, Hopper Tensor Cores deliver speedups to all workloads.
The Most Powerful End-to-End AI and HPC Data Center Platform
Tensor Cores are essential building blocks of the complete NVIDIA data center solution that incorporates hardware, networking, software, libraries, and optimized AI models and applications from the NVIDIA NGC™ catalog. The most powerful end-to-end AI and HPC platform, it allows researchers to deliver real-world results and deploy solutions into production at scale.
| Blackwell | Hopper | |
|---|---|---|
| Supported Tensor Core precisions | FP64, TF32, BF16, FP16, FP8, INT8, FP6, FP4 | FP64, TF32, BF16, FP16, FP8, INT8 |
| Supported CUDA® Core precisions | FP64, FP32, FP16, BF16 | FP64, FP32, FP16, BF16, INT8 |
*Preliminary specifications, may be subject to change

MacBook
iPad
Apple Watch
Airpods
iMac
Studio Display
iphone
Gaming Laptop

Gaming Desktop
DDR5 Desktop
DDR5 Laptop
DDR5 Server
DDR4 Desktop
DDR4 Laptop
DDR4 Server
DDR3 Desktop
DDR3 Laptop
DDR3 Server
Intel Socket
Intel Z890
Intel B860
Intel B760
Intel H770
Intel B660
Intel H670
Intel H610
Intel Z690
Intel H510
Intel Z590
Intel B560
Intel H470
Intel Z490
Intel H410
Intel B460
Intel H310
Intel B360
Intel B365
Intel X299
Intel Z390
Intel Z370
Intel H370
Intel Z270 H270
Intel B250
Intel Z170 H170
Intel H110
Intel H81
Intel B85
Intel H61
Intel B150
AMD Socket
AMD B850
AMD B840
AMD TRX50
AMD A620
AMD X870
AMD B650
AMD A520
AMD TRX40
AMD B550
AMD X570
AMD X470
AMD B450
AMD X370
AMD A320
AMD B350
AMD X399
AMD A88
AMD A68 A78
Cpu Air Coolers
CPU Liquid Coolers
Fans
AMD CPUs Desktop
AMD Server CPU
Intel Server CPU
Samsung CPUs
Other special CPUs

Solid State Drives
NVMe PCIe M.2
SATA 2.5inch
Hard Disk Drive
Server Hard Drives
NAS hard drive
Monitoring hard drive
Portable Solid State Drives
Memory Cards
USB Flash Drives
Nvidia GPU
RTX 50 series
RTX 30 series
GTX 16 series
GTX 10 series
RX 9000 series
RX 7000 series
RX 6000 series
RX 5000 series
RX 500 series
RTX 20 series
Rack server
Blade server
Tower server
Storage Server Solutions
Network switch
Workstation
Mobile Workstation
Server motherboard
Workstation Motherboard
SONY Gaming Console
ASUS Gaming Console
Lenovo Gaming Console
One XPlayer
Microsoft Gaming Console
XBOX Gaming Console
MSI Gaming Console
Motherboard
GTX TITAN
Computer Cases