NVIDIA A2 Tensor Core GPU
Entry-level GPU that brings NVIDIA AI to any server.
Versatile Entry-Level Inference
The NVIDIA A2 Tensor Core GPU provides entry-level inference with low power, a small footprint, and high performance for NVIDIA AI at the edge. Featuring a low-profile PCIe Gen4 card and a low 40-60W configurable thermal design power (TDP) capability, the A2 brings versatile inference acceleration to any server for deployment at scale.
Up to 20X More Inference Performance
AI inference is deployed to enhance consumer lives with smart, real-time experiences and to gain insights from trillions of end-point sensors and cameras. Compared to CPU-only servers, edge and entry-level servers with NVIDIA A2 Tensor Core GPUs offer up to 20X more inference performance, instantly upgrading any server to handle modern AI.
Computer Vision
(EfficientDet-DO)
Natural Language Processing
(BERT-Large)
Text-to-Speech
(Tacotron2 + Waveglow)
Comparisons of one NVIDIA A2 Tensor Core GPU versus a dual-socket Xeon Gold 6330N CPU
System Configuration: [CPU: HPE DL380 Gen10 Plus, 2S Xeon Gold 6330N @2.2GHz, 512GB DDR4]
NLP: BERT-Large (Sequence length: 384, SQuAD: v1.1) | TensorRT 8.2, Precision: INT8, BS:1 (GPU) | OpenVINO 2021.4, Precision: INT8, BS:1 (CPU)
Text-to-Speech: Tacotron2 + Waveglow end-to-end pipeline (input length: 128) | PyTorch 1.9, Precision: FP16, BS:1 (GPU) | PyTorch 1.9, Precision: FP32, BS:1 (CPU)
Computer Vision: EfficientDet-D0 (COCO, 512x512) | TensorRT 8.2, Precision: INT8, BS:8 (GPU) | OpenVINO 2021.4, Precision: INT8, BS:8 (CPU)
Higher IVA Performance for the
Intelligent Edge
Servers equipped with NVIDIA A2 GPUs offer up to 1.3X more performance in intelligent edge use cases, including smart cities, manufacturing, and retail. NVIDIA A2 GPUs running IVA workloads deliver more efficient deployments with up to 1.6X better price-performance and 10 percent better energy efficiency than previous GPU generations.
IVA Performance (Normalized)
System Configuration: [Supermicro SYS-1029GQ-TRT, 2S Xeon Gold 6240 @2.6GHz, 512GB DDR4, 1x NVIDIA A2 OR 1x NVIDIA T4] | Measured performance with Deepstream 5.1. Networks: ShuffleNet-v2 (224x224), MobileNet-v2 (224x224). | Pipeline represents end-to-end performance with video capture and decode, pre-processing, batching, inference, and post-processing.
Optimized for Any Server
NVIDIA A2 is optimized for inference workloads and deployments in entry-level servers constrained by space and thermal requirements, such as 5G edge and industrial environments. A2 delivers a low-profile form factor operating in a low-power envelope, from a TDP of 60W down to 40W, making it ideal for any server.
Lower Power and Configurable TDP
Leading AI Inference Performance Across Cloud, Data Center, and Edge
AI inference continues to drive breakthrough innovation across industries, including consumer internet, healthcare and life sciences, financial services, retail, manufacturing, and supercomputing. A2’s small form factor and low power combined with the NVIDIA A100 and A30 Tensor Core GPUs deliver a complete AI inference portfolio across cloud, data center, and edge. A2 and the NVIDIA AI inference portfolio ensure AI applications deploy with fewer servers and less power, resulting in faster insights with substantially lower costs.
Ready for Enterprise Utilization
NVIDIA AI Enterprise
NVIDIA AI Enterprise, an end-to-end cloud-native suite of AI and data analytics software, is certified to run on A2 in hypervisor-based virtual infrastructure with VMware vSphere. This enables management and scaling of AI and inference workloads in a hybrid cloud environment.
Mainstream NVIDIA-Certified Systems
NVIDIA-Certified Systems™ with NVIDIA A2 bring together compute acceleration and high-speed, secure NVIDIA networking in enterprise data center servers, built and sold by NVIDIA’s OEM partners. This program lets customers identify, acquire, and deploy systems for traditional and diverse modern AI applications from the NVIDIA NGC™ catalog on a single high-performance, cost-effective, and scalable infrastructure.
Powered by the NVIDIA Ampere Architecture
The NVIDIA Ampere architecture is designed for the age of elastic computing, delivering the performance and acceleration needed to power modern enterprise applications. Explore the heart of the world’s highest-performing, elastic data centers.
Technical Specifications
| Peak FP32 | 4.5 TF | |
| TF32 Tensor Core | 9 TF | 18 TF¹ | |
| BFLOAT16 Tensor Core | 18 TF | 36 TF¹ | |
| Peak FP16 Tensor Core | 18 TF | 36 TF¹ | |
| Peak INT8 Tensor Core | 36 TOPS | 72 TOPS¹ | |
| Peak INT4 Tensor Core | 72 TOPS | 144 TOPS¹ | |
| RT Cores | 10 | |
| Media engines | 1 video encoder 2 video decoders (includes AV1 decode) | |
| GPU memory | 16GB GDDR6 | |
| GPU memory bandwidth | 200GB/s | |
| Interconnect | PCIe Gen4 x8 | |
| Form factor | 1-slot, low-profile PCIe | |
| Max thermal design power (TDP) | 40–60W (configurable) | |
| Virtual GPU (vGPU) software support² | NVIDIA Virtual PC (vPC), NVIDIA Virtual Applications (vApps), NVIDIA RTX Virtual Workstation (vWS), NVIDIA AI Enterprise, NVIDIA Virtual Compute Server (vCS) | |
1 With sparsity
2 Supported in future vGPU release

MacBook
iPad
Apple Watch
Airpods
iMac
Studio Display
iphone
Gaming Laptop

Gaming Desktop
DDR5 Desktop
DDR5 Laptop
DDR5 Server
DDR4 Desktop
DDR4 Laptop
DDR4 Server
DDR3 Desktop
DDR3 Laptop
DDR3 Server
Intel Socket
Intel Z890
Intel B860
Intel B760
Intel H770
Intel B660
Intel H670
Intel H610
Intel Z690
Intel H510
Intel Z590
Intel B560
Intel H470
Intel Z490
Intel H410
Intel B460
Intel H310
Intel B360
Intel B365
Intel X299
Intel Z390
Intel Z370
Intel H370
Intel Z270 H270
Intel B250
Intel Z170 H170
Intel H110
Intel H81
Intel B85
Intel H61
Intel B150
AMD Socket
AMD B850
AMD B840
AMD TRX50
AMD A620
AMD X870
AMD B650
AMD A520
AMD TRX40
AMD B550
AMD X570
AMD X470
AMD B450
AMD X370
AMD A320
AMD B350
AMD X399
AMD A88
AMD A68 A78
Cpu Air Coolers
CPU Liquid Coolers
Fans
AMD CPUs Desktop
AMD Server CPU
Intel Server CPU
Samsung CPUs
Other special CPUs

Solid State Drives
NVMe PCIe M.2
SATA 2.5inch
Hard Disk Drive
Server Hard Drives
NAS hard drive
Monitoring hard drive
Portable Solid State Drives
Memory Cards
USB Flash Drives
Nvidia GPU
RTX 50 series
RTX 30 series
GTX 16 series
GTX 10 series
RX 9000 series
RX 7000 series
RX 6000 series
RX 5000 series
RX 500 series
RTX 20 series
Rack server
Blade server
Tower server
Storage Server Solutions
Network switch
Workstation
Mobile Workstation
Server motherboard
Workstation Motherboard
SONY Gaming Console
ASUS Gaming Console
Lenovo Gaming Console
One XPlayer
Microsoft Gaming Console
XBOX Gaming Console
MSI Gaming Console
Motherboard
GTX TITAN
Computer Cases