NVIDIA A2 Tensor Core GPU
The NVIDIA® A2 Tensor Core GPU provides entry-level inference with low power, a small footprint, and high performance for NVIDIA AI at the edge. Featuring a low-profile PCIe Gen4 card and a low 40-60W configurable thermal design power (TDP) capability, the A2 brings versatile inference acceleration to any server for deployment at scale.
1 With sparsity
2 Supported in future vGPU release
Peak FP3
4.5 TF
TF32 Tensor Core
9 TF | 18 TF¹
BFLOAT16 Tensor Core
18 TF | 36 TF¹
Peak FP16 Tensor Core
18 TF | 36 TF¹
Peak INT8 Tensor Core
36 TOPS | 72 TOPS¹
Peak INT4 Tensor Core
72 TOPS | 144 TOPS¹
RT Cores
10
Media engines
1 video encoder 2 video decoders (includes AV1 decode)
GPU memory
16GB GDDR6
GPU memory bandwidth
200GB/s
Interconnect
PCIe Gen4 x8
Form factor
1-slot, low-profile PCIe
Max thermal design power (TDP)
40–60W (configurable)
Virtual GPU (vGPU) software support²
NVIDIA Virtual PC (vPC), NVIDIA Virtual Applications (vApps), NVIDIA RTX Virtual Workstation (vWS), NVIDIA AI Enterprise, NVIDIA Virtual Compute Server (vCS)