NVIDIA® Tesla® P40

Experience Maximum Inference Throughput
In the new era of AI and intelligent machines, deep learning is shaping our world like no other computing model in history. GPUs powered by the revolutionary NVIDIA® Pascal™ architecture provide the computational engine for this era, enabling amazing user experiences by accelerating deep learning applications at scale.

The NVIDIA® Tesla® P40 is purpose-built to deliver maximum throughput for deep learning deployment. With 47 TOPS (tera-operations per second) of INT8 inference performance per GPU, a single server with 8 Tesla P40s delivers the performance of over 140 CPU servers.
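As a rough illustration of the per-server figure, the sketch below multiplies the quoted per-GPU INT8 rate by the number of GPUs. It assumes perfect linear scaling across GPUs, which real inference workloads may not achieve; the function and constant names are hypothetical and chosen only for this example.

```python
# Figures quoted in the text above.
INT8_TOPS_PER_GPU = 47   # peak INT8 tera-operations per second, per GPU
GPUS_PER_SERVER = 8      # GPUs in the example server


def aggregate_peak_tops(tops_per_gpu: float, gpu_count: int) -> float:
    """Peak aggregate throughput, ASSUMING perfect linear scaling across GPUs."""
    return tops_per_gpu * gpu_count


server_tops = aggregate_peak_tops(INT8_TOPS_PER_GPU, GPUS_PER_SERVER)
print(f"Peak INT8 throughput per 8-GPU server: {server_tops} TOPS")
```

This is a theoretical peak, not a measured result; delivered throughput depends on the model, batch size, and software stack.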

As models increase in accuracy and complexity, CPUs are no longer capable of delivering an interactive user experience. The NVIDIA® Tesla® P40 delivers over 30X lower latency than a CPU for real-time responsiveness in even the most complex models.

Specifications: NVIDIA® Tesla® P40

GPU Architecture: NVIDIA® Pascal™
Single-Precision Performance: 12 TeraFLOPS*
Integer Operations (INT8): 47 TOPS* (Tera-Operations per Second)
GPU Memory: 24 GB
Memory Bandwidth: 346 GB/s
System Interface: PCI Express 3.0 x16
Form Factor: 4.4” H x 10.5” L, Dual Slot, Full Height
Max Power: 250 W
Enhanced Programmability with Page Migration Engine: Yes
ECC Protection: Yes
Server-Optimized for Data Center Deployment: Yes
Hardware-Accelerated Video Engine: 1x Decode Engine, 2x Encode Engine
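
The specification figures above can be combined into a few derived ratios that are often useful when sizing a deployment. The sketch below is only arithmetic on the listed peak numbers; it assumes that 1 GB/s means 10^9 bytes per second and that Max Power is the relevant power figure. The dictionary keys and function name are hypothetical.

```python
# Peak figures copied from the specifications above.
P40_SPECS = {
    "fp32_tflops": 12,      # Single-Precision Performance
    "int8_tops": 47,        # Integer Operations (INT8)
    "mem_bw_gb_s": 346,     # Memory Bandwidth
    "max_power_w": 250,     # Max Power
}


def derived_ratios(specs: dict) -> dict:
    """Compute ratios from peak figures; ASSUMES 1 GB/s = 1e9 bytes/s."""
    ops_per_second = specs["int8_tops"] * 1e12
    bytes_per_second = specs["mem_bw_gb_s"] * 1e9
    return {
        # INT8 operations available per byte of memory traffic at peak rates
        "int8_ops_per_byte": ops_per_second / bytes_per_second,
        # Peak throughput per watt at the listed maximum power
        "int8_tops_per_watt": specs["int8_tops"] / specs["max_power_w"],
        "fp32_tflops_per_watt": specs["fp32_tflops"] / specs["max_power_w"],
    }


ratios = derived_ratios(P40_SPECS)
for name, value in ratios.items():
    print(f"{name}: {value:.3f}")
```

A high operations-per-byte ratio suggests that workloads with little data reuse are likely to be limited by memory bandwidth rather than by compute, so these peak ratios are an upper bound rather than a prediction.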
