NVIDIA L40S GPU
Experience breakthrough multi-workload performance with the NVIDIA L40S GPU. Combining powerful AI compute with best-in-class graphics and media acceleration, the L40S GPU is built to power the next generation of data center workloads—from generative AI and large language model (LLM) inference and training to 3D graphics, rendering, and video.
*Figures marked with an asterisk are with sparsity; the figure before the "|" is without sparsity.

GPU Architecture
NVIDIA Ada Lovelace Architecture
GPU Memory
48 GB GDDR6 with ECC
Memory Bandwidth
864 GB/s
Interconnect Interface
PCIe Gen4 x16: 64 GB/s bidirectional
NVIDIA Ada Lovelace Architecture-Based CUDA® Cores
18,176
NVIDIA Third-Generation RT Cores
142
NVIDIA Fourth-Generation Tensor Cores
568
RT Core Performance TFLOPS
212
FP32 TFLOPS
91.6
TF32 Tensor Core TFLOPS
183 | 366*
BFLOAT16 Tensor Core TFLOPS
362.05 | 733*
FP16 Tensor Core TFLOPS
362.05 | 733*
FP8 Tensor Core TFLOPS
733 | 1,466*
Tensor TOPS
- Peak INT8 Tensor TOPS: 733 | 1,466*
- Peak INT4 Tensor TOPS: 733 | 1,466*
Display Ports
4 x DP 1.4a
Max Power Consumption
350 W
Power Connector
16-pin
Form Factor
4.4" (H) x 10.5" (L), dual slot
Thermal
Passive
Virtual GPU (vGPU) Software Support
Yes
vGPU Profiles Supported
NVENC | NVDEC
3x | 3x (Includes AV1 Encode & Decode)
Secure Boot with Root of Trust
Yes
NEBS Ready
Yes / Level 3
Multi-Instance GPU (MIG) Support
No
NVIDIA® NVLink® Support
No
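
The figures above can be cross-checked against each other with a little arithmetic. The following is a minimal Python sketch, not an NVIDIA tool. It computes the with-sparsity to dense ratio for each Tensor Core precision, the number of CUDA and Tensor Cores per RT Core, and the clock speed implied by the FP32 figure. Two points are assumptions that do not come from this page: that each streaming multiprocessor contains exactly one RT Core (so the per-RT-Core counts can be read as per-SM counts), and that each CUDA core retires one fused multiply-add (2 FLOPs) per clock. The function names are illustrative only.

```python
# Figures copied from the specification list above.
CUDA_CORES = 18_176
RT_CORES = 142
TENSOR_CORES = 568
FP32_TFLOPS = 91.6

# (without sparsity, with sparsity) pairs from the list above.
TENSOR_RATES = {
    "TF32": (183, 366),
    "BFLOAT16": (362.05, 733),
    "FP16": (362.05, 733),
    "FP8": (733, 1466),
    "INT8": (733, 1466),
    "INT4": (733, 1466),
}


def sparsity_ratios(rates):
    """Return the with-sparsity / without-sparsity ratio per precision."""
    return {name: sparse / dense for name, (dense, sparse) in rates.items()}


def cores_per_rt_core(core_count, rt_cores=RT_CORES):
    """Cores per RT Core; equals cores per SM only under the ASSUMPTION
    that each SM holds exactly one RT Core."""
    return core_count / rt_cores


def implied_clock_ghz(fp32_tflops=FP32_TFLOPS, cuda_cores=CUDA_CORES,
                      flops_per_core_per_clock=2):
    """Clock implied by the FP32 figure, ASSUMING one FMA (2 FLOPs)
    per CUDA core per clock. This is a derived value, not a listed spec."""
    flops = fp32_tflops * 1e12
    return flops / (cuda_cores * flops_per_core_per_clock) / 1e9


if __name__ == "__main__":
    for name, ratio in sparsity_ratios(TENSOR_RATES).items():
        print(f"{name:9s} sparsity ratio: {ratio:.2f}")
    print(f"CUDA cores per RT Core:   {cores_per_rt_core(CUDA_CORES):.0f}")
    print(f"Tensor Cores per RT Core: {cores_per_rt_core(TENSOR_CORES):.0f}")
    print(f"Implied FP32 clock:       {implied_clock_ghz():.2f} GHz")
```

Under these assumptions, every with-sparsity figure comes out at roughly twice its dense counterpart, and the core counts divide evenly by the RT Core count. A check like this is mainly useful for catching transcription errors when copying the table.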