NVIDIA L40S

Unparalleled AI and graphics performance for the data center.

The Most Powerful Universal GPU

Experience breakthrough multi-workload performance with the NVIDIA L40S GPU. Combining powerful AI compute with best-in-class graphics and media acceleration, the L40S GPU is built to power the next generation of data center workloads—from generative AI and large language model (LLM) inference and training to 3D graphics, rendering, and video.

Highlights

Universal Performance

Tensor Performance

1,466 TFLOPS¹

RT Core Performance

212 TFLOPS

Single-Precision Performance

91.6 TFLOPS

¹ Peak rates are based on GPU boost clock. The Tensor figure is the FP8 rate with sparsity, as listed under Specifications.

Specifications

NVIDIA L40S GPU

FP32	91.6 teraFLOPS
TF32 Tensor Core	366 teraFLOPS*
FP16 Tensor Core	733 teraFLOPS*
FP8 Tensor Core	1,466 teraFLOPS*
RT Core Performance	212 teraFLOPS
Max Power Consumption	350 W
* With sparsity.
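
As a rough aid for reading the table above, the following Python sketch derives two figures from it: an estimated dense (non-sparse) rate for each starred entry, and the quoted peak rate per watt at the 350 W maximum. The factor of two for sparsity is an assumption rather than something stated on this page, and the names SPECS, SPARSITY_SPEEDUP, dense_tflops, and tflops_per_watt are hypothetical, chosen only for illustration.

```python
# Peak rates copied from the specification table, in teraFLOPS.
# The boolean marks figures the table quotes "with sparsity".
SPECS = {
    "FP32": (91.6, False),
    "TF32": (366.0, True),
    "FP16": (733.0, True),
    "FP8": (1466.0, True),
}

MAX_POWER_W = 350.0  # max power consumption from the table

# Assumption (not stated in the source): sparsity doubles the peak rate,
# so the dense rate is half of the quoted sparse figure.
SPARSITY_SPEEDUP = 2.0


def dense_tflops(name):
    """Estimated dense peak rate for one precision, under the assumption above."""
    rate, quoted_with_sparsity = SPECS[name]
    return rate / SPARSITY_SPEEDUP if quoted_with_sparsity else rate


def tflops_per_watt(name):
    """Quoted peak rate divided by the maximum board power."""
    return SPECS[name][0] / MAX_POWER_W


if __name__ == "__main__":
    for name in SPECS:
        print(
            f"{name}: dense ~{dense_tflops(name):.1f} TFLOPS, "
            f"quoted {tflops_per_watt(name):.2f} TFLOPS/W"
        )
```

These are peak, boost-clock numbers, so the derived values are upper bounds for comparison, not predictions of sustained throughput.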

Features

Powered by the NVIDIA Ada Lovelace Architecture

Fourth-Generation Tensor Cores

Third-Generation RT Cores

CUDA Cores

Transformer Engine

Efficiency and Security

DLSS 3

Workloads

Multi-Workload Acceleration

Generative AI

Rendering and 3D Graphics

NVIDIA OVX L40S

LLM Training and Inference

NVIDIA Omniverse

Performance

Breakthrough Performance

Image Generative AI

Large Language Model (LLM) Inference
