Key Features

Accelerate Next-Generation Workloads

  • NVIDIA Omniverse™ Enterprise
  • Rendering and 3D Graphics
  • High-Performance Virtual Workstations with NVIDIA RTX™ Virtual Workstation (RTX vWS) Software
  • AI Training and Data Science
  • Streaming and Video Content

Delivering unprecedented visual computing performance for the data center.

From virtual workstation applications to large-scale modeling and simulation, modern visual computing and scientific workflows are growing in both complexity and quantity. Enterprises need data center technology that can deliver extreme performance and scale with versatile capabilities to conquer the diverse computing demands of these increasingly complex workloads. The NVIDIA® L40 GPU delivers unprecedented visual computing performance for the data center, providing next-generation graphics, compute, and AI capabilities. Built on the revolutionary NVIDIA Ada Lovelace architecture, the NVIDIA L40 harnesses the power of the latest generation RT, Tensor, and CUDA cores to deliver groundbreaking visualization and compute performance for the most demanding data center workloads.

Powered by NVIDIA Ada Lovelace Architecture

NVIDIA CUDA Cores

Double-speed processing for single-precision floating point (FP32) operations provides significant performance improvements for graphics and simulation workflows, such as complex 3D computer-aided design (CAD) and computer-aided engineering (CAE).

Third-Generation RT Cores

With up to 2X the throughput of the previous generation, third-generation RT Cores deliver massive speedups for workloads like photorealistic rendering of movie content, architectural design evaluations, and virtual prototyping of product designs. This technology also accelerates the rendering of ray-traced motion blur with greater visual accuracy.

Fourth-Generation Tensor Cores

Fourth-generation Tensor Cores provide faster AI compute performance, delivering more than 2X the performance of the previous generation. These new Tensor Cores support acceleration of the FP8 precision data type and provide independent floating-point and integer data paths to speed up execution of mixed floating-point and integer calculations.

48 Gigabytes (GB) of GPU Memory

With 48GB of GDDR6 memory, the L40 gives data scientists, engineers, and creative professionals the large memory necessary to work with massive datasets and workloads like rendering, data science, and simulation.
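
As a rough illustration of what 48GB means for dataset and model sizing, the sketch below estimates how many values fit in memory at different precisions. It assumes "48GB" means 48 x 10^9 bytes and ignores memory used by the driver, framebuffers, and other allocations, so real capacity is lower. The names `MEMORY_BYTES`, `BYTES_PER_ELEMENT`, and `max_elements` are illustrative only.

```python
# Assumption: "48GB" is treated as 48 * 10**9 bytes. Usable capacity is lower
# in practice because other allocations also consume GPU memory.
MEMORY_BYTES = 48 * 10**9

# Storage size of one value at each precision, in bytes.
BYTES_PER_ELEMENT = {"FP32": 4, "FP16": 2, "FP8": 1}


def max_elements(memory_bytes: int, bytes_per_element: int) -> int:
    """Upper bound on how many values of a given size fit in memory."""
    return memory_bytes // bytes_per_element


capacity = {
    name: max_elements(MEMORY_BYTES, size)
    for name, size in BYTES_PER_ELEMENT.items()
}

for name, count in capacity.items():
    print(f"{name}: up to {count / 1e9:.0f} billion elements")
```

Halving the precision doubles the number of values that fit, which is one reason lower-precision data types matter for large workloads.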

Virtualization-Ready

Support for NVIDIA RTX Virtual Workstation (vWS) software allows a single GPU to be shared across multiple high-performance virtual workstation instances, enabling remote users to share resources to drive high-end design, AI, and compute workloads.

PCIe Gen 4

PCI Express Gen 4 support provides double the bandwidth of PCIe Gen 3, improving data-transfer speeds from CPU memory for data-intensive tasks like AI and data science.
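
The doubling can be checked with a short calculation. This is a minimal sketch that assumes the standard PCIe signaling rates (8 GT/s per lane for Gen 3, 16 GT/s per lane for Gen 4) and the 128b/130b line encoding both generations use. The function name `pcie_bandwidth_gbs` is illustrative only.

```python
def pcie_bandwidth_gbs(gt_per_s: float, lanes: int, encoding: float = 128 / 130) -> float:
    """Approximate usable bandwidth in one direction, in GB/s.

    gt_per_s: per-lane transfer rate in gigatransfers per second.
    encoding: fraction of transferred bits that carry payload (128b/130b).
    """
    bits_per_second = gt_per_s * lanes * encoding  # in Gbit/s
    return bits_per_second / 8                     # convert bits to bytes


gen3_x16 = pcie_bandwidth_gbs(8.0, 16)
gen4_x16 = pcie_bandwidth_gbs(16.0, 16)

print(f"Gen 3 x16: {gen3_x16:.1f} GB/s per direction")
print(f"Gen 4 x16: {gen4_x16:.1f} GB/s per direction")
print(f"Gen 4 x16: {2 * gen4_x16:.1f} GB/s bidirectional")
print(f"Gen 4 / Gen 3 ratio: {gen4_x16 / gen3_x16:.1f}")
```

A 64GB/s bidirectional figure for a Gen 4 x16 link corresponds to the raw rate before encoding overhead (16 GT/s x 16 lanes / 8 = 32GB/s in each direction). Protocol overhead reduces achievable throughput further.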

GPU Architecture: NVIDIA Ada Lovelace architecture
GPU Memory: 48GB GDDR6 with ECC
Memory Bandwidth: 864GB/s
Interconnect Interface: PCIe Gen4 x16, 64GB/s bidirectional
NVIDIA Ada Lovelace Architecture-Based CUDA Cores: 18,176
NVIDIA Fourth-Generation Tensor Cores: 568
NVIDIA Third-Generation RT Cores: 142
RT Core Performance: 209 TFLOPS
FP32: 90.5 TFLOPS
TF32 Tensor Core: 90.5 | 181** TFLOPS
BFLOAT16 Tensor Core: 181.05 | 362.1** TFLOPS
FP16 Tensor Core: 181.05 | 362.1** TFLOPS
FP8 Tensor Core: 362 | 724** TFLOPS
Peak INT8 Tensor: 362 | 724** TOPS
Peak INT4 Tensor: 724 | 1448** TOPS
Form Factor: 4.4" (H) x 10.5" (L), dual slot
Display Ports: 4 x DisplayPort 1.4a
Max Power Consumption: 300W
Power Connector: 16-pin
Thermal: Passive
Virtual GPU (vGPU) Software Support: Yes
vGPU Profiles Supported: See Virtual GPU Licensing Guide
NVENC | NVDEC: 3x | 3x (includes AV1 encode and decode)
Secure Boot with Root of Trust: Yes
NEBS Ready: Level 3
MIG Support: No
NVLink Support: No
* Preliminary specifications, subject to change. ** With Sparsity
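
Several of the figures above can be cross-checked against each other. The sketch below is a consistency check, not an official derivation: it assumes each CUDA core completes one fused multiply-add (counted as 2 floating-point operations) per clock, and from that infers the clock speed implied by the FP32 figure. All values are copied from the specifications above; the variable names are illustrative only.

```python
# Values copied from the specification list.
CUDA_CORES = 18_176
TENSOR_CORES = 568
RT_CORES = 142
FP32_TFLOPS = 90.5

# Assumption: one fused multiply-add per core per clock, counted as 2 FLOPs.
FLOPS_PER_CORE_PER_CLOCK = 2

# Clock speed implied by the peak FP32 figure, in GHz.
implied_clock_ghz = (
    FP32_TFLOPS * 1e12 / (CUDA_CORES * FLOPS_PER_CORE_PER_CLOCK) / 1e9
)

# (dense, with sparsity) pairs for each Tensor Core precision.
DENSE_VS_SPARSE = {
    "TF32": (90.5, 181.0),
    "BFLOAT16": (181.05, 362.1),
    "FP16": (181.05, 362.1),
    "FP8": (362.0, 724.0),
    "INT8": (362.0, 724.0),
    "INT4": (724.0, 1448.0),
}
sparsity_ratios = {
    name: sparse / dense for name, (dense, sparse) in DENSE_VS_SPARSE.items()
}

# Core-count ratios taken directly from the list.
cuda_per_rt = CUDA_CORES / RT_CORES
tensor_per_rt = TENSOR_CORES / RT_CORES

print(f"Implied clock: {implied_clock_ghz:.2f} GHz")
print(f"Sparsity speedups: {sparsity_ratios}")
print(f"CUDA cores per RT Core: {cuda_per_rt:.0f}")
print(f"Tensor Cores per RT Core: {tensor_per_rt:.0f}")
```

Under that assumption the FP32 figure implies a clock of roughly 2.49 GHz, every "with sparsity" figure is exactly twice its dense counterpart, and the core counts work out to 128 CUDA cores and 4 Tensor Cores per RT Core.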
