3 NVIDIA L40 | Taknet

NVIDIA L40

Key Features

Accelerate Next-Generation Workloads

NVIDIA Omniverse™ Enterprise
Rendering and 3D Graphics
High-Performance Virtual Workstations with NVIDIA RTX™ Virtual Workstation (RTX vWS) Software
AI Training and Data Science
Streaming and Video Content

Delivering unprecedented visual computing performance for the data center.

From virtual workstation applications to large-scale modeling and simulation, modern visual computing and scientific workflows are growing in both complexity and quantity. Enterprises need data center technology that delivers extreme performance and scales to meet the diverse computing demands of these increasingly complex workloads. The NVIDIA® L40 GPU delivers unprecedented visual computing performance for the data center, providing next-generation graphics, compute, and AI capabilities. Built on the NVIDIA Ada Lovelace architecture, the NVIDIA L40 uses the latest-generation RT, Tensor, and CUDA cores to deliver visualization and compute performance for the most demanding data center workloads.

Powered by NVIDIA Ada Lovelace Architecture

NVIDIA CUDA Cores

Double-speed processing for single-precision floating point (FP32) operations provides significant performance improvements for graphics and simulation workflows, such as complex 3D computer-aided design (CAD) and computer-aided engineering (CAE).

Third-Generation RT Cores

With up to 2X the throughput over the previous generation, third-generation RT Cores deliver massive speedups for workloads like photorealistic rendering of movie content, architectural design evaluations, and virtual prototyping of product designs. This technology also accelerates the rendering of ray-traced motion blur with greater visual accuracy.

Fourth-Generation Tensor Cores

Fourth-generation Tensor Cores provide faster AI compute performance, delivering more than 2X the performance of the previous generation. These new Tensor Cores support acceleration of the FP8 precision data type and provide independent floating-point and integer data paths to speed up execution of mixed floating point and integer calculations.

48 Gigabytes (GB) of GPU Memory

With 48GB of GDDR6 memory, the NVIDIA L40 gives data scientists, engineers, and creative professionals the large memory necessary to work with massive datasets and workloads like rendering, data science, and simulation.
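
As a rough illustration of what 48GB means in practice, the sketch below checks whether a dense array of a given size fits at different precisions. It is a back-of-the-envelope estimate only: it assumes "GB" means 10^9 bytes, and the `headroom` parameter is a hypothetical allowance for framework and driver overhead, not a figure from this page.

```python
# Rough capacity check against the 48GB figure quoted above.
# Assumptions (not from the datasheet): GB = 10**9 bytes, and `headroom`
# is a hypothetical fraction of memory reserved for runtime overhead.
L40_MEMORY_BYTES = 48 * 10**9

BYTES_PER_ELEMENT = {"fp32": 4, "fp16": 2, "fp8": 1}


def fits_in_gpu_memory(num_elements, dtype, headroom=0.0):
    """Return True if num_elements values of dtype fit in the usable memory."""
    usable_bytes = L40_MEMORY_BYTES * (1.0 - headroom)
    return num_elements * BYTES_PER_ELEMENT[dtype] <= usable_bytes


if __name__ == "__main__":
    n = 13 * 10**9  # 13 billion values
    for dtype in ("fp32", "fp16", "fp8"):
        print(dtype, fits_in_gpu_memory(n, dtype))
```

Halving the element width doubles the number of values that fit, which is one reason the lower-precision formats listed in the specifications matter for large datasets.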

Virtualization-Ready

Support for NVIDIA RTX Virtual Workstation (vWS) software allows a personal workstation to be repurposed into multiple high-performance virtual workstation instances, enabling remote users to share resources to drive high-end design, AI, and compute workloads.

PCIe Gen 4

PCI Express Gen 4 support provides double the bandwidth of PCIe Gen 3, improving data-transfer speeds from CPU memory for data-intensive tasks like AI and data science.
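
The doubling can be worked out from the per-lane transfer rates. The sketch below assumes the standard PCIe figures of 8 GT/s per lane for Gen 3 and 16 GT/s for Gen 4, both with 128b/130b encoding. The function name is illustrative, and sustained throughput in a real system will be lower because of protocol overhead.

```python
# Back-of-the-envelope PCIe bandwidth, assuming 8 GT/s (Gen 3) and
# 16 GT/s (Gen 4) per lane with 128b/130b line encoding.
ENCODING_EFFICIENCY = 128 / 130


def pcie_gb_per_s(transfer_rate_gt_s, lanes, efficiency=ENCODING_EFFICIENCY):
    """One-direction bandwidth in GB/s (10**9 bytes per second)."""
    bits_per_second = transfer_rate_gt_s * 1e9 * efficiency * lanes
    return bits_per_second / 8 / 1e9


gen3_x16 = pcie_gb_per_s(8.0, 16)
gen4_x16 = pcie_gb_per_s(16.0, 16)

# Raw signalling rate in both directions, ignoring encoding overhead.
gen4_raw_bidirectional = 2 * pcie_gb_per_s(16.0, 16, efficiency=1.0)

if __name__ == "__main__":
    print(f"Gen 3 x16: {gen3_x16:.2f} GB/s per direction")
    print(f"Gen 4 x16: {gen4_x16:.2f} GB/s per direction")
    print(f"Gen 4 x16 raw, both directions: {gen4_raw_bidirectional:.0f} GB/s")
```

Under these assumptions the raw bidirectional figure comes out at 64 GB/s, which matches the interconnect row in the specifications. After encoding overhead, the usable rate is roughly 31.5 GB/s in each direction.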

GPU Architecture	NVIDIA Ada Lovelace architecture
GPU Memory	48GB GDDR6 with ECC
Memory bandwidth	864GB/s
Interconnect Interface	PCIe Gen4 x16: 64GB/s bi-directional
NVIDIA Ada Lovelace Architecture-Based CUDA Cores	18,176
NVIDIA Fourth-Generation Tensor Cores	568
NVIDIA Third-Generation RT Cores	142
RT Core performance TFLOPS	209
FP32 TFLOPS	90.5
TF32 Tensor Core TFLOPS	90.5 | 181**
BFLOAT16 Tensor Core TFLOPS	181.05 | 362.1**
FP16 Tensor Core TFLOPS	181.05 | 362.1**
FP8 Tensor Core TFLOPS	362 | 724**
Peak INT8 Tensor TOPS	362 | 724**
Peak INT4 Tensor TOPS	724 | 1448**
Form Factor	4.4” (H) x 10.5” (L) – dual slot
Display Ports	4 x DisplayPort 1.4a
Max Power Consumption	300W
Power Connector	16-pin
Thermal	Passive
Virtual GPU (vGPU) software support	Yes
vGPU Profiles Supported	See Virtual GPU Licensing Guide
NVENC | NVDEC	3x | 3x (Includes AV1 Encode & Decode)
Secure Boot with Root of Trust	Yes
NEBS Ready	Level 3
MIG Support	No
NVLink Support	No

* Preliminary specifications, subject to change.
** With sparsity.
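
Several rows of the specification table can be cross-checked against one another. The sketch below is an informal sanity check, not a vendor formula. It assumes one RT Core per streaming multiprocessor (SM) and two FP32 operations per CUDA core per clock (one fused multiply-add), then derives the per-SM core counts and the clock speed implied by the FP32 figure. It also confirms that each sparsity figure is twice its dense counterpart.

```python
# Cross-check of figures copied from the specification table.
# Assumptions (not stated on this page): one RT Core per SM, and
# 2 FP32 operations per CUDA core per clock (a fused multiply-add).
CUDA_CORES = 18176
TENSOR_CORES = 568
RT_CORES = 142
FP32_TFLOPS = 90.5

# (dense, with sparsity) pairs as listed in the table.
TENSOR_THROUGHPUT = {
    "TF32": (90.5, 181.0),
    "BF16": (181.05, 362.1),
    "FP16": (181.05, 362.1),
    "FP8": (362.0, 724.0),
    "INT8": (362.0, 724.0),
    "INT4": (724.0, 1448.0),
}

cuda_cores_per_sm = CUDA_CORES / RT_CORES
tensor_cores_per_sm = TENSOR_CORES / RT_CORES

# Clock implied by peak FP32 = cores * 2 ops * clock.
implied_clock_ghz = FP32_TFLOPS * 1e12 / (2 * CUDA_CORES) / 1e9

sparsity_ratios = {
    name: sparse / dense for name, (dense, sparse) in TENSOR_THROUGHPUT.items()
}

if __name__ == "__main__":
    print(f"CUDA cores per SM:   {cuda_cores_per_sm:.0f}")
    print(f"Tensor Cores per SM: {tensor_cores_per_sm:.0f}")
    print(f"Implied clock:       {implied_clock_ghz:.2f} GHz")
    for name, ratio in sparsity_ratios.items():
        print(f"{name}: sparsity speedup {ratio:.1f}x")
```

Under these assumptions the table is internally consistent: 128 CUDA cores and 4 Tensor Cores per SM, an implied clock of about 2.49 GHz, and a uniform 2x gain from sparsity.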
