Key Features

Unprecedented User Experience and Density for Graphics-Rich VDI

The NVIDIA A16, built on the NVIDIA Ampere architecture, combines with NVIDIA virtual GPU (vGPU) software to raise the bar on user experience for graphics-rich virtual desktop infrastructure (VDI). As more organizations turn to remote work as long term strategy, A16 with the NVIDIA Virtual PC (vPC) or Virtual Applications (vApps) software, enables knowledge workers across every industry to maximize productivity with performance indistinguishable from a native PC. NVIDIA A16 delivers up to 2X the user density, compared with the previous generation M10, reducing the amount of hardware resources needed and lowering the total cost of ownership (TCO).
 
When combined with NVIDIA RTX Virtual Workstation (vWS) software, the A16 enables affordable entry-level virtual workstations ideal for running workloads such as computer-aided design (CAD). The A16 features a unique quad-GPU board design enabling the provisioning of mixed user profile sizes, so IT can support light virtual PC workloads as well as users with larger memory and graphics requirements. Mixing user types on a board is also supported, enabling the provisioning of virtual PCs, virtual workstations, and even virtualized compute on a single board.

Superior Streaming Media Performance

The NVIDIA A16 features the highest number of video encoders and decoders with four on-chip hardware encoders (NVENC) and eight decoder (NVDEC) units in a single A16 board. This provides the best encode, decode and transcode performance translating to a maximized number of video streams per A16 board at an attractive price point versus alternative offerings.

Categories

Powered by NVIDIA Ampere Architecture

NVIDIA Ampere Architecture-Based CUDA Cores

Accelerate graphics workflows with the latest CUDA® cores for up to 2.5X single-precision floating-point (FP32) performance compared to the previous generation.

Second-Generation RT Cores

Produce more visually accurate renders faster with hardware-accelerated motion blur and up to 2X faster ray-tracing performance than the previous generation.

Third-Generation Tensor Cores

Boost AI and data science model training with up to 10X faster training performance compared to the previous generation with hardware-support for structural sparsity.

Virtualization-Ready

Repurpose your personal workstation into multiple high-performance virtual workstations with support for NVIDIA RTX Virtual Workstation (vWS) software.

Third-Generation NVIDIA NVLink

Scale memory and performance across multiple GPUs with NVIDIA® NVLink™ to tackle larger datasets, models, and scenes.

PCI Express Gen 4

Improve data-transfer speeds from CPU memory for data-intensive tasks with support for PCI Express Gen 4.

Power Efficiency

Leverage a dual-slot, power efficient design that’s 2.5X more power efficient than the previous generation and crafted to fit a wide range of workstations.

GPU Architecture NVIDIA Ampere architecture
GPU Memory 4x 16 GB GDDR6
Memory Bandwidth 4x 200 GB/s
Error-Correcting Code (ECC) Yes
NVIDIA Ampere Architecture-based CUDA Cores 4x 1280
NVIDIA Third-generation Tensor Cores 4x 40
NVIDIA Second-generation RT Cores 4x 10
FP32 | TF32 | TF321 (TFLOPS) 4x 4.5 | 4x 9 | 4x 18
FP16 | FP161 (TFLOPS) 4x 17.9 | 4x 35.9
INT8 | INT81 (TOPS) 4x 35.9 | 4x 71.8
System Interface PCIe Gen4 (x16)
Max Power Consumption 250W
Thermal Solution Passive
Form Factor Full height, full length (FHFL) Dual Slot
Power Connector 8-pin CPU
Encode/Decode Engines 4 NVENC/8 NVDEC (includes AV1 decode)
Secure and Measured Boot with Hardware Root of Trust for GPU Yes (optional)
vGPU Software Support NVIDIA Virtual PC (vPC), NVIDIA Virtual Applications (vApps), NVIDIA RTX Virtual Workstation (vWS), NVIDIA AI Enterprise, NVIDIA Virtual Compute Server (vCS)
Graphics APIs DirectX 12.0, Shader Model 5.1, OpenGL 4.6, Vulkan 1.1
Compute APIs CUDA, DirectCompute, OpenCL™, OpenACC®
MIG Support No

Speak with an expert to learn more.