NVIDIA Quadro GP100

NVIDIA Quadro GP100 - 16GB HBM2 Workstation Graphics Card (5.2 TFLOPS FP64, 10.3 TFLOPS FP32) - Retail

NVIDIA's GP100 is the new GPU compute option in the Quadro lineup. Built on the same foundations as the NVIDIA Tesla P100, it gives professionals supercomputer levels of performance for scientific and analysis workflows in a desktop workstation.

Key Features

  • 3584 CUDA Cores
  • 16GB HBM2 Memory
  • 5.2 TFLOPS FP64 Performance
  • 10.3 TFLOPS FP32 Performance
  • 20.7 TFLOPS FP16 Performance
  • 4x DisplayPort 1.4 & 1x DVI-D Connectors


What is the NVIDIA Quadro GP100?

Until now, the Pascal architecture combined with second-generation high-bandwidth memory (HBM2) was used only in rack-based server environments, in NVIDIA’s latest Tesla P100. NVIDIA have now released the Quadro GP100 to fill this gap in the market for desktop workstation users who want to run deep learning, FEA, simulation or analysis workflows.

The Quadro GP100 is based on the same architecture as the Tesla P100, which was initially released as the compute card installed in NVIDIA’s DGX-1 supercomputer. A few months later, NVIDIA introduced a PCI Express version of the Tesla P100, available with 12 or 16 GB of HBM2 memory.

Now NVIDIA have broadened the availability of this supercomputing performance by offering the actively cooled Quadro GP100, which is well suited to office-based workstations. Some fields need more performance than existing workstation cards deliver, and the demand for computing resources in workstations continues to grow. Until now, the fastest professional graphics cards in the family were the Quadro P6000 and P5000, based on the GP102 and GP104 GPUs respectively. Those cards suit visualisation and Media & Entertainment work, whereas the Quadro GP100 is aimed at big data computation and scientific workloads.

How Does it Perform?

The graphics processor has 3,584 CUDA cores (stream processors). Peak performance is 20.7 TFLOPS for FP16, 10.3 TFLOPS for FP32 and 5.2 TFLOPS for FP64 double-precision computing. As a result, the Quadro GP100 outperforms the PCI Express version of the Tesla P100 and is currently the most powerful workstation compute card in the world for FP64 double-precision performance.
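
The peak figures above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes each CUDA core completes one FP32 fused multiply-add per clock and that an FMA counts as two floating-point operations. The clock speed is not stated on this page, so the code derives the clock implied by the quoted numbers rather than asserting an official value. The function names are hypothetical.

```python
# Peak figures quoted on this page for the Quadro GP100.
CUDA_CORES = 3584
PEAK_TFLOPS = {"FP16": 20.7, "FP32": 10.3, "FP64": 5.2}

# Assumption (not from this page): one fused multiply-add per core per
# clock, counted as two floating-point operations.
FLOPS_PER_CORE_PER_CLOCK = 2


def implied_clock_ghz(peak_tflops: float, cores: int) -> float:
    """Clock in GHz at which `cores` FMA units would reach `peak_tflops`."""
    flops_per_clock = cores * FLOPS_PER_CORE_PER_CLOCK
    return peak_tflops * 1e12 / flops_per_clock / 1e9


def ratio_to_fp32(precision: str) -> float:
    """Quoted peak throughput of `precision` relative to FP32."""
    return PEAK_TFLOPS[precision] / PEAK_TFLOPS["FP32"]


if __name__ == "__main__":
    clock = implied_clock_ghz(PEAK_TFLOPS["FP32"], CUDA_CORES)
    print(f"Implied FP32 clock: {clock:.3f} GHz")
    for name in ("FP16", "FP64"):
        print(f"{name} / FP32 throughput ratio: {ratio_to_fp32(name):.2f}")
```

The ratios come out close to 2 for FP16 and close to 0.5 for FP64, which matches the way the three peak figures are presented: each halving of precision roughly doubles throughput.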

The 16 GB of HBM2 memory is connected through a 4,096-bit interface with a bandwidth of 717 GB/s. The card's video outputs are 4x DisplayPort 1.4 and 1x DVI, which can drive four displays at 4096 x 2160 pixels and 120 Hz, or four displays at 5120 x 2880 pixels and 60 Hz.
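
To put the memory figures in perspective, the following sketch works through two quick calculations. It uses the 4096-bit interface width, the 717 GB/s bandwidth from the specification list and the 16 GB capacity, and it assumes 1 GB = 10^9 bytes for both capacity and bandwidth. The function names are hypothetical, and the results are theoretical peaks rather than measured values.

```python
BUS_WIDTH_BITS = 4096   # memory interface width from the specification list
BANDWIDTH_GB_S = 717    # peak memory bandwidth from the specification list
CAPACITY_GB = 16        # HBM2 capacity

# Assumption: 1 GB = 1e9 bytes for both capacity and bandwidth.


def per_pin_rate_gbps(bandwidth_gb_s: float, bus_width_bits: int) -> float:
    """Data rate per interface pin (Gbit/s) implied by bandwidth and width."""
    return bandwidth_gb_s * 8 / bus_width_bits


def full_sweep_ms(capacity_gb: float, bandwidth_gb_s: float) -> float:
    """Best-case time in milliseconds to read the whole memory once."""
    return capacity_gb / bandwidth_gb_s * 1000


if __name__ == "__main__":
    rate = per_pin_rate_gbps(BANDWIDTH_GB_S, BUS_WIDTH_BITS)
    sweep = full_sweep_ms(CAPACITY_GB, BANDWIDTH_GB_S)
    print(f"Implied per-pin data rate: {rate:.2f} Gbit/s")
    print(f"Best-case full-memory read: {sweep:.1f} ms")
```

The very wide interface is what makes the bandwidth possible: each pin runs at a modest rate, but there are 4,096 of them working in parallel.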

As a special feature, Quadro GP100 cards can be combined using NVLink, which is now available on a PCI Express card. The link uses four channels with a combined capacity of 160 GB/s. Physically, the connection consists of two bridges that resemble SLI bridges.

According to NVIDIA, the GP100 delivers 20.7 TFLOPS of 16-bit performance in deep learning tasks. It can be used as a development platform for deploying deep learning environments on Windows and Linux.

NVIDIA Pascal is the most powerful GPU architecture NVIDIA has created to date. With the Quadro GP100, you can work on demanding, innovative products and complete even the most complex simulations in record time.

Peak Double Precision FP64 Performance: 5.2 TFLOPS
Peak Single Precision FP32 Performance: 10.3 TFLOPS
Peak Half Precision FP16 Performance: 20.7 TFLOPS
Display & Audio Output:
Ports: DP 1.4 (4) + DVI-D DL (1) + Stereo
DisplayPort with Audio: Yes
DVI-D Single-Link Connector: Via included adapter
HDMI Support: Via optional adapters
VGA Support: Via optional adapters
Number of Displays Supported: 4
Maximum DP 1.4 Resolution: HDR 7680 x 4320 at 30Hz (30-bit color)
5K Display Support: HDR 5120 x 2880 at 60Hz (30-bit color)
4K Display Support: HDR 4096 x 2160 at 60Hz or 3840 x 2160 at 60Hz
Maximum DVI-D DL Resolution: 2560 x 1600 at 60Hz via 3rd party adapter
Maximum DVI-D SL Resolution: 1920 x 1200 at 60Hz via included adapter
HDCP Support: Yes
Professional 3D Support: Yes, via included stereo connector bracket
Quadro Sync II Compatible: Yes (Frame Lock and Genlock)
Capacity: 16 GB HBM2
Memory Interface: 4096-bit
Memory Bandwidth: 717 GB/s
Memory Clock: approx. 1.4 Gbps per pin (717 GB/s over the 4096-bit interface)
Multi-GPU Scalability: NVLink (2-way) or SLI HB
Thermal Solution: Ultra-quiet active fansink
NVIDIA GPU Direct Compatible: Yes
Graphics APIs: Shader Model 5.1, OpenGL 4.5, DirectX 12.0, Vulkan 1.0
Compute APIs: CUDA, DirectCompute, OpenCL
NVIDIA MOSAIC: Yes (Windows 10, 8.1, 8, 7, and Linux)
Scalable geometry architecture: Yes
Hardware tessellation engine: Yes
NVIDIA GigaThread engine with dual copy engines: Yes
Shader Model 5.1 (OpenGL 4.5 and DirectX 12): Yes
Up to 32K x 32K texture and render processing: Yes
Transparent multisampling and super sampling: Yes
16x angle independent anisotropic filtering: Yes
32-bit per-component floating point texture filtering and blending: Yes
64x full scene antialiasing (FSAA)/128x FSAA in SLI Mode: Yes
Decode acceleration for: MPEG-2, MPEG-4 Part 2 Advanced Simple Profile, H.264, HEVC, MVC, VC1, DivX (version 3.11 and later), and Flash (10.1 and later)
Dedicated H.264 & HEVC Encoder: Yes
• Blu-ray dual-stream hardware acceleration (supporting HD picture-in-picture playback)
• NVIDIA GPU Boost 3.0 (Automatically improves GPU engine throughput to maximize application performance)
• Pascal SM Architecture (streaming multi-processor design that delivers greater processing efficiency)
• Dynamic Parallelism (GPU dynamically spawns new threads without going back to the CPU)
• Mixed-precision (16-, 32- and 64-bit) computing
• API support includes: CUDA C, CUDA C++, DirectCompute 5.0, OpenCL, Java, Python, and Fortran
• Error correction codes (ECC) on graphics memory
• 64 KB of RAM (dedicated shared memory per SM)
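
The mixed-precision entry in the list above involves a trade-off between throughput and numerical accuracy. The sketch below illustrates the accuracy side only. It runs on the CPU using Python's standard `struct` module and does not touch the GPU; it simply stores a value in the IEEE 754 half, single and double formats and reads it back. The helper names are hypothetical.

```python
import struct

# struct format codes for IEEE 754 half, single and double precision.
FORMATS = {"FP16": "<e", "FP32": "<f", "FP64": "<d"}


def round_trip(value: float, fmt: str) -> float:
    """Store `value` in the given binary format and read it back."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def rounding_error(value: float, precision: str) -> float:
    """Absolute error introduced by storing `value` at `precision`."""
    return abs(round_trip(value, FORMATS[precision]) - value)


if __name__ == "__main__":
    sample = 0.1  # not exactly representable in binary floating point
    for name in ("FP16", "FP32", "FP64"):
        print(f"{name}: stored error = {rounding_error(sample, name):.3e}")
```

The error shrinks sharply from FP16 to FP32 to FP64. This is why lower precision is commonly used where small errors are tolerable, such as deep learning, while simulation and analysis work tends to rely on FP64.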

