NVIDIA Datacenter GPU Lineage Reference

GPU Resource Reference

NVIDIA Datacenter GPU
Lineage Reference

A consolidated reference for NVIDIA-branded datacenter and compute GPUs from the announced Rubin roadmap back to the Tesla brand era, with ITAD-specific valuation guidance per generation. Built for ITAD operators, secondary market buyers, and procurement teams who need to identify SKUs, verify form-factor compatibility, and flag export-controlled variants in disposition streams.

Rubin → Blackwell → Tesla 11 architecture generations Form factor compatibility Export-control SKU map ITAD valuation notes

GPU Resource Configuration Guide

OEM part-number mapping for Dell, HPE, Lenovo, and DGX platforms. Server-first lookup and PN reverse-search.

Open Guide →

Reviewed and refreshed as new SKUs ship.

Every NVIDIA datacenter GPU, with the disposition pitfalls flagged

The page below catalogs NVIDIA-branded datacenter and compute GPUs by architecture generation, ordered newest first: announced Rubin and Feynman roadmap, then Blackwell, Hopper, Ada Lovelace, Ampere, Turing, Volta, Pascal, Maxwell, Kepler, and the Tesla / Fermi early generation. Each section lists the shipped SKUs with year, memory configuration, form factor, and TDP.

What sets this reference apart from a public spec list is the ITAD commentary attached to each generation. Form-factor identification (SXM vs PCIe), HGX baseboard compatibility, China-export-control SKU variants, and firmware-entitlement constraints are the four most common sources of secondary-market pricing error — they are flagged inline at the generation where they apply, and consolidated in the matrices at the bottom of the page.

Out of scope: GeForce consumer cards, Quadro workstation cards, and OEM-only variants are not catalogued here unless they are routinely encountered in enterprise ITAD streams.

The “Tesla” name has two unrelated meanings — both appear on this page

The Tesla GPU product brand was retired in May 2020. Datacenter products before that date carried “Tesla” branding regardless of the underlying architecture (e.g., Tesla K80 is Kepler, Tesla V100 is Volta). After May 2020, NVIDIA names datacenter products by architecture letter — A-series for Ampere, L-series for Ada Lovelace, H-series for Hopper, B-series for Blackwell.

Separately, NVIDIA’s first unified-architecture GPU family from 2007 is also called “Tesla” — that is a microarchitecture name, used internally for the original CUDA generation. Neither has any connection to the EV company.

Announced, not yet shipping

Rubin · Feynman

For forward-planning purposes only — specifications are provisional until shipping silicon is available.

ArchitectureGPUExpectedMemoryNotes
RubinR100H2 2026HBM4 (8 stacks)Pairs with Vera ARM CPU
Rubin UltraGR200 (est.)2027HBM4 (12 stacks)
FeynmanTBD2028TBDAnnounced GTC March 2025

Blackwell · B-series (2024–present)

B100 / B200 flagship · HBM3e · GB200 NVL72 rack-scale systems

Blackwell ships as discrete B100 / B200 / B200A cards and as Grace Blackwell GB200 superchips integrated into NVL72 rack-scale liquid-cooled systems. SXM6 is a new socket and is not backward compatible with HGX H100 / SXM5 servers.

ModelYearMemoryForm factorTDPNotes
B1002024192 GB HBM3eSXM6~700W8× HBM stacks
B2002024192 GB HBM3eSXM6~1,000WFlagship Blackwell
B200A2025192 GB HBM3ePCIeTBDAir-cooled PCIe variant
GB200 Superchip2024192 GB HBM3e + 480 GB LPDDR5SXM~2,700W*Grace CPU + B200 GPU
GB200 NVL72202472× B200 (13.8 TB total)Rack systemLiquid36 Grace Blackwell chips per rack
GB300 NVL72 (Blackwell Ultra)2025192 GB HBM3e per GPURack systemLiquid~50% more FP4 perf vs GB200 NVL72

*Per-rack figure for NVL72 system

ITAD note. Blackwell is effectively sold out in new channels through mid-2026; meaningful secondary-market volume is not expected before 2026–2027.

GB200 NVL72 is a rack-scale liquid-cooled system, not a discrete GPU card. Disposition requires specialized liquid-cooling logistics and whole-rack handling rather than card-level treatment. SXM6 is a new socket — it is not interchangeable with HGX H100 / SXM5 servers. B200A (PCIe) targets air-cooled deployments and will be more accessible to standard rack environments.

Hopper · H-series (2022–present)

H100 / H200 flagship training · HBM3 / HBM3e · NVLink 4.0

Hopper is the dominant training and high-end inference architecture of the current cycle. H100 SXM5 cards plug into HGX H100 baseboards; the same baseboard accepts H200 SXM5 cards, which carry the same GH100 die but roughly double the memory capacity.

ModelYearMemoryForm factorTDPNotes
H100 SXM5 80GB202280 GB HBM3SXM5700WFlagship training; NVLink 4.0 (900 GB/s)
H100 PCIe 80GB202280 GB HBM2ePCIe350WLower bandwidth; no NVSwitch
H100 NVL 94GB202394 GB HBM3PCIe400WNVLink pair config; 188 GB combined
GH200 Grace Hopper202396 GB HBM3 + 480 GB LPDDR5SXM900WGrace CPU + H100 superchip
H800 SXM 80GB202380 GB HBM3SXM5700WChina export-compliantCHINA-ONLY
H800 PCIe 80GB202380 GB HBM2ePCIe350WChina export-compliantCHINA-ONLY
H20202496 GB HBM3PCIe200WChina export-compliant; reduced computeCHINA-ONLY
H200 SXM 141GB2024141 GB HBM3eSXM5700WSame GH100 die as H100; ~2× memory
H200 NVL 141GB2024141 GB HBM3ePCIe400WNVLink pair; 282 GB combined

ITAD note — form factor. H100 SXM5 vs PCIe is the single largest valuation split in current secondary-market activity. SXM5 requires an HGX H100 baseboard and commands a 30–50% premium over the PCIe variant. Misidentifying form factor at intake is a significant pricing-error risk. H200 SXM5 is board-level compatible with HGX H100 baseboards.

H800 and H20 are China export-control SKUs under BIS Entity List rules. Re-export to restricted parties or jurisdictions may be unlawful — flag for compliance review before disposition.

Firmware entitlement caveat

nvfwupd and NVIDIA firmware packages for H100 and H200 are gated behind NVIDIA Enterprise Support Portal entitlements that do not transfer with secondary-market hardware. Units acquired through ITAD channels may have no supported firmware upgrade path. This is a material disclosure for any disposition where the buyer expects ongoing firmware maintenance, and it is a topic that secondary market documentation rarely surfaces.

Ada Lovelace · L-series (2022–present)

Air-cooled enterprise AI · GDDR6 not HBM

Ada Lovelace is the air-cooled enterprise AI line — GDDR6 rather than HBM, PCIe rather than SXM. L40S has effectively replaced A40 as the standard enterprise AI card in air-cooled deployments. L4 is the spiritual successor to T4 — single-slot, low-watt, designed for broad cloud and enterprise inference deployment.

ModelYearMemoryForm factorTDPNotes
L4202324 GB GDDR6PCIe72WT4 successor; Google Cloud G2 instances
L40202248 GB GDDR6PCIe300WRendering / inference
L40S202348 GB GDDR6PCIe350WAI + graphics; FP8 Tensor Cores; A40 successor
L20202348 GB GDDR6PCIe200WChina export-compliant L40SCHINA-ONLY
L2202324 GB GDDR6PCIe60WChina export-compliant L4CHINA-ONLY

ITAD note. L40S is the current air-cooled enterprise AI workhorse and is positioned as the A40 successor. Secondary-market volume is still emerging; expect meaningful supply as hyperscaler refresh cycles reach 2026–2027.

L4’s 72W single-slot envelope removes most deployment barriers, which is why it shows up across so many cloud and enterprise refresh streams. L20 and L2 are China-market-only — flag them for re-export compliance review at intake.

Ampere · A-series (2020–2023)

Architecture-named branding · A100 flagship · first major HGX deployment

Ampere introduced the current architecture-letter naming convention and the HGX baseboard model that carries through to Hopper and Blackwell. A100 SXM4 cards plug into HGX A100 baseboards (4-GPU or 8-GPU variants) and are not interchangeable with PCIe servers.

ModelYearMemoryForm factorTDPNotes
A10202124 GB GDDR6PCIe150WInference / graphics; single-slot
A1620214× 16 GB GDDR6PCIe250WVDI / vGPU; quad-GPU single card
A30202124 GB HBM2ePCIe165WMid-range HPC/AI
A40202048 GB GDDR6PCIe300WLarge-VRAM inference / rendering
A100 40GB SXM4202040 GB HBM2SXM4400WFlagship training
A100 80GB SXM4202080 GB HBM2eSXM4400WExtended-memory training
A100 40GB PCIe202040 GB HBM2PCIe300W
A100 80GB PCIe202180 GB HBM2ePCIe300W
A800 80GB SXM202280 GB HBM2eSXM4400WChina export-compliant A100CHINA-ONLY
A800 40GB PCIe202240 GB HBM2PCIe300WChina export-compliantCHINA-ONLY

ITAD note. A100 SXM4 is not a drop-in PCIe card — it requires an HGX-compatible baseboard, and the 4-GPU and 8-GPU HGX A100 boards are different products. Verify the donor server before pricing.

A800 is the China export-control variant of A100 with reduced NVLink bandwidth (400 GB/s vs A100’s 600 GB/s). Treat as a distinct SKU for pricing and confirm end-destination compliance — re-export to restricted parties or jurisdictions may be unlawful under BIS rules.

A40 is the large-VRAM enterprise workhorse for the generation: PCIe form factor, no HGX requirement, 48 GB on a single card. A10 is the high-volume inference card that appears across Dell and HPE mid-cycle refreshes.

Turing · T4 (2018–2022)

Inference-optimized · single-slot 70W

Turing in the consumer market is the RTX 20-series. In the datacenter line, only the T4 shipped — but it became one of the most widely deployed datacenter GPUs ever produced, on the strength of its 70W single-slot envelope.

ModelYearMemoryForm factorTDPNotes
Tesla T4201816 GB GDDR6PCIe70WDominant cloud inference GPU

ITAD note. T4 is among the highest-volume datacenter GPUs in the secondary market — AWS G4dn and Google Cloud T4 instances drove enormous deployment. OEM-rebranded variants (Dell, HPE, Lenovo) are ubiquitous in enterprise refresh streams. Per-unit value is modest, but the SKU moves quickly.

Volta · V100 (2017–2020)

Architecture-named (Tesla brand still on packaging)

V100 is the first generation where NVIDIA explicitly named datacenter products by architecture rather than product brand. SXM2, SXM3, and PCIe variants shipped over a four-year span at 16 GB and 32 GB capacities.

ModelYearMemoryForm factorTDPNotes
Tesla V100 SXM2 16GB201716 GB HBM2SXM2300WDGX-1 GPU; NVLink 2.0
Tesla V100 PCIe 16GB201716 GB HBM2PCIe250W
Tesla V100 SXM2 32GB201832 GB HBM2SXM2300W
Tesla V100 PCIe 32GB201832 GB HBM2PCIe250W
Tesla V100S PCIe 32GB201932 GB HBM2PCIe250WFaster memory variant
Tesla V100 SXM3 32GB201832 GB HBM2SXM3350WDGX-2 only — not interchangeable

ITAD note. The SXM2 vs SXM3 distinction is a critical valuation split. SXM3 is DGX-2 exclusive and has effectively no aftermarket beyond that single platform, despite premium specifications. PCIe variants are the most liquid form factor across all V100 capacities. The 32 GB capacity commands a meaningful premium over 16 GB for memory-bound training workloads.

Pascal (2016–2018)

Tesla product brand · P-series · first HBM datacenter generation

Pascal introduced HBM2 memory and the SXM2 form factor to the datacenter line. The P100 was the first NVLink-capable datacenter GPU; the P40 has had unusual longevity in the inference market.

ModelYearMemoryForm factorTDPNotes
Tesla P420168 GB GDDR5PCIe75WLow-power inference
Tesla P6201616 GB GDDR5MXM90WServer blade
Tesla P40201624 GB GDDR5PCIe250WTraining / large-batch inference
Tesla P100 SXM2201616 GB HBM2SXM2300WFirst HBM datacenter GPU; NVLink
Tesla P100 PCIe201616 GB HBM2PCIe250WPCIe variant; no NVLink

ITAD note. P40 retains durable inference demand — 24 GB GDDR5 at low secondary cost is an attractive combination for AI hobbyists and cost-sensitive inference deployments. P100 SXM2 requires an NVLink-capable server (DGX-1, HGX-1) and should not be priced as a drop-in PCIe card; verify the donor server before quoting. P4 is high-volume but low-value per unit.

Maxwell (2014–2016)

Tesla product brand · M-series

Maxwell shifted toward inference, virtual desktop infrastructure, and power-efficient deployment. The M60 was widely deployed in Citrix and VMware vGPU environments.

ModelYearMemoryNotes
Tesla M420154 GB GDDR5Low-power inference
Tesla M1020154× 8 GB GDDR5VDI / virtual desktop
Tesla M40201512 or 24 GB GDDR5Training
Tesla M6020152× 8 GB GDDR5VDI / vGPU (Citrix, VMware)

Kepler (2012–2016)

Tesla product brand · K-series

Kepler was the dominant compute architecture of the early HPC and early cloud era. The K80 in particular powered AWS P2 instances and a large fraction of first-wave cloud GPU deployments.

ModelYearMemoryNotes
Tesla K1020122× 4 GB GDDR5Dual-GPU inference card
Tesla K2020125 GB GDDR5Scientific HPC
Tesla K20X20126 GB GDDR5Titan supercomputer GPU
Tesla K40201312 GB GDDR5HPC workhorse
Tesla K8020142× 12 GB GDDR5Dual-GPU; dominant early cloud GPU

ITAD note. K80 inventory remains visible in HPC refresh streams — the install base was large and many units are still being decommissioned. K40 and K80 are end-of-driver-support, so retained value is workload-specific (CUDA-compatible legacy applications and budget HPC users) rather than broad market.

Tesla microarchitecture & Fermi (2007–2013)

First-generation CUDA compute · Tesla product brand

The earliest NVIDIA datacenter compute parts. CUDA was introduced on the original Tesla microarchitecture; ECC memory, double-precision performance, and the SXM/server-module concept arrived with Fermi. These cards are largely retired from production deployment but still surface occasionally in legacy refresh streams.

ModelYearMemoryNotes
Tesla C87020071.5 GB GDDR3First CUDA compute card
Tesla S107020084× 4 GB GDDR31U server appliance
Tesla C106020084 GB GDDR3Workstation compute
Tesla S205020104× 3 GB GDDR5Fermi server appliance
Tesla C205020103 GB GDDR5 ECCFermi workstation
Tesla C207020106 GB GDDR5 ECCFermi workstation
Tesla M205020103 GB GDDR5 ECCFermi server module
Tesla M207020106 GB GDDR5 ECCFermi server module
Tesla M209020116 GB GDDR5 ECCHigher-end Fermi server

Form factor compatibility

SXM sockets are not cross-compatible across generations. Form-factor identification at intake determines which servers a card is usable in, and is one of the primary sources of valuation error in secondary-market disposition.

Form factorArchitectureServer requirement
PCIe x16All generationsAny standard PCIe server
SXM2Pascal, VoltaDGX-1 / HGX-1 baseboard
SXM3Volta (DGX-2 only)DGX-2 exclusive — not interchangeable
SXM4Ampere (A100)HGX A100 baseboard (4-GPU or 8-GPU)
SXM5Hopper (H100, H200)HGX H100 or H200 baseboard
SXM6Blackwell (B100, B200)HGX B200 baseboard — not SXM5 compatible
MXMPascal (P6)Blade-server specific

Export-control SKU map

BIS export control rules have produced restricted variants of multiple GPU generations. These are distinct SKUs with different specifications and potential re-export restrictions; do not treat as equivalent to the base SKU for pricing.

Export SKUBased onPrimary restrictionMarket
A800A100NVLink BW: 400 GB/s (vs 600 GB/s)China only
H800H100Reduced NVLink bandwidthChina only
H20HopperSignificantly reduced computeChina only
L20L40SReduced interconnect performanceChina only
L2L4Reduced interconnect performanceChina only

Compliance action at intake

Flag any of the above SKUs during intake. Verify end-destination compliance before disposition. Re-export to restricted parties or countries may be unlawful under BIS Entity List rules.

Need a SKU-level valuation or compatibility check?

GPU Resource provides current and forward fair-market value, OEM part-number cross-references, and disposition advisory across the SKUs catalogued on this page. If you have a specific intake to value or a configuration to verify, get in touch.

Contact GPU Resource