On Tuesday at Nvidia’s GTC 2025 conference in San Jose, California, CEO Jensen Huang revealed several new AI-accelerating GPUs the company plans to release over the coming months and years. He also revealed more specifications for previously announced chips.
The centerpiece announcement was Vera Rubin, first teased at Computex 2024 and now scheduled for release in the second half of 2026. This GPU, named after the astronomer Vera Rubin, will feature tens of terabytes of memory and comes with a custom Nvidia-designed CPU called Vera.
According to Nvidia, Vera Rubin will deliver significant performance improvements over its predecessor, Grace Blackwell, particularly for AI training and inference.

Specs for Vera Rubin, presented by Jensen Huang during his GTC 2025 keynote.
Vera Rubin features two GPUs together on one die that deliver 50 petaflops of FP4 inference performance per chip. When configured in a full NVL144 rack, the system delivers 3.6 exaflops of FP4 inference compute, 3.3 times more than Blackwell Ultra’s 1.1 exaflops in a similar rack configuration.
The Vera CPU features 88 custom ARM cores with 176 threads, connected to Rubin GPUs via a high-speed 1.8 TB/s NVLink interface.
Huang also announced Rubin Ultra, which will follow in the second half of 2027. Rubin Ultra will use the NVL576 rack configuration and feature individual GPUs with four reticle-sized dies, delivering 100 petaflops at FP4 precision (a 4-bit floating-point format used for representing and processing numbers within AI models) per chip.
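To give a sense of how coarse a 4-bit floating-point format is, here is a minimal Python sketch that decodes every non-negative FP4 value. Nvidia's keynote figures do not specify a bit layout, so this assumes the E2M1 encoding (1 sign bit, 2 exponent bits, 1 mantissa bit, exponent bias 1) described in the Open Compute Project's Microscaling Formats specification. The function name `decode_fp4_e2m1` is made up for this illustration.

```python
def decode_fp4_e2m1(code: int) -> float:
    """Decode a 4-bit code, assuming the E2M1 layout (an assumption, see above)."""
    if not 0 <= code <= 0b1111:
        raise ValueError("FP4 codes are 4 bits wide (0 to 15)")

    sign = -1.0 if code & 0b1000 else 1.0   # top bit is the sign
    exponent = (code >> 1) & 0b11           # next two bits are the exponent
    mantissa = code & 0b1                   # lowest bit is the mantissa

    if exponent == 0:
        # Subnormal range: no implicit leading 1, so only 0 and 0.5 exist here.
        magnitude = mantissa * 0.5
    else:
        # Normal range: implicit leading 1, exponent bias of 1.
        magnitude = (1 + mantissa * 0.5) * 2.0 ** (exponent - 1)

    return sign * magnitude


# Codes 0 through 7 have the sign bit clear, so these are the non-negative values.
positive_values = [decode_fp4_e2m1(code) for code in range(8)]
print(positive_values)
```

Under that assumed layout, only eight distinct magnitudes are representable, which is why hardware can process so many more FP4 operations per second than FP8 or FP16 ones.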
At the rack level, Rubin Ultra will provide 15 exaflops of FP4 inference compute and 5 exaflops of FP8 training performance, about four times more powerful than the Rubin NVL144 configuration. Each Rubin Ultra GPU will include 1TB of HBM4e memory, with the complete rack containing 365TB of fast memory.
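The generational multipliers quoted here follow from simple division of the rack-level FP4 figures. The short Python sketch below reproduces that arithmetic using only numbers stated in this article. The variable and function names are illustrative, and the chip count is an inference from the quoted figures rather than something Nvidia stated.

```python
# FP4 inference figures quoted in the article, in petaflops
# (1 exaflop = 1,000 petaflops).
RUBIN_CHIP_FP4_PF = 50
RUBIN_NVL144_FP4_PF = 3_600
BLACKWELL_ULTRA_RACK_FP4_PF = 1_100
RUBIN_ULTRA_NVL576_FP4_PF = 15_000


def ratio(new_pf: int, old_pf: int) -> float:
    """Return how many times larger new_pf is than old_pf, to one decimal place."""
    return round(new_pf / old_pf, 1)


rubin_vs_blackwell_ultra = ratio(RUBIN_NVL144_FP4_PF, BLACKWELL_ULTRA_RACK_FP4_PF)
rubin_ultra_vs_rubin = ratio(RUBIN_ULTRA_NVL576_FP4_PF, RUBIN_NVL144_FP4_PF)

# Inferred, not stated by Nvidia: how many 50-petaflop chips add up to one NVL144 rack.
implied_chips_per_nvl144 = RUBIN_NVL144_FP4_PF // RUBIN_CHIP_FP4_PF

print(f"Rubin NVL144 vs. Blackwell Ultra rack: {rubin_vs_blackwell_ultra}x")
print(f"Rubin Ultra NVL576 vs. Rubin NVL144: {rubin_ultra_vs_rubin}x")
print(f"Implied chips per NVL144 rack: {implied_chips_per_nvl144}")
```

The implied count of 72 chips per rack is half of the 144 in the rack's name, which is consistent with each chip containing two GPUs.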