Nvidia Vera Rubin AI platform: next-gen computing for data centers & enterprises
TOI GLOBAL DESK | TOI GLOBAL | Jan 06, 2026, 19:33 IST
At CES 2026, Nvidia unveiled its Vera Rubin AI platform, engineered to handle large-scale AI workloads in data centers and cloud environments. The system is designed both to reduce the cost of AI computing and to accelerate the development of AI models, with its performance gains coming from a co-designed multi-chip architecture.
Nvidia has unveiled its next-generation AI computing platform, Vera Rubin, at CES 2026 in Las Vegas. Nvidia founder and CEO Jensen Huang announced the platform is now in full production, designed to support large-scale artificial intelligence workloads for data centers, cloud providers, and enterprises building advanced AI systems. The Rubin platform aims to lower the cost of AI computing and accelerate the training and deployment of AI models, succeeding Nvidia’s Blackwell architecture as part of the company's expansion of AI use across industries.
Vera Rubin is Nvidia’s first six-chip AI platform, developed using an "extreme codesign" approach where multiple components are designed together. This platform integrates Rubin GPUs, Vera CPUs, NVLink 6 networking, Spectrum-X Ethernet Photonics, ConnectX-9 networking cards, and BlueField-4 data processing units. Nvidia states this integrated design is intended to reduce bottlenecks and enhance performance for substantial AI workloads.
The Rubin GPU is capable of delivering up to 50 petaflops of inference performance, utilizing NVFP4 precision. Complementing the GPU, the Vera CPU is engineered to manage data movement and AI agent processing tasks. Nvidia emphasizes that the platform is built to facilitate large-scale AI training and inference with increased efficiency.
Nvidia claims the Rubin platform can significantly reduce the cost of generating AI tokens, bringing it down to approximately one-tenth of the cost associated with previous platforms. In conjunction with the Rubin platform, the company also introduced an AI-focused storage system named Inference Context Memory Storage. This new storage solution is designed to improve long-context AI processing by boosting both speed and efficiency.
The Rubin platform is slated for use alongside Nvidia’s open AI models and software tools. This integration is intended to support a wide range of applications across various sectors, including autonomous driving, robotics, healthcare, and climate research.
The announcement of Nvidia’s Rubin AI chip was shared by X user Sawyer Merritt, who also posted a video showcasing the next-generation chips. Elon Musk responded to this post, highlighting practical considerations for deployment.
"It will take another 9 months or so before the hardware is operational at scale and the software works well," Musk wrote.
This statement from Musk points to the realities of bringing advanced hardware and software to full operational capacity, emphasizing that the journey from announcement to widespread, effective use involves significant time and development.