Nvidia CEO Huang unveils latest AI products for the enterprise along with advances in autonomous vehicle and humanoid robot capabilities during GTC 2024 keynote

Nvidia CEO Jensen Huang unveils Blackwell GPU 18x+ faster than Hopper at GTC 2024. (Source: Nvidia on YouTube)

Nvidia CEO Jensen Huang has unveiled the company’s latest AI products and services for the enterprise during GTC 2024 (GPU Technology Conference) in San Jose, CA. The company announced the Blackwell GPU platform as part of their new lineup powering AI in areas such as robotics, drug discovery, and warehouse automation. Advances in autonomous vehicles and humanoid robots were demonstrated.

David Chien, Published 03/19/2024 🇫🇷 🇪🇸 ...

Nvidia GPU Server/Datacenter AI Biotech E-Mobility Business Virtual Reality (VR) / Augmented Reality (AR) Apple

Nvidia CEO Jensen Huang has revealed the company’s latest AI products and services during the keynote at GTC 2024 on March 18, 2024 in San Jose, CA. Targeted to enterprise customers, the new Nvidia offerings promise to greatly accelerate AI learning and application to many fields such as robotics, weather forecasting, drug discovery, and warehouse automation. The Nvidia Blackwell GPU, NIMs, NEMO, and AI foundry were among the notable offerings unveiled.

The AI training process is much more computer intensive than the use of an AI model because millions of input documents must be processed. This can take weeks, even months on the very fastest supercomputers available today for companies like Microsoft. Nvidia previously released the Hopper GPU platform in 2022 to handle such large compute loads.

Blackwell GPU platform

At GTC 2024, Huang unveiled Blackwell as the follow-up to Hopper and is available on a drop-in board replacement for current Hopper installations. The Blackwell GPU consists of dual-104 billion transistor GPU dies on a TSMC 4NP process for 20 FP4 petaflops with 192GB of HBM3e memory capable of transferring 8TB/s. In comparison, a desktop 4090 GPU has a single 76.3 billion transistor GPU, 5NM process, up to 1.32 Tensor FP8 petaflops of performance with 24GB of GDDR6X memory. If one doubles the 4090 FP8 number to adjust to 4-bits, that’s roughly 20 petaflops Blackwell vs 3 petaflops 4090.

When 72 Blackwell GPUs are racked in a single DGX GB200 NVL72 cabinet coupled with liquid cooling, improved NVLink CPUs, and other interconnect improvements, performance over Hopper cabinets jumps by 22x for FP8 AI training and 45x for FP4 AI inference. Blackwell AI training power consumption is also reduced approximately 4x versus Hopper.

NIMS pre-packaged AI models

To take advantage of this leap in performance, Nvidia introduced pre-packaged AI models called NIMS (Nvidia Inference Microservice), utilizing Kubernetes to run on Nvidia CUDA GPUs locally or on the cloud. Access to the stand-alone NIMS is through a simplified HUMAN API. The goal behind this is to create a future where AI services are created by asking an AI to create an app with certain features, then the AI mixes and matches various NIMS together without requiring low-level programming. Finally to assist in the training of NIMS, Nvidia introduced NeMO Microservices to customize, evaluate, and guardrail the training process on corporate documents.

Nvidia BioNeMO and biological NIMS

Huang announced that Nvidia will be developing NIMS trained on biological and medical data to provide researchers with easier access to AI that can improve all aspects of medicine, such as finding drug candidates faster.

Thor ASIL-D and BYD autonomous EV vehicles

Significantly, Huang stated that BYD will be the very first EV car maker in the world to adopt their new Thor ASIL-D computer utilizing an AI SoC to process visual and driving input to provide high safety in autonomous driving. Coupled with the recent announcement of SuperDrive by Plus, this suggests BYD will be one of the first automobile companies to release a Level 4 autonomous EV vehicle.

Nvidia Project Groot humanoid robots

Huang further demonstrated their advances in robotics by showcasing the abilities of their robots, which are first trained in the Omniverse as a digital twin, then allowed to complete tasks in real-life robotic bodies. Baking, finger twirling of sticks, product sorting and assembly, and navigating around obstacles were shown. To achieve this level of robotics, the Nvidia Project Groot AI was created by first training it on text, video, and demonstration inputs, then further refined with actual observations of tasks being done. Coupled with the same Thor computer used in vehicles and Nvidia Tokkio AI language model, demonstration robots were able to observe actions done by humans, then replicate them to make drinks, play the drums, and respond to spoken requests.

Nvidia Omniverse Cloud adds Apple Vision Pro support

Of minor note, Huang stated the Nvidia Omniverse Cloud now streams to the Apple Vision Pro headset in addition to the Meta Quest and HTC Vive Pro, clearly for developers who are utilizing Nvidia cloud GPUs since no Macintosh has a compatible Nvidia GPU.

Readers wanting to join the AI revolution will want a powerful Nvidia graphics card (like this at Amazon) to develop AI skills and apps.

▶ load Youtube video

Source(s)

Nvidia on YouTube, Nvidia press release

▶ ▼ Press Release

NVIDIA Launches Blackwell-Powered DGX SuperPOD for Generative AI Supercomputing at Trillion-Parameter Scale

Scales to Tens of Thousands of Grace Blackwell Superchips Using Most Advanced NVIDIA Networking, NVIDIA Full-Stack AI Software, and Storage Features up to 576 Blackwell GPUs Connected as One With NVIDIA NVLink NVIDIA System Experts Speed Deployment for Immediate AI Infrastructure

March 18, 2024

GTC—NVIDIA today announced its next-generation AI supercomputer — the NVIDIA DGX SuperPOD™ powered by NVIDIA GB200 Grace Blackwell Superchips — for processing trillion-parameter models with constant uptime for superscale generative AI training and inference workloads.

Featuring a new, highly efficient, liquid-cooled rack-scale architecture, the new DGX SuperPOD is built with NVIDIA DGX™ GB200 systems and provides 11.5 exaflops of AI supercomputing at FP4 precision and 240 terabytes of fast memory — scaling to more with additional racks.

Each DGX GB200 system features 36 NVIDIA GB200 Superchips — which include 36 NVIDIA Grace CPUs and 72 NVIDIA Blackwell GPUs — connected as one supercomputer via fifth-generation NVIDIA NVLink®. GB200 Superchips deliver up to a 30x performance increase compared to the NVIDIA H100 Tensor Core GPU for large language model inference workloads.

“NVIDIA DGX AI supercomputers are the factories of the AI industrial revolution,” said Jensen Huang, founder and CEO of NVIDIA. “The new DGX SuperPOD combines the latest advancements in NVIDIA accelerated computing, networking and software to enable every company, industry and country to refine and generate their own AI.”

The Grace Blackwell-powered DGX SuperPOD features eight or more DGX GB200 systems and can scale to tens of thousands of GB200 Superchips connected via NVIDIA Quantum InfiniBand. For a massive shared memory space to power next-generation AI models, customers can deploy a configuration that connects the 576 Blackwell GPUs in eight DGX GB200 systems connected via NVLink.

New Rack-Scale DGX SuperPOD Architecture for Era of Generative AI

The new DGX SuperPOD with DGX GB200 systems features a unified compute fabric. In addition to fifth-generation NVIDIA NVLink, the fabric includes NVIDIA BlueField®-3 DPUs and will support NVIDIA Quantum-X800 InfiniBand networking, announced separately today. This architecture provides up to 1,800 gigabytes per second of bandwidth to each GPU in the platform.

Additionally, fourth-generation NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)™ technology provides 14.4 teraflops of In-Network Computing, a 4x increase in the next-generation DGX SuperPOD architecture compared to the prior generation.

Turnkey Architecture Pairs With Advanced Software for Unprecedented Uptime
The new DGX SuperPOD is a complete, data-center-scale AI supercomputer that integrates with high-performance storage from NVIDIA-certified partners to meet the demands of generative AI workloads. Each is built, cabled and tested in the factory to dramatically speed deployment at customer data centers.

The Grace Blackwell-powered DGX SuperPOD features intelligent predictive-management capabilities to continuously monitor thousands of data points across hardware and software to predict and intercept sources of downtime and inefficiency — saving time, energy and computing costs.

The software can identify areas of concern and plan for maintenance, flexibly adjust compute resources, and automatically save and resume jobs to prevent downtime, even without system administrators present.

If the software detects that a replacement component is needed, the cluster will activate standby capacity to ensure work finishes in time. Any required hardware replacements can be scheduled to avoid unplanned downtime.

NVIDIA DGX B200 Systems Advance AI Supercomputing for Industries
NVIDIA also unveiled the NVIDIA DGX B200 system, a unified AI supercomputing platform for AI model training, fine-tuning and inference.

DGX B200 is the sixth generation of air-cooled, traditional rack-mounted DGX designs used by industries worldwide. The new Blackwell architecture DGX B200 system includes eight NVIDIA Blackwell GPUs and two 5th Gen Intel® Xeon® processors. Customers can also build DGX SuperPOD using DGX B200 systems to create AI Centers of Excellence that can power the work of large teams of developers running many different jobs.

DGX B200 systems include the FP4 precision feature in the new Blackwell architecture, providing up to 144 petaflops of AI performance, a massive 1.4TB of GPU memory and 64TB/s of memory bandwidth. This delivers 15x faster real-time inference for trillion-parameter models over the previous generation.

DGX B200 systems include advanced networking with eight NVIDIA ConnectX™-7 NICs and two BlueField-3 DPUs. These provide up to 400 gigabits per second bandwidth per connection — delivering fast AI performance with NVIDIA Quantum-2 InfiniBand and NVIDIA Spectrum™-X Ethernet networking platforms.

Software and Expert Support to Scale Production AI
All NVIDIA DGX platforms include NVIDIA AI Enterprise software for enterprise-grade development and deployment. DGX customers can accelerate their work with the pretrained NVIDIA foundation models, frameworks, toolkits and new NVIDIA NIM microservices included in the software platform.

NVIDIA DGX experts and select NVIDIA partners certified to support DGX platforms assist customers throughout every step of deployment, so they can quickly move AI into production. Once systems are operational, DGX experts continue to support customers in optimizing their AI pipelines and infrastructure.

Availability
NVIDIA DGX SuperPOD with DGX GB200 and DGX B200 systems are expected to be available later this year from NVIDIA’s global partners.

For more information, watch a replay of the GTC keynote or visit the NVIDIA booth at GTC, held at the San Jose Convention Center through March 21.

AMD CEO Lisa Su showcases a Ryzen processor. Her 2025 pay package has been raised to $33 million. (Image source: AMD)

AMD boosts CEO Lisa Su’s paycheck to $33M — still below Nvidia’s Huang 07/03/2025

Free AgiBot World Alpha robotic learning dataset accelerates AI humanoid development. (Image source: AgiBot)

AgiBot releases free humanoid robot training dataset 12/30/2024

IBM Granite 3.0 open-source AI models for businesses now available (Image source: IBM)

IBM launches Granite 3.0 open-source AI models for businesses 10/21/2024

Humans vs AI (Image source: Generated using DALL·E 3)

Humans can easily outsmart AI according to Apple-funded study 10/14/2024

Team Green keeps coming up with new ways of making its AI accelerators indispensable (Image source: Notebookcheck)

Nvidia Earth-2 platform gets StormCast new AI model for more nuanced bad weather warnings 08/22/2024

Nvidia engineers are scraping videos from YouTube and other sources to train the company's Cosmos video foundation model. (Image Source: Nvidia)

Leaked internal comms reveal Nvidia scraping lifetime worth of YouTube videos daily to train video AI model, Jensen happy with the progress 08/06/2024

Vayu One AI-driven delivery robot with passive sensors (Image source: Vayu Robotics)

World's first on-road delivery robot with AI unleashed in the US 07/24/2024

Nameless AI chip concept render (Source: DALL·E 3-generated image)

Elon Musk unveils plan to acquire $9 billion worth of AI chips from Nvidia by next summer 06/04/2024

Nvidia adds new vision, speech, and language capabilities to ChatRTX. (Source: Nvidia)

Nvidia adds new vision, speech, and language capabilities to ChatRTX - a free, local chatbot for PCs with Nvidia RTX graphics cards 05/06/2024

Is that blood? Atlas after a heavy fall. (Image: Boston Dynamics)

Boston Dynamics bids farewell to HD Atlas after a long line of accidents 04/18/2024

The Meta Quest build 64.0 improves Passthrough for the Quest 3. (Image source: Meta)

Meta Quest v64 with improved Passthrough now rolling out 04/10/2024

Nine companies have launched the Consortium to train and reskill over 95 million people within eight years to meet tech skills demand in AI-era. (Source: AI image Dall-E 3)

Nine major companies launch Consortium to address major, upcoming shifts in employment due to growing AI use 04/05/2024

Apple is exploring robotics technologies as it seeks to find the "next big thing". (Image: Dall.E)

Apple exploring home robotics in wake of failed Apple Car project 04/04/2024

ETH Zürich researchers develop state-of-art modules enabling ANYmal D robot to navigate complex terrains and obstacles. (Source: ETH Zürich on YouTube)

ETH Zurich researchers unveil four-legged ANYmal AI robot able to complete obstacle courses like K-9s in boot camp 03/31/2024

The Apple Vision Pro headset will be released in China later this year. (Image via Apple and Wikimedia Commons, w/ edits)

Apple Vision Pro will launch in China sometime in 2024 03/26/2024

LATTE3D can interpret highly specific text prompts to generate a 3D model (Image Source: NVIDIA)

NVIDIA unveils LATTE3D text-to-3D generative AI model dubbed “virtual 3D printer” 03/24/2024

The Husqvarna Automower 520 EPOS robotic lawn mower is now available in Europe. (Image source: Husqvarna)

Husqvarna Automower 520 EPOS new robot lawn mower with GPS launches 03/23/2024

At least 15 Tesla Cybertrucks are affected by a serious door striker issue that causes misalignment in the door panel. (Image source: Auto Focus on YouTube - edited)

Elon Musk confirms Cybertruck door gap issue, at least 15 production units affected by poorly installed latch 03/22/2024

The Cybertruck's large bed works to its detriment when left open while driving. (Image source: Tesla)

Cybertruck tonneau cover an efficiency win, at least 25 miles of range lost when open at 75 mph 03/20/2024

The Ford Mustang Mach-E is currently the company's smallest electric vehicle — but not for long. (Image source: Ford)

Ford's Tesla Model 2 killer to spawn affordable electric pickup truck with US-made LFP batteries 03/19/2024

Nvidia's upcoming gaming GPUs will be manufactured on the TSMC 4NP node (image via Nvidia)

Nvidia GeForce RTX 50 series Blackwell GPUs to be manufactured on TSMC's 4NP node 03/19/2024

1X NEO humanoid robot can learn how to complete tasks by observing humans. (Source: 1X Technologies)

1X Technologies unveils NEO humanoid robot that learns to tidy rooms and help around the house by watching you 02/26/2024

Varjo XR-4 mixed reality headsets with 28 MP display resolution. (Source: Varjo)

Varjo announces XR-4 mixed reality headsets with 28 MP displays that leap past Apple Vision Pro 12/05/2023

New information about Nvidia's upcoming GeForce RTX 50 series graphics cards has emerged online (image via Nvidia)

Nvidia GeForce RTX 5000 graphics cards could launch with GDDR7 memory and the same bus width as the RTX 4090 11/15/2023

New information about the GeForce RTX 5090 has emerged online (image via Nvidia)

Nvidia GeForce RTX 5090 could feature a significantly higher memory bus than the RTX 4090 07/27/2023

Nvidia's new Grace Hopper Superchip is now official (image via own)

Nvidia Grace Hopper Superchip unveiled for AI-centric workloads 05/29/2023

Wide performance differences between the mobile and desktop GeForce RTX 4090 show how power constrained gaming laptops have become (Image source: Nvidia)

Wide performance differences between the Mobile and desktop GeForce RTX 4090 show how power constrained gaming laptops have become 02/13/2023

Nvidia has launched the GeForce RTX 4090 and RTX 4080 for laptops (image via Nvidia)

Nvidia GeForce RTX 4090 and RTX 4080 laptop graphics cards announced with significant performance and efficiency gains over Ampere 01/03/2023

The H100 GPU is launching in Q3 2022. (Image Source: Nvidia)

Nvidia unveils H100 Hopper compute GPU and Grace superchip architectures 03/22/2022

Loading Comments

Comment on this article

Apple MacBook Pro 14 (2023) with M3...

Cheaper Galaxy Z Fold6 variant pric...

David Chien - Tech Writer - 667 articles published on Notebookcheck since 2023

Having worked at Activision, UCLA, Anime Expo and more, I've seen technology being used to save lives, create games, and create fantastic 3D VR/AR worlds. There's always something fun in emerging technology that I want to get my hands on and all my friends turn to me to find the best for their needs, so I'm glad to bring my experience to Notebookcheck.

Please share our article, every link counts!

> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2024 03 > Nvidia CEO Huang unveils latest AI products for the enterprise along with advances in autonomous vehicle and humanoid robot capabilities during GTC 2024 keynote

David Chien, 2024-03-19 (Update: 2024-08-15)

Source(s)

Related Articles