Huawei Technologies has unveiled its most ambitious AI system to date, the CloudMatrix 384, at the World Artificial Intelligence Conference (WAIC) in Shanghai. Shown publicly for the first time, the system is designed to accelerate large-scale model training and positions the company as a domestic alternative to Nvidia’s high-end GB200 NVL72 platform.
The system’s core comprises 384 Ascend 910C accelerators linked by a proprietary “super-node” interconnect. By clustering more chips, the design compensates for lower per-device throughput, achieving aggregate performance that, according to SemiAnalysis, can surpass Nvidia’s GB200 on some benchmarks. Huawei did not disclose exact performance figures at WAIC, but analysts note the company is prioritizing interconnect bandwidth and latency over individual processor performance.
The launch occurs as US export restrictions block Nvidia’s fastest GPUs from China, creating an opening for Huawei. As noted by Nvidia’s CEO in May, Huawei is "moving quite fast." The company can now supply domestic hardware to cloud providers and research institutes, utilizing an in-house approach that bypasses licensing constraints that limit many local chip designers.
Founder Ren Zhengfei acknowledges that Ascend chips trail US rivals in raw power, but claims that mathematical optimization and cluster computing can close performance gaps for real workloads. The company dedicates approximately ¥180 billion (≈ US$25 billion) annually to R&D, with a third allocated to long-term theoretical research, which Ren considers essential for reducing reliance on Moore's Law.
Whether the CloudMatrix 384 translates into wide commercial adoption will depend on price, software maturity and Beijing’s evolving cloud-procurement policies. Nonetheless, its appearance underlines how quickly China’s AI-hardware ecosystem is pivoting toward home-grown solutions, and how competition is shifting from individual chips to full-stack, system-level innovation.
Source: Reuters