AMD announces CDNA-based Instinct MI100 GPU with 120 CUs for HPC, promises up to 2.1x more performance per dollar compared to the NVIDIA A100

AMD Instinct MI100 HPC accelerator. (Image Source: AMD)

AMD has announced what it calls the world's fastest HPC GPU, the Instinct MI100 based on the CDNA architecture. The Instinct MI100 offers up to 11.5 TFLOPs of FP64 compute performance when paired with 2nd gen EPYC processors. The MI100 is slated to offer better performance per dollar compared to the NVIDIA A100 GPU along with support for the new ROCm 4.0 software platform.

Vaidyanathan Subramaniam, Published 11/16/2020

AMD has announced the Instinct MI100, based on the new CDNA architecture and targeted at machine learning (ML) and high-performance computing (HPC) workloads. The MI100 is slated to offer 10 TFLOPs of FP64 performance, rising to 11.5 TFLOPs when paired with second gen AMD EPYC processors.

During the presentation, AMD also confirmed that the 3rd gen EPYC processors, codenamed Milan and based on Zen 3, are now being sampled to select OEMs and are slated for a Q1 2021 launch.

AMD said that it is developing different architectures tailored for specific applications, with some overlap. While RDNA will cater to gaming, CDNA is focused on compute and HPC applications. The Instinct MI100 offers Matrix Core Technology, which enables single- and mixed-precision matrix operations in formats such as FP32, FP16, bFloat16, Int8, and Int4.

The MI100 pairs second gen Infinity Fabric with 32 GB of HBM2 memory clocked at 1.2 GHz, delivering 1.23 TB/s of bandwidth.
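As a rough sanity check, the 1.23 TB/s figure can be reproduced from the 1.2 GHz memory clock and the 4,096-bit memory interface listed in the specifications. The sketch below assumes that HBM2 transfers data on both clock edges (double data rate); the function name is purely illustrative and does not come from AMD.

```python
def hbm_bandwidth_gbs(bus_width_bits: int, clock_ghz: float,
                      transfers_per_clock: int = 2) -> float:
    """Peak memory bandwidth in GB/s.

    Assumption: double data rate, i.e. two transfers per clock cycle.
    """
    gigabits_per_second = bus_width_bits * clock_ghz * transfers_per_clock
    return gigabits_per_second / 8  # 8 bits per byte


# Values taken from the MI100 specifications: 4,096-bit bus, 1.2 GHz clock
bandwidth = hbm_bandwidth_gbs(4096, 1.2)
print(f"{bandwidth:.1f} GB/s (~{bandwidth / 1000:.2f} TB/s)")
# prints "1228.8 GB/s (~1.23 TB/s)"
```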

The following table illustrates the specifications of the AMD Instinct MI100:

Design	Full-height, Dual-slot, 10.5 in. long
Compute Units	120
Stream Processors	7,680
FP64 TFLOPs (Peak)	11.5
FP32 TFLOPs (Peak)	23.1
FP32 Matrix TFLOPs (Peak)	46.1
FP16/FP16 Matrix TFLOPs (Peak)	184.6
Int4/Int8 TOPS (Peak)	184.6
bFLOAT16 TFLOPs (Peak)	92.3
HBM2 ECC Memory	32 GB
Memory Interface	4,096-bit
Memory Clock	1.2 GHz
Memory Bandwidth	1.23 TB/s
PCIe Support	Gen4
Infinity Fabric Links/Bandwidth	3 / 276 GB/s
TDP	300 W
Cooling	Passively cooled
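The peak compute figures in the table appear to be related by simple ratios, which the following sketch checks. It assumes that each stream processor retires one fused multiply-add (counted as 2 FLOPs) per clock for FP32; the implied clock speed is back-calculated from the table and is not a figure quoted in this article.

```python
# Figures copied from the specification table
COMPUTE_UNITS = 120
STREAM_PROCESSORS = 7680
FP32_TFLOPS = 23.1
OTHER_PEAKS_TFLOPS = {
    "FP64": 11.5,
    "FP32 Matrix": 46.1,
    "bFloat16": 92.3,
    "FP16 Matrix": 184.6,
}

# Stream processors per compute unit
sp_per_cu = STREAM_PROCESSORS // COMPUTE_UNITS

# Assumption: 2 FLOPs (one fused multiply-add) per stream processor per clock
implied_clock_ghz = FP32_TFLOPS * 1e12 / (STREAM_PROCESSORS * 2) / 1e9

# Throughput of each format relative to plain FP32
ratios = {name: peak / FP32_TFLOPS for name, peak in OTHER_PEAKS_TFLOPS.items()}

print(f"Stream processors per CU: {sp_per_cu}")
print(f"Implied peak clock: {implied_clock_ghz:.3f} GHz")
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x FP32")
```

Under these assumptions, the table works out to 64 stream processors per CU and a peak clock of roughly 1.5 GHz, with FP64 at about half the FP32 rate and the matrix formats at about 2x, 4x, and 8x.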

While the MI100 is designed to work well with EPYC processors, AMD confirmed that the new GPU supports Intel processors as well. Overall, the MI100 can be expected to deliver up to 7x the FP16 performance of previous generation AMD HPC GPUs.

The Instinct MI100 delivers up to 64 GB/s of bandwidth between the CPU and the GPU over PCIe Gen4 without the need for any PCIe switches. Its three Infinity Fabric links offer up to 276 GB/s of peer-to-peer throughput, and a quad-GPU hive of MI100s can yield up to 1.1 TB/s of total bandwidth. According to AMD, these features give the MI100 significant leads over the NVIDIA A100 in FP16/FP32 workloads while also offering higher performance per dollar (see slides below).
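The article does not state how the 1.1 TB/s hive figure is derived. One plausible reading, sketched below, is that it sums the 276 GB/s of Infinity Fabric bandwidth across all four GPUs. The fully connected topology (each GPU linked directly to the other three) is an assumption used only to illustrate that this sum counts every link from both of its endpoints.

```python
# Figures from the article
LINKS_PER_GPU = 3
PER_GPU_BANDWIDTH_GBS = 276
GPUS_IN_HIVE = 4

# Bandwidth of a single Infinity Fabric link
per_link_gbs = PER_GPU_BANDWIDTH_GBS / LINKS_PER_GPU

# Reading 1: sum the per-GPU bandwidth over the whole hive
summed_gbs = GPUS_IN_HIVE * PER_GPU_BANDWIDTH_GBS

# Reading 2 (assumption: fully connected hive): count each link only once
unique_links = GPUS_IN_HIVE * (GPUS_IN_HIVE - 1) // 2
unique_link_total_gbs = unique_links * per_link_gbs

print(f"Per link: {per_link_gbs:.0f} GB/s")
print(f"Summed over GPUs: {summed_gbs} GB/s (~{summed_gbs / 1000:.1f} TB/s)")
print(f"Counting each of {unique_links} links once: {unique_link_total_gbs:.0f} GB/s")
```

The summed figure of 1,104 GB/s rounds to the quoted 1.1 TB/s, which suggests the headline number is an aggregate across GPUs rather than the bandwidth available to any single GPU.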

The Instinct MI100 supports the new ROCm 4.0 ecosystem, which AMD pegs as a complete exascale solution for ML and HPC workloads. ROCm 4.0 now uses an open-source compiler and supports OpenMP 5.0 and HIP. Additionally, PyTorch and TensorFlow are now optimized for ROCm 4.0.

The AMD Instinct MI100 is expected by the end of this year in major OEM and ODM systems from the likes of Dell, Gigabyte, HP, and SuperMicro.

AMD Instinct MI100 - Die Shot. (Image Source: AMD)

AMD Instinct MI100 - Left. (Image Source: AMD)

AMD Instinct MI100 - Bottom. (Image Source: AMD)

AMD Instinct MI100 - Right. (Image Source: AMD)

AMD Instinct MI100 - Back. (Image Source: AMD)

AMD Instinct MI100 - Top. (Image Source: AMD)

Here are some of the slides from AMD's press briefing.

Source(s)

AMD Press Release

Vaidyanathan Subramaniam - Managing Editor - 2024 articles published on Notebookcheck since 2012

Though a cell and molecular biologist by training, I have been drawn towards computers from a very young age ever since I got my first PC in 1998. My passion for technology grew quite exponentially with the times, and it has been an incredible experience from being a much solicited source for tech advice and troubleshooting among family and friends to joining Notebookcheck in 2017 as a professional tech journalist. Now, I am a Lead Editor at Notebookcheck covering news and reviews encompassing a wide gamut of the technology landscape for Indian and global audiences. When I am not hunting for the next big story or taking complex measurements for reviews, you can find me unwinding to a nice read, listening to some soulful music, or trying out a new game.

contact me via: @Geeky_Vaidy

Vaidyanathan Subramaniam, 2020-11-16 (Update: 2020-11-16)
