A startup claims to have an Nvidia-killing AI chip

The Goya AI processor architecture. (Source: Habana)

A start-up called Habana claims that its new chip, the Goya, can out-perform NVIDIA V100 and Intel Xeon platforms in terms of image processing. Goya works on ONNX and MXNet AI frameworks at present, although Habana is working on TensorFlow too. On the other hand, there may be some caveats to the new company's claims.

Deirdre O Donnell, Published 09/28/2018

Habana, an emerging player in the AI processing space, claims that the chip it has designed can beat some Intel and NVIDIA products in the same arena. This processor, called the Goya, is a 16nm platform with novel AI-focused architecture.

Habana claims that the Goya can process up to 15,000 images from the standard ResNet-50 dataset with a latency of 1.3 milliseconds (ms), provided that the batch size is 10. By contrast, the Intel Xeon Platinum 8180 takes 1 second to process 1,225 of these images, whereas the NVIDIA V100 chipset processes 2,657 in the same amount of time.

There may be some room for skepticism in assessing Habana's bold estimations. For example, the better-known NVIDIA chip needs a batch size of 256, whereas Habana is specifying only 10. Furthermore, the start-up claims that the Goya can still process up to 8,550 images in 0.27ms when the batch size is set at 1. In addition, Habana concedes that the new chip needs installation in a proprietary machine, specific memory management and hybrid quantization to perform these feats.

Nevertheless, the Goya may be of interest in the machine-learning and AI ecosystem. On the other hand, it also mainly functions to process graphs under the MXNet and ONNX formats. However, it may also become capable of running the more familiar TensorFlow framework, which is also used by Google. Habana also has another chip, the 16nm Gaudi, which is intended for neural network training. It will be available in sample form by 2019.

A graph portraying Habana's claims of super-high-volume image processing in terms of batch size and latency. (Source: ElectronicDesign)

Source(s)

Engineering.com

Intel's upcoming GPU may offer HBM VRAM at $200 08/02/2019

An 8GB Samsung HBM2 chip. (Source: 3D InCites)

New JEDEC standard increases HBM2 memory limit from 8GB to 24GB 12/18/2018

Google Chrome Labs unveils Squoosh, an open-source image compression tool that runs in your browser 11/14/2018

The RTX 2070 factory overclocked models are indeed faster than the GTX 1080, but the price difference is not really worth it. (Source: HardOCP)

Nvidia RTX 2070 complete review leaks before NDA lift 10/16/2018

The RTX 2000 Mobility GPUs will be launched in early 2019. (Source: Flipboard)

Nvidia RTX 2000-series Mobility GPU lineup leaked 10/11/2018

Compared to the RTX 2080 Ti, the Quadro RTX 6000 comes with a slightly more powerful TU102 core and 24 GB GDDR6 VRAM. (Source: Nvidia)

Nvidia's Quadro RTX 6000 professional GPU is up for pre-order 10/03/2018

AI is increasing its market value by the year, according to new reports. (Source: nanalyze.com)

AI chip market to grow by US$34 billion in 5 years 09/29/2018

NVIDIA'S share price target is reportedly on the rise. (Source: Variety)

New market analysis sends NVIDIA's share-price target sky-high 09/28/2018

Nvidia confirms launch of GeForce RTX 2070 on October 17 for $499 (source: Nvidia)

Nvidia confirms launch of GeForce RTX 2070 on October 17 for $499 09/25/2018

OnePlus 6 Android flagship, OnePlus TV coming in 2019

OnePlus TV coming next year, AI assistant in tow 09/17/2018

Microsoft acquires Lobe to accelerate AI development 09/14/2018

MediaTek believes that its new feature, Active Stereo, is a better form of face-unlocking than Apple's. (Source: Deccan Chronicle)

MediaTek claims that its new Active Stereo will beat Apple Face ID 09/07/2018

Google wants increased market coverage of enterprise data needs. (Source: Google)

Edge TPUs revealed as the future of AI for Google at Next 2018 08/16/2018

Intel plans to release the Copper Lake family a few months before the Ice Lake models. (Source: SegmentNext)

Intel's Scalable Xeon CPUs to arrive by 2020 07/26/2018

Intel Xeon D-2100 processor (Source: Intel Newsroom)

Intel Xeon D-2100 processors now official with up to 18 cores and 36 threads 02/07/2018

There's also an 18-core/36-thread iMac Pro coming later this year. (Source: Apple)

Apple's upcoming iMac Pro with 10-core custom Intel Xeon CPU surfaces on GeekBench 10/19/2017

The Platinum 8180 CPU is part of the server-grade Xeon Processor Scalable Family. (Source: Intel)

Intel's latest server grade Xeon Platinum 8180 CPU has a ridiculous price tag 07/14/2017

Loading Comments

Comment on this article

Intel CFO announces plans to deal w...

New market analysis sends NVIDIA's ...

Deirdre O Donnell - Senior Tech Writer - 8595 articles published on Notebookcheck since 2018

I became a professional writer and editor shortly after graduation. My degrees are in biomedical sciences; however, they led to some experience in the biotech area, which convinced me of its potential to revolutionize our health, environment and lives in general. This developed into an all-consuming interest in more aspects of tech over time: I can never write enough on the latest electronics, gadgets and innovations. My other interests include imaging, astronomy, and streaming all the things. Oh, and coffee.

contact me via: LinkedIn

Please share our article, every link counts!