Notebookcheck Logo

Nvidia unveils Tesla T4 inferencing GPU based on Turing

The Tesla T4 may come in a smaller 75 W TDP PCie factor, but it is almost as fast as RTX 2080 gaming GPU. (Source: Nvidia)
The Tesla T4 may come in a smaller 75 W TDP PCie factor, but it is almost as fast as RTX 2080 gaming GPU. (Source: Nvidia)
The new Tesla T4 is part of the latest enterprise offering based on the Turing architecture. While it features a TDP of only 75 W, it comes with a modified version of the TU104 chip almost as fast as the RTX 2080 gaming GPU. As such, the T4 can decode up to 38 concurrent 1080p video streams, offering maximum efficiency for smart video services based on deep learning frameworks.

Earlier this week at GTC 2018 held in Japan, Jensen Huang presented the first Tesla real-time inference accelerator based on the Turing architecture. With the latest deep learning and neural network advancements touted through the Turing Tensor Cores, Nvidia’s latest Tesla T4 GPU is aimed at accelerating a diverse array of modern AI applications.

According to Nvidia, the new Tesla T4 cards are designed to offer maximum efficiency for scale-out servers. For this purpose, the GPUs come packaged in an-energy-efficient 75-watt PCIe form factor. The Tesla T4 is optimized for hardware video transcoding loads and is meant to improve online video analysis algorithms. In this respect, the T4 can decode up to 38 concurrent 1080p video streams, offering increased performance for smart video services based on deep learning frameworks.

Even though the Tesla T4 has a maximum TDP of 75 W, the hardware specifications are nothing to sneeze at. It integrates a TU104 GPU (similar to the one found in the RTX 2080 gaming cards) with 2560 CUDA cores and 320 Tensor Cores, plus it packs 16 GB of GDDR6 VRAM that allows for up to 320 GB/s of bandwidth. These specs can deliver 8.1 TFLOPS of FP32, 65 TFLOPS of FP16, 130 TOPs of INT8 and 260 TOPs of INT4 performance. The small form factor and the reduced TDP make it easy for enterprise clients to add these cards to 1U and 4U server racks.

Availability and price information are unknown for the time being.

Performance gains over server CPUs (Source: Nvidia)
Performance gains over server CPUs (Source: Nvidia)

Source(s)

static version load dynamic
Loading Comments
Comment on this article
Please share our article, every link counts!
> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2018 09 > Nvidia unveils Tesla T4 inferencing GPU based on Turing
Bogdan Solca, 2018-09-14 (Update: 2018-09-14)