Tencent has released a new suite of compact Hunyuan models at 0.5 billion, 1.8 billion, 4 billion, and 7 billion parameters, aimed at low-power and edge deployments. All four configurations are now available on GitHub and Hugging Face, and each can run inference on a single consumer-grade graphics card, making them suitable for laptops, smartphones, smart-cabin systems, and other resource-constrained hardware.
Despite their small size, the models achieve leading scores in language understanding, mathematics, and reasoning across several public benchmarks. Tencent attributes these results to a "fusion reasoning" architecture that lets users choose between a fast-thinking mode for concise answers and a slow-thinking mode for more elaborate multi-step reasoning.
A key technical feature is the native 256K-token context window, which Tencent says is sufficient to ingest roughly 500,000 English words in a single pass. The company highlights in-house applications such as Tencent Meeting and WeChat Reading, where the models can parse an entire meeting transcript or full-length book at once, maintaining character relationships and plot details for downstream queries.
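As a back-of-the-envelope illustration of what a 256K-token window holds, the sketch below estimates whether a document fits, assuming roughly 1.3 tokens per English word — a common rule of thumb, not a figure from Tencent; actual ratios depend on the tokenizer and language.

```python
# Rough capacity check for a 256K-token context window.
# TOKENS_PER_WORD is an assumed rule-of-thumb ratio, not a Hunyuan-specific figure.
CONTEXT_WINDOW = 256 * 1024   # 262,144 tokens
TOKENS_PER_WORD = 1.3

def fits_in_context(word_count: int, reserve_for_output: int = 4096) -> bool:
    """Estimate whether a document of `word_count` English words fits,
    leaving `reserve_for_output` tokens for the model's response."""
    estimated_tokens = int(word_count * TOKENS_PER_WORD)
    return estimated_tokens + reserve_for_output <= CONTEXT_WINDOW

print(fits_in_context(120_000))  # a full-length novel of ~120K words
```

Under this assumption, a 120,000-word novel comfortably fits, consistent with the WeChat Reading use case described above.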
The four compact LLMs integrate with mainstream inference frameworks, including SGLang, vLLM, and TensorRT-LLM, and support multiple quantization formats. Initial endorsements from Arm, Qualcomm, Intel, and MediaTek indicate forthcoming deployment packages optimized for their respective client processors.
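For vLLM, serving one of the checkpoints would look roughly like the command below. This is a hedged sketch: `vllm serve` and its `--max-model-len` and `--quantization` flags are standard vLLM CLI options, but the checkpoint ID `tencent/Hunyuan-1.8B-Instruct` and the choice of FP8 quantization are assumptions for illustration, not confirmed deployment details from Tencent.

```shell
# Launch an OpenAI-compatible vLLM server for a compact Hunyuan model.
# Model ID and quantization format are illustrative assumptions.
vllm serve tencent/Hunyuan-1.8B-Instruct \
    --max-model-len 262144 \
    --quantization fp8
```

Once running, the server exposes the usual OpenAI-compatible chat endpoint, so existing client code can target the model without changes.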
Early use cases underscore the practical focus of the release. Tencent Mobile Manager reports millisecond-level spam interception without off-device data transfer. At the same time, a dual-model scheme in Tencent's smart-cabin assistant balances on-board power consumption against conversational depth. These examples, Tencent argues, demonstrate that small models can deliver enterprise-grade agent capabilities when thoughtfully engineered.
Source(s)
Fast Technology (in Chinese)