Elon Musk's xAI has launched the Grok 3 family of leading-edge large language models, which generally outperform competing models on standardized AI benchmarks.
The Grok 3 models were trained on the company's Colossus supercomputer cluster, which uses 100,000 Nvidia Hopper Tensor Core GPUs. Four models have been released: a standard and a mini non-reasoning model (Grok 3 beta and Grok 3 mini beta), plus a standard and a mini reasoning model (Grok 3 beta (Think) and Grok 3 mini beta (Think)).
The non-reasoning models generally outperform the prior chart-topping AIs, such as OpenAI's GPT-4o and DeepSeek-V3. One reason is their one-million-token context window, which lets the models take in very large amounts of text and improves their ability to synthesize the correct answer from a variety of sources. That said, the Grok 3 beta models still answer fact-seeking questions with less than 50% accuracy on the SimpleQA benchmark, so humans will still have jobs tomorrow.
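To put a one-million-token context window in perspective, here is a minimal sketch that estimates token counts from text length. It assumes the common rule of thumb of roughly four characters per token for English text; Grok's actual tokenizer may differ, so these numbers are estimates only.

```python
# Rough illustration of how much text a one-million-token window holds.
# Assumes ~4 characters per token (a common English-text heuristic),
# NOT Grok's actual tokenizer.

CHARS_PER_TOKEN = 4          # heuristic assumption
CONTEXT_WINDOW = 1_000_000   # tokens

def estimated_tokens(text: str) -> int:
    """Estimate the token count of a piece of text."""
    return len(text) // CHARS_PER_TOKEN

def fits_in_context(text: str) -> bool:
    """Check whether the text would fit in the context window."""
    return estimated_tokens(text) <= CONTEXT_WINDOW

# A 300-page novel is roughly 500,000 characters (~125,000 tokens),
# so several such books could fit in a single prompt.
novel = "x" * 500_000
print(estimated_tokens(novel))   # 125000
print(fits_in_context(novel))    # True
```

By this estimate, a single prompt could hold the text of several full-length books at once.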
The reasoning models think through complex prompts step by step, letting the user see the AI's thought process. This allows them to work through problems the way an expert would: solving smaller parts of the problem and combining the results into a proper answer. Selecting the DeepSearch agent (the search option) tells Grok 3 to search broadly and deeply across the internet and use code interpreters before generating a report that summarizes its findings. The Grok 3 (Think) models generally rank best among current AIs at solving math problems, answering graduate-level multiple-choice questions, and completing coding tasks.
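The decompose-and-combine pattern described above can be sketched with a toy example. This is not Grok's internal mechanism, just an illustration of breaking a problem into sub-problems, solving each, and combining the partial results into a final answer.

```python
# Toy illustration of decompose-and-combine reasoning.
# NOT Grok's actual internals; the problem and steps are invented
# purely to show the pattern.

def solve_word_problem():
    # Problem: "Pens cost $2 and notebooks cost $5.
    # How much do 3 pens and 4 notebooks cost?"
    steps = []

    pens_cost = 3 * 2                 # sub-problem 1: cost of the pens
    steps.append(f"3 pens x $2 = ${pens_cost}")

    notebooks_cost = 4 * 5            # sub-problem 2: cost of the notebooks
    steps.append(f"4 notebooks x $5 = ${notebooks_cost}")

    total = pens_cost + notebooks_cost  # combine the partial results
    steps.append(f"total = ${pens_cost} + ${notebooks_cost} = ${total}")

    return steps, total

steps, total = solve_word_problem()
for step in steps:
    print(step)     # the visible "thought process"
print(total)        # 26
```

The visible `steps` list plays the role of the exposed thought process; a reasoning model does something analogous at vastly larger scale.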
xAI expects to continue tuning Grok 3 for improved performance in the coming months on a 200,000-GPU supercomputer cluster. Grok 3 is now available to all users on X and Grok.com; free users may encounter usage limits, while paying users will have access to advanced features.