AI assistants are surprisingly adept at making up information and presenting it as fact. False claims, fictional sources and fabricated quotes are all part of the mix. These mistakes are commonly referred to as hallucinations. Many users have likely grown used to the problem, often depending on their own fact-checking to separate truth from fiction. But according to OpenAI, there may be an alternative. On September 5, the company behind ChatGPT released a detailed paper that offers a new explanation for why hallucinations happen – and a potential solution.
Guessing gets rewarded, uncertainty gets punished
The 36-page paper, authored by Adam Kalai and other OpenAI researchers together with Santosh Vempala of Georgia Tech, makes one thing clear: hallucinations aren't a mysterious defect of the models themselves, but a consequence of how current evaluation metrics are set up. These metrics tend to reward confident guesses and penalize expressions of uncertainty. The researchers compare this to a multiple-choice test: a student who guesses can still score points, while one who leaves questions blank gets nothing. Statistically, the guessing model comes out ahead, even if it frequently delivers incorrect information.
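A quick expected-value sketch (illustrative, not taken from the paper) shows why guessing wins under an accuracy-only metric, where a correct answer earns one point and both a wrong answer and an abstention earn zero:

```python
# Illustrative sketch: expected points per question under an accuracy-only
# metric. A correct answer scores 1; a wrong answer and an abstention both
# score 0, so any nonzero chance of being right makes guessing worthwhile.
def expected_score_accuracy_only(p_correct: float, abstain: bool) -> float:
    if abstain:
        return 0.0        # "I don't know" earns nothing
    return p_correct      # a guess earns points whenever it happens to be right

print(expected_score_accuracy_only(0.2, abstain=False))  # 0.2 -- long-shot guess
print(expected_score_accuracy_only(0.2, abstain=True))   # 0.0 -- honest abstention
```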
As a result, today’s leaderboards – which rank AI performance – focus almost entirely on accuracy, overlooking both error rates and uncertainty. OpenAI is now calling for a change. Instead of simply tallying correct answers, scoreboards should penalize confident mistakes more strongly while awarding some credit for cautious abstention. The goal is to encourage models to acknowledge uncertainty rather than confidently presenting false information as fact.
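One way such a rubric could look in code is sketched below; the penalty of 2 points per wrong answer and the 0.2 points of credit for abstaining are illustrative assumptions, not values taken from the paper:

```python
# Illustrative sketch of a penalized rubric: wrong answers cost points and
# abstaining earns a little credit. The values of `penalty` and
# `abstain_credit` are assumptions chosen for illustration only.
def expected_score_penalized(p_correct: float, abstain: bool,
                             penalty: float = 2.0,
                             abstain_credit: float = 0.2) -> float:
    if abstain:
        return abstain_credit
    # 1 point for a correct answer, minus `penalty` points for a wrong one
    return p_correct * 1.0 - (1.0 - p_correct) * penalty

print(expected_score_penalized(0.2, abstain=False))  # -1.4 -- the long-shot guess now hurts
print(expected_score_penalized(0.2, abstain=True))   #  0.2 -- abstaining is the better bet
```

Under this kind of rubric, guessing only pays off when the model is genuinely confident, which is exactly the behavior the researchers want to reward.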
Less guessing, more honesty
One example from the paper shows the difference this approach can make. On the SimpleQA benchmark, one model declined to answer more than half of the questions and was wrong in only about 26% of cases. Another model responded to nearly every question, yet hallucinated in about 75% of cases. The takeaway is clear: admitting uncertainty is more trustworthy than confident guessing that merely creates an illusion of precision.
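A rough back-of-the-envelope comparison (using the approximate figures above, treating the remaining share of questions as correct answers, and reusing the illustrative rubric values from the earlier sketches, not the paper's official scoring) shows how the choice of metric can flip the ranking:

```python
# Approximate figures derived from the benchmark numbers quoted above:
# cautious model: ~22% correct, ~26% wrong, ~52% abstained
# eager model:    ~24% correct, ~75% wrong,  ~1% abstained
# The rubric values remain illustrative assumptions.
def leaderboard_scores(correct, wrong, abstain, penalty=2.0, abstain_credit=0.2):
    accuracy_only = correct                                   # only right answers count
    penalized = correct - penalty * wrong + abstain_credit * abstain
    return accuracy_only, penalized

print(leaderboard_scores(0.22, 0.26, 0.52))  # cautious model: 0.22 accuracy, about -0.20 penalized
print(leaderboard_scores(0.24, 0.75, 0.01))  # eager model:    0.24 accuracy, about -1.26 penalized
```

On the accuracy-only column the eager model edges ahead; once confident mistakes are penalized, the cautious model wins by a wide margin.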