Google's Gemini is the second most credible among ten leading AI chatbots, while ChatGPT ranks seventh, with falsehoods in 40% of its responses to questions about current news topics. Google Gemini's misinformation rate more than doubled in the span of a year, though, rising from about 7% in August 2024 to 17% when the study was repeated this past August.
The researchers, who run regular credibility audits of the ten most popular AI tools, attributed the drastic rise in falsehoods the chatbots spew out (18% in 2024 versus 35% now) to the increased competition among AI-powered chatbots. When a chatbot didn't know the answer to a news question in 2024, for instance, it simply declined to answer in 31% of cases.
By August 2025, however, the share of non-answers had fallen to zero, with a corresponding rise in falsehoods passed off as responses. The worst offender was Inflection, whose Pi chatbot prides itself on trying to mimic the emotional intelligence of a human. That emotional intelligence, apparently, comes with a tendency to fall for fake news sources and outright propaganda crafted to flood the Internet with falsehoods designed to slant AI algorithms in a particular direction.
OpenAI's Sam Altman acknowledged ChatGPT's disinformation problem in a recent interview, saying that what keeps him awake at night is the discrepancy between how easy it is to embed disinformation in future models and the level of trust people place in ChatGPT's answers.
The most credible AI tool turned out to be Anthropic's Claude, with only 10% false answers to the same queries run through the others, a level unchanged from the August 2024 audit. Were it not for Claude's reliability, the overall trustworthiness of the leading AI chatbots would have fallen even more drastically.
After numerous rounds of testing, Apple also recently found Claude to be the most credible AI tool to power its Siri virtual assistant, and opened talks with Anthropic, pitting it against Google Gemini for custom private AI models that will run on Apple's own cloud servers.
Most credible AI tools ranking

- Claude - 10% wrong answers.
- Gemini - 17% wrong answers.
- Grok/You - 33% wrong answers.
- Copilot/Mistral - 36% wrong answers.
- ChatGPT/Meta - 40% wrong answers.
The AI tool credibility study concerns queries about news topics, as that is where the majority of AI-targeting propaganda efforts are directed. The researchers found that Russian influence operations, for instance, keep flooding the zone with millions of seemingly nonsensical AI picture collages, posts, and news pieces distributed by the Pravda network of websites, which may look innocuous but are designed to push the attitude of AI search tools in a certain direction.
There are plenty of other actors trying to influence AI chatbot responses, too, and the study showed that once Google, OpenAI, or Anthropic updated their algorithms to block one type of fake news source, the misinformation campaigns moved on to other loopholes in what is shaping up to be a constant cat-and-mouse game. The end result is that more than a third of AI chatbot answers to the news queries in the study aren't credible, while the share of AI-powered misinformation has doubled in just a year.
Source(s)
NewsGuard (PDF)