AI battle: Grok surprises Mrwhosetheboss with its performance and ChatGPT wins

Gemini, ChatGPT, Grok, and Perplexity (Image source: Gemini)

In a video posted by Mrwhosetheboss on YouTube, he tested four AI models from different brands and scored them based on performance in each task. Mrwhosetheboss went from simple queries to tricky questions and research, pushing each model to its limit.

Chibuike Okpara, Published 07/04/2025 🇩🇪 🇪🇸 ...

AI Opinion

In the video, Mrwhosetheboss tested Grok (Grok 3), Gemini (2.5 Pro), ChatGPT (GPT-4o), and Perplexity (Sonar Pro). He made it clear throughout the video that he was impressed by the performance Grok was delivering. Grok started off really well, slacked a bit, then came back to claim the second position behind ChatGPT. To be fair, ChatGPT and Gemini got their score boosted, thanks to a feature which the others simply lack — video generation.

To kick off the test, Mrwhosetheboss tested the models' real-world-problem-solving capabilities, he gave each AI model this prompt: I drive a Honda Civic 2017, how many of the Aerolite 29" Hard Shell (79x58x31cm) suitcases would I be able to fit in the boot? Grok's answer was the most straightforward as it correctly answered “2”, ChatGPT and Gemini stated it could theoretically fit 3, but practically 2. Perplexity went off the rails and did simple mathematics forgetting the object in question wasn't shapeless, and it came up with “3 or 4”

For the next question, he didn't go easy on the chatbots — he asked for advice on making a cake. Alongside his query, he uploaded an image showing 5 items, one of which isn't used for making cakes — a jar of dried Porcini mushrooms — all but one of the models fell for the trap. ChatGPT identified it as a jar of ground mixed spice, Gemini said it was a jar of crispy fried onions, Perplexity baptized it instant coffee, while Grok correctly identified it as a jar of dried mushrooms from Waitrose. Here is the image he uploaded:

An altered image of the 5 ingredients Mrwhosetheboss uploaded to the AI chatbots highlighting the jar of mushrooms (Image source: Mrwhosetheboss; cropped)

Moving on, he tested them on math, product recommendation, accounting, language translation, logical reasoning, etc. One thing was universal for them — hallucination — each of the models exhibited some level of hallucination at some point(s) in the video; talking about things that simply didn't exist with confidence. Here is how each AI ranked in the end:

ChatGPT (29 points)
Grok (24 points)
Gemini (22 points)
Perplexity (19 points)

Artificial intelligence has helped make most tasks less burdensome, especially since the arrival of LLMs. The book Artificial Intelligence (curr. $19.88 on Amazon) is one of the books that seek to help people take advantage of AI.

Source(s)

Mrwhosetheboss

Read all 4 comments / answer

Loading Comments

Comment on this article

⟨

Google Pixel Watch 4 leak: Two sizes, five colors and many accessories plus bands planned

⟩

Add as a preferred source on Google

Chibuike Okpara - Tech Writer - 506 articles published on Notebookcheck since 2024

I have always been fascinated by technology and digital devices my entire life and even got addicted to it. I have always marveled at the intricacy of even the simplest digital devices and systems around us. I have been writing and publishing articles online for about 6 years now, just about a year ago, I found myself lost in the marvel of smartphones and laptops we have in our hands every day. I developed a passion for learning about new devices and technologies that come with them and at some point, I asked myself, "Why not get into writing tech articles?" It is useless to say I followed up the idea — it is evident. I am an open-minded individual who derives an infinite amount of joy from researching and discovering new information, I believe there is so much to learn and such a short life to live, so I put my time to good use — learning new things. I am a 'bookworm' of the internet and digital devices. When I am not writing, you will find me on my devices still, I do explore and admire the beauty of nature and creatures. I am a fast learner and quickly adapt to changes, always looking forward to new adventures.

contact me via: @chibuikeokparaf, Facebook

> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2025 07 > AI battle: Grok surprises Mrwhosetheboss with its performance and ChatGPT wins

Chibuike Okpara, 2025-07- 4 (Update: 2025-07- 4)

Source(s)

Related Articles