Researchers say OpenAI's Whisper tool makes stuff up
According to a new report from ABC News (via Engadget), OpenAI's audio transcription tool, Whisper, is prone to hallucinating text that isn't part of the original recordings.
This is troubling because Whisper is already in use across a range of industries and institutions, including medical centers that rely on the tool to transcribe patient consultations, despite OpenAI's stern warning not to use it in "high-risk domains".
A machine learning engineer discovered hallucinations in about half of more than 100 hours of transcriptions he reviewed, while another developer said he found them in all of the 26,000 transcriptions he analyzed. Researchers said this could lead to faulty transcriptions in millions of recordings worldwide.

An OpenAI spokesperson told ABC News that the company has studied these reports and will incorporate the feedback in future model updates. The tool is also built into Oracle's and Microsoft's cloud platforms, which serve thousands of clients worldwide, widening the scope of the risk.
Professors Allison Koenecke and Mona Sloane examined thousands of short snippets from TalkBank and found that 40% of the hallucinations they discovered were harmful. For example, in one of the recordings, a speaker said, "He, the boy, was going to, I'm not sure exactly, take the umbrella," but the tool transcribed it as, "He took a big piece of the cross, a teeny, small piece ... I'm sure he didn't have a terror knife so he killed a number of people."