Whisper-Medusa is aiOla’s new open-source speech-recognition AI model, claiming to be 50% faster than OpenAI's Whisper

aiOla is an Israel-based company that uses AI-driven solutions for digitizing paper-based workflows. (Image source: aiOla)

aiOla has launched Whisper-Medusa, an open-source AI model designed to improve automatic speech recognition. Combining OpenAI's Whisper with aiOla's technology, Whisper-Medusa claims to operate 50% faster than Whisper itself. This model supports over 100 languages and transforms unstructured speech data into actionable insights, showing future promise in industries such as aviation, logistics, and healthcare.

Anubhav Sharma, Published 08/03/2024 🇫🇷 🇪🇸 ...

AI Software

aiOla is an Israel-based company founded in 2019 that specializes in AI-driven solutions for digitizing paper-based workflows. The company recently introduced Whisper-Medusa, an open-source AI model that's a combination of OpenAI’s Whisper and aiOla’s tech. It claims to operate over 50% faster while maintaining high accuracy. This speed is achieved through a unique token prediction method, predicting ten tokens at a time instead of one, as seen in OpenAI’s Whisper.

Whisper-Medusa was developed using weak supervision. This process involves using Whisper to transcribe audio datasets, which then serve as labels to train Medusa’s token prediction modules.

Whisper-Medusa could turn out to be a great asset for businesses that still rely on paper-based workflows in day-to-day operation. aiOla’s technology, through its backend system 'aiOla Jargonic' can assist frontline workers across various industries. For instance, in the food manufacturing industry, aiOla streamlined quality control by transforming manual checklists into digital workflows. The company says that the whole process is "as easy as uploading a photo or file of your existing processes".

Supporting over 100 languages and various accents, Whisper-Medusa could also be useful in industries such as aviation, food manufacturing, logistics, and healthcare. By converting unstructured speech data into actionable insights, businesses can cut their costs and improve resource allocation.

Those interested can find the open-source files on Hugging Face and GitHub.

aiOla's Whisper-Medusa claims to be 50% faster than OpenAI's Whisper. (Image source: aiOla)

Source(s)

aiOla via PR Newswire

Safe AI system (Image source: Generated using DALL·E 3)

OpenAI's former chief scientist raises $1 billion for safe AI systems development 09/05/2024

OpenAI is expected to exceed $100 billion in valuation in its next funding round. (Image source: WikiMedia)

Apple, Nvidia reportedly considering investments in OpenAI, following Microsoft's $13 billion stake 08/31/2024

GIMP 3.0 is nearing release, and it is set to introduce a number of game-changing features to the free Photoshop alternative. (Image source: Julian van der Merwe / Notebookcheck)

GIMP 3: 5 game-changing new features coming to the free Photoshop alternative 08/22/2024

OpenAI's fingerprinting too is said to be 99.9% accurate (Image source: OpenAI [edited])

Insider reports OpenAI's powerful anti-plagiarism tool for ChatGPT is stalled due to internal debates 08/05/2024

The SearchGPT prototype claims to provide relevant sources for all search results. (Source: OpenAI)

OpenAI's ‘SearchGPT’ prototype enters limited testing, bringing conversational AI and source attribution to searches 07/25/2024

OpenAI has launched a cheaper version of its most-powerful GPT-4o LLM, GPT-4o mini. (Image source: AI-generated, Dall-E 3)

OpenAI unveils GPT-4o mini with a price 25x lower than GPT-4o, allowing more businesses and users to access quality AI 07/19/2024

ChatGPT on Mac has issues. (Source: OpenAI)

OpenAI app for macOS updates in response to non-encrypted chats fiasco 07/06/2024

Fans will soon be able to see Scarlett Johansson on the big screen again. Her new drama-comedy "Fly Me to the Moon" opens in US-american cinemas on July 12. (Source: OpenAi)

Scarlett Johansson vs. OpenAI: ChatGPT's AI assistant "Sky" is supposed to use Johansson's voice without her consent 05/21/2024

Shy Kids made Air Head in collaboration with OpenAI's Sora video generation model. (Image source: Shy Kids on YouTube)

OpenAI's Sora finicky to work with, needs hundreds of prompts, serious VFX work for under 2 minutes of cohesive story 04/27/2024

Layered Reality will bring Elvis back to the stage in November 2023 during a virtual concert. (Source: Layered Reality)

Elvis returns to the stage November thanks to Layered Reality's artificial intelligence and machine learning magic 01/05/2024

Loading Comments

Comment on this article

OPPO Reno13 and 13 Pro tipped to la...

Ai Bundle: Expansion board allows f...

Anubhav Sharma - Tech Writer - 900 articles published on Notebookcheck since 2024

Fueled by a childhood spent taking apart video game consoles to see how they worked, I turned my passion for tech into writing. I have a double Bachelor's in Computer Science Engineering (2018) and English (2024). I've been writing on a variety of tech topics since 2016, with a particular interest in gaming. When I'm not hunting down the latest tech news, you'll find me producing music, gaming, or hiking.

contact me via: @lottamuzic, LinkedIn

Please share our article, every link counts!