Notebookcheck Logo

Whisperfile by Mozilla Ocho team delivers local and easy ASR

llamafile project (Image Source: Mozilla)
llamafile project (Image Source: Mozilla)
The team known for innovation and experiments at Mozilla has integrated OpenAI’s Whisper automatic speech recognition system into their llamafile project. The project aims to deliver local hardware-friendly large language models in a single-file executable. The ability to locally run transcription software can benefit privacy-minded users looking to transcribe sensitive audio.

For those unfamiliar with the llamafile project, the Mozilla Ocho team is working on removing the complexity surrounding the installation of LLMs by combining llama.cpp and Cosmopolitan Libc both of which utilize the C/C++ programming language. Llamafile uses the inference of Meta’s LLaMA model provided by llama.cpp and pairs it with the build-once run-anywhere transformation of C by Cosmopolitan Libc. Cosmopolitan Libc also removes the need for an interpreter or virtual machine to run C on Linux, Mac, and Windows machines. 

The latest addition to the llamafile project, Whisperfile, was detailed on Mozilla’s Hugging Face profile. According to their page, “Whisperfile is a high-performance implementation of OpenAI's Whisper created by Mozilla Ocho as part of the llamafile project, based on the whisper.cpp software written by Georgi Gerganov, et al.” For the project, the team packaged all necessary weights for the model into “whisperfiles” that maintain the features found in Cosmopolitan Libc allowing the model to run on most systems. 

The “Getting Started with Whisperfile” documentation on the team's GitHub profile details a simple install process consisting of only three steps. Once a user downloads the model, builds the software, and runs the program they can choose from three different models to start transcribing audio. The models range from tiny to large with speed being traded for accuracy in the transcription process. Additionally, Whisperfile can translate different languages into English using the multilingual model and the “--translate” flag. 

For information on llamafile, Whisperfile, and additional details on the installation process, kindly visit the resources below. 

Source(s)

static version load dynamic
Loading Comments
Comment on this article
Please share our article, every link counts!
Mail Logo
> Expert Reviews and News on Laptops, Smartphones and Tech Innovations > News > News Archive > Newsarchive 2024 08 > Whisperfile by Mozilla Ocho team delivers local and easy ASR
Stephen Pereyra, 2024-08-23 (Update: 2024-08-23)