Last Updated on December 19, 2023
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. Powered by deep learning and neural networks, Whisper is a natural language processing system that’s built on PyTorch.
The software offers transcription in multiple languages, as well as translation from those languages into English.
This is free and open source software.
I’ve updated this section
We tested Whisper originally with Ubuntu 22.04 LTS (as we ran into issues using Ubuntu 22.10), as well as more recently Ubuntu 23.10.
To avoid polluting your system, we recommend installing Whisper with Anaconda or Miniconda (if you only want conda).
Download and install Anaconda using wget.
$ wget https://repo.anaconda.com/archive/Anaconda3-2023.09-0-Linux-x86_64.sh
Run the shell script:
$ bash Anaconda3-2023.09-0-Linux-x86_64.sh
You’ll be asked to accept Anaconda’s license and whether to initialize Anaconda3 by running conda init. For changes to take effect, close and re-open your current shell.
Create a conda environment, and activate it.
$ conda create --name whisper
$ conda activate whisper
Now we’re ready to install Whisper using pipx.
$ pipx install openai-whisper