ostt - Open Speech-to-Text

ostt is an interactive terminal-based audio recording and speech-to-text transcription tool. Record audio with real-time waveform visualization, automatically transcribe using multiple AI providers and models, and maintain a browsable history of all your transcriptions.

Built with Rust for performance and minimal dependencies, ostt works seamlessly on Linux and macOS.

This is free and open source software.

Key Features

Real-time waveform visualization with sparkline graphs.
dBFS-based volume metering (industry standard).
Configurable reference level for clipping detection.
Audio clipping detection with pause/resume support.
Audio compression for fast API calls.
Multiple transcription providers (OpenAI, Deepgram).
Browsable transcription history.
Keyword management for improved accuracy.
Cross-platform: Linux and macOS support.

Website: github.com/kristoferlund/ostt
Support:
Developer: Kristofer Lund
License: MIT License

ostt is written in Rust. Learn Rust with our recommended free books and free tutorials.

Related Software

Speech Recognition Tools
Whisper	Automatic speech recognition (system trained on 680,000 hours of data
Flashlight	Fast, flexible machine learning library written entirely in C++.
Coqui STT	Deep-learning toolkit for training and deploying speech-to-text models
Kaldi	C++ toolkit designed for speech recognition researchers.
SpeechBrain	All-in-one conversational AI toolkit based on PyTorch
Handy	Offline speech-to-text application
ESPnet	End-to-End speech processing toolkit
deepspeech.pytorch	Implementation of DeepSpeech2 using Baidu Warp-CTC.
Whispering	Transcription application with global speech-to-text functionality
Julius	Two-pass large vocabulary continuous speech recognition engine
CMUSphinx	Speech recognition system for mobile and server applications
Simon	Flexible speech recognition software
hyprwhspr	Native speech-to-text designed for Arch / Omarchy
ostt	Open Speech-to-Text
DeepSpeech	TensorFlow implementation of Baidu's DeepSpeech architecture.
OpenSeq2Seq	TensorFlow-based toolkit for sequence-to-sequence models
Eesen	End-to-End Speech Recognition

Read our verdict in the software roundup.

Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.

Documents	Internet	Education
Audio	Video	Graphics
Admin	Desktop	Productivity
Science	Games	Security
Utilities	Coding	Finance
Web Apps	Other	Books

Google	Microsoft	Apple
Adobe	IBM	Autodesk
Oracle	Atlassian	Corel
Cisco	Intuit	SAS
Progress	Salesforce	Citrix