Speech

ostt – Open Speech-to-Text

ostt is an interactive terminal-based audio recording and speech-to-text transcription tool. Record audio with real-time waveform visualization, automatically transcribe using multiple AI providers and models, and maintain a browsable history of all your transcriptions.

Built with Rust for performance and minimal dependencies, ostt works seamlessly on Linux and macOS.

This is free and open source software.

Key Features

  • Real-time waveform visualization with sparkline graphs.
  • dBFS-based volume metering (industry standard).
  • Configurable reference level for clipping detection.
  • Audio clipping detection with pause/resume support.
  • Audio compression for fast API calls.
  • Multiple transcription providers (OpenAI, Deepgram).
  • Browsable transcription history.
  • Keyword management for improved accuracy.
  • Cross-platform: Linux and macOS support.

Website: github.com/kristoferlund/ostt
Support:
Developer: Kristofer Lund
License: MIT License

ostt commands and examples

ostt is written in Rust. Learn Rust with our recommended free books and free tutorials.


Related Software

Speech Recognition Tools
WhisperAutomatic speech recognition (system trained on 680,000 hours of data
FlashlightFast, flexible machine learning library written entirely in C++.
Coqui STTDeep-learning toolkit for training and deploying speech-to-text models
KaldiC++ toolkit designed for speech recognition researchers.
SpeechBrainAll-in-one conversational AI toolkit based on PyTorch
HandyOffline speech-to-text application
ESPnetEnd-to-End speech processing toolkit
deepspeech.pytorchImplementation of DeepSpeech2 using Baidu Warp-CTC.
WhisperingTranscription application with global speech-to-text functionality
JuliusTwo-pass large vocabulary continuous speech recognition engine
CMUSphinxSpeech recognition system for mobile and server applications
SimonFlexible speech recognition software
hyprwhsprNative speech-to-text designed for Arch / Omarchy
osttOpen Speech-to-Text
DeepSpeechTensorFlow implementation of Baidu's DeepSpeech architecture.
OpenSeq2SeqTensorFlow-based toolkit for sequence-to-sequence models
EesenEnd-to-End Speech Recognition

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments