Handy – offline speech-to-text application

Handy is a cross-platform desktop speech-to-text application that lets you dictate directly into any text field using configurable keyboard shortcuts.

It is designed for privacy-focused local transcription: audio is processed entirely on your own computer rather than being sent to the cloud. It supports a range of speech recognition models, so you can balance speed, language coverage, and accuracy to suit your system.

This is free and open source software.

Key Features

  • Performs speech transcription entirely offline, keeping audio processing on your own machine.
  • Lets you start and stop recording with configurable keyboard shortcuts, with both push-to-talk and toggle modes available.
  • Supports multiple local recognition models, including Whisper, Parakeet, Moonshine, Canary, SenseVoice, and GigaAM, along with support for custom Whisper-compatible models.
  • Stores transcription history with timestamps, audio playback, copy and delete actions, starring, and automatic cleanup options for recordings.
  • Includes advanced output controls such as auto-submit after insertion, clipboard handling, trailing spaces, and custom word correction for commonly misheard terms.
  • Offers an experimental post-processing feature that can refine grammar, reformat text, or translate output using local or external AI providers.
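
The difference between the push-to-talk and toggle recording modes mentioned above can be illustrated with a small state machine. This is a minimal sketch, not Handy's actual implementation: the names `Mode` and `ShortcutRecorder` and the `on_key_down` / `on_key_up` callbacks are hypothetical, and it assumes the platform delivers separate key-down and key-up events for the shortcut.

```python
from enum import Enum


class Mode(Enum):
    PUSH_TO_TALK = "push_to_talk"
    TOGGLE = "toggle"


class ShortcutRecorder:
    """Tracks the recording state for one shortcut (hypothetical helper)."""

    def __init__(self, mode: Mode) -> None:
        self.mode = mode
        self.recording = False
        self._held = False  # guards against repeated key-down events while held

    def on_key_down(self) -> None:
        if self._held:
            return  # ignore auto-repeat while the shortcut stays pressed
        self._held = True
        if self.mode is Mode.PUSH_TO_TALK:
            # Hold to record: pressing the shortcut starts recording.
            self.recording = True
        else:
            # Toggle: each distinct press flips the recording state.
            self.recording = not self.recording

    def on_key_up(self) -> None:
        self._held = False
        if self.mode is Mode.PUSH_TO_TALK:
            # Releasing the shortcut stops recording.
            self.recording = False
        # In toggle mode, releasing the key changes nothing.
```

In push-to-talk mode, recording lasts only while the shortcut is held. In toggle mode, one press starts recording and the next press stops it.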

Website: github.com/cjpais/handy
Support:
Developer: CJ Pais
License: MIT License

Handy is written in Rust and TypeScript. Learn Rust with our recommended free books and free tutorials.


Related Software

Speech Recognition Tools
Whisper – Automatic speech recognition (ASR) system trained on 680,000 hours of data
Flashlight – Fast, flexible machine learning library written entirely in C++
Coqui STT – Deep-learning toolkit for training and deploying speech-to-text models
Kaldi – C++ toolkit designed for speech recognition researchers
SpeechBrain – All-in-one conversational AI toolkit based on PyTorch
ESPnet – End-to-end speech processing toolkit
deepspeech.pytorch – Implementation of DeepSpeech2 using Baidu Warp-CTC
DeepSpeech – TensorFlow implementation of Baidu's DeepSpeech architecture
Julius – Two-pass large vocabulary continuous speech recognition engine
OpenSeq2Seq – TensorFlow-based toolkit for sequence-to-sequence models
CMUSphinx – Speech recognition system for mobile and server applications
Eesen – End-to-end speech recognition
Simon – Flexible speech recognition software

Read our verdict in the software roundup.

