speech recognition Archives

Handy – offline speech-to-text application

April 1, 2026 Steve Emms Multimedia

Handy is a cross-platform desktop speech-to-text application that lets you dictate directly into any text field using configurable keyboard shortcuts.

ostt – Open Speech-to-Text

December 4, 2025 Steve Emms CLI, Multimedia

ostt is an interactive terminal-based audio recording and speech-to-text transcription tool.

Whispering – speech recognition tool

October 28, 2025 Steve Emms GUI, Multimedia

Press shortcut → speak → get text. Desktop transcription that cuts out the middleman.

sherpa-onnx is speech-to-text and text-to-speech software

June 30, 2025 Steve Emms Scientific

sherpa-onnx is software for speech-to-text, text-to-speech, speaker diarization, and voice activity detection (VAD) using next-gen Kaldi with onnxruntime.

Simon – frontend for simon speech recognition solution

October 21, 2023 Steve Emms Multimedia

Simon is open source speech recognition software which aims to be flexible and highly customizable.

Eesen – End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

October 21, 2023 Steve Emms Multimedia

Eesen aims to simplify the existing complicated, expertise-intensive ASR pipeline into a straightforward sequence learning problem.

CMUSphinx – Open Source Speech Recognition System for Mobile and Server Applications

October 21, 2023 Steve Emms Multimedia

CMUSphinx (Sphinx) is a collective term to describe a group of speech recognition systems developed at Carnegie Mellon University.

Julius – large vocabulary continuous speech recognition decoder software

October 21, 2023 Steve Emms Multimedia

Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) engine. It supports N-gram based dictation.

OpenSeq2Seq – TensorFlow-based toolkit for sequence-to-sequence models

October 21, 2023 Steve Emms Multimedia

OpenSeq2Seq is a toolkit for distributed and mixed precision training of sequence-to-sequence models.

DeepSpeech – TensorFlow implementation of Baidu’s DeepSpeech architecture

October 21, 2023 Steve Emms Multimedia

DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques.

deepspeech.pytorch – Implementation of DeepSpeech2 using Baidu Warp-CTC

October 21, 2023 Steve Emms Multimedia

deepspeech.pytorch is an implementation of DeepSpeech2 using Baidu Warp-CTC. It creates a network based on the DeepSpeech2 architecture.

ESPnet – end-to-end speech processing toolkit

October 21, 2023 Steve Emms Multimedia

ESPnet is an end-to-end speech processing toolkit that mainly focuses on end-to-end speech recognition and end-to-end text-to-speech.

Kaldi Speech Recognition Toolkit – Designed for Speech Recognition Researchers

October 21, 2023 Steve Emms Multimedia

Kaldi is a state-of-the-art speech recognition toolkit written in C++. It’s intended to be used mainly for acoustic modelling research.

SpeechBrain – conversational AI toolkit

October 21, 2023 Steve Emms Multimedia

SpeechBrain is an all-in-one conversational AI toolkit based on PyTorch. This is free and open source software written in Python.

Flashlight – C++ standalone library for machine learning

October 21, 2023 Steve Emms Multimedia

Flashlight is a fast, flexible machine learning library written entirely in C++. It provides apps for research across multiple domains.
