Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) engine.
It supports N-gram based dictation, DFA grammar based parsing, and one- pass isolated word recognition. Phone context dependencies are supported up to triphone. It can perform a multi-model decoding, a recognition using several LMs and AMs simultaneously with a single processor, and also supports “hot plugging” of arbitrary modules at run time.
The acoustic models and language models are pluggable, and you can build various types of speech recognition system by building your own models and modules to be suitable for your task. It also adopts standard formats to cope with other toolkit such as HTK, CMU-Cam SLM toolkit, etc. The core engine is implemented as embeddable library, to aim to offer speech recognition capability to various applications. The recent version supports plug-in capability so that the engine can be extended by user.
Key Features
- Real-time, hi-speed, accurate recognition based on 2-pass strategy.
- Low memory requirement: less than 32MBytes required for work area (<64MBytes for 20k-word dictation with on-memory 3-gram LM).
- Supports LM of N-gram, grammar, and isolated word.
- Language and unit-dependent: Any LM in ARPA standard format and AM in HTK ascii hmmdefs format can be used.
- Highly configurable: can set various search parameters. Also alternate decoding algorithm (1-best/word-pair approx., word trellis/word graph intermediates, etc.) can be chosen.
- Full source code documentation and manual in Engligh / Japanese.
- On-the-fly recognition for microphone and network input.
- GMM-based input rejection.
- Successive decoding, delimiting input by short pauses.
- N-best output.
- Word graph output.
- Forced alignment on word, phoneme, and state level.
- Confidence scoring.
- Server mode and control API.
- Many search parameters for tuning its performance.
- Character code conversion for result output.
- Long N-gram support.
- Run with forward / backward N-gram only.
- Confusion network output.
- Arbitrary multi-model decoding in a single thread.
- Rapid isolated word recognition.
- User-defined LM function embedding.
Website: github.com/julius-speech/julius
Support:
Developer: Lee Akinobu
License: Open source license
Julius is written in C. Learn C with our recommended free books and free tutorials.
Related Software
| Speech Recognition Tools | |
|---|---|
| Whisper | Automatic speech recognition (system trained on 680,000 hours of data |
| Flashlight | Fast, flexible machine learning library written entirely in C++. |
| Coqui STT | Deep-learning toolkit for training and deploying speech-to-text models |
| Kaldi | C++ toolkit designed for speech recognition researchers. |
| SpeechBrain | All-in-one conversational AI toolkit based on PyTorch |
| ESPnet | End-to-End speech processing toolkit |
| deepspeech.pytorch | Implementation of DeepSpeech2 using Baidu Warp-CTC. |
| DeepSpeech | TensorFlow implementation of Baidu's DeepSpeech architecture. |
| Julius | Two-pass large vocabulary continuous speech recognition engine |
| OpenSeq2Seq | TensorFlow-based toolkit for sequence-to-sequence models |
| CMUSphinx | Speech recognition system for mobile and server applications |
| Eesen | End-to-End Speech Recognition |
| Simon | Flexible speech recognition software |
Read our verdict in the software roundup.
| Universal Access Tools | |
|---|---|
| Orca Screen Reader | Scriptable screen reader |
| Liblouis | Braille translator, back-translator and formatter |
| Florence Virtual Keyboard | Extensible scalable virtual keyboard |
| Speakup | Kernel-based speech synthesizer driver suite |
| BRLTTY | Gives a braille user access to the text consoles |
| Maliit | Virtual keyboard implemented as a plug-in for Maliit Framework |
| Julius | Two-pass large vocabulary continuous speech recognition engine |
| Emacspeak | Emacs extension that provides spoken output |
| Mouse actions | Execute commands from mouse events |
| squeekboard | Keyboard-shaped input method supporting Wayland |
| GOK | GNOME On Screen Keyboard |
| CoreKeyboard | Simple virtual keyboard with word suggestions |
| Dasher | Graphical predictive text input system |
| borse | Practice braille, Morse, semaphore |
Read our verdict in the software roundup.
Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk. You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more. Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form. |

