
CMU Flite – small, fast run time synthesis engine

CMU Flite (festival-lite) is a small, fast, open source run-time text-to-speech synthesis engine developed at CMU. It is designed primarily for small embedded machines and large servers.

Flite is designed as an alternative to Festival as a text-to-speech synthesis engine for voices built using the FestVox suite of voice building tools.

Flite targets very small devices, such as PDAs, as well as large server machines that need to serve many ports. It is not a replacement for Festival but an alternative run-time engine for voices developed in the FestVox framework where size and speed are crucial.

Key Features

  • Thread safe.
  • Multi-voice, multi-language.
  • Supports synthesis of individual strings or files (utterance by utterance) to direct audio devices or to waveform files.
  • Offers simple functions suitable for use in specific applications.
  • Voices, lexicons and language descriptions can be compiled (mostly automatically for voices and lexicons) into C representations from their FestVox formats.
  • All voices, lexicons and language model data are const and in the text segment (i.e. they may be put in ROM). As they are linked in at compile time, there is virtually no startup delay.
  • For standard diphone voices, the maximum run-time memory requirement is less than about twice the memory needed for the generated waveform. For 32-bit architectures this effectively means under 1MB.
  • Supports converting the latest Clustergen Random Forest voices for use with CMU Flite.
  • Cross-platform support – runs under Linux, FreeBSD, Solaris, Mac OS X, Windows, Android, and OpenWrt devices.
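
The memory figure in the diphone-voice bullet above can be sanity-checked with simple arithmetic. The following is a minimal sketch, assuming a mono waveform of 16-bit PCM samples; the 16 kHz sample rate and 10-second utterance used in the note below are illustrative figures, not values taken from the Flite documentation. The function names are hypothetical and are not part of the Flite API.

```c
#include <stddef.h>
#include <stdint.h>

/* Bytes needed to hold a mono waveform of 16-bit PCM samples.
   sample_rate_hz and seconds are illustrative inputs chosen by the
   caller, not settings read from Flite. */
size_t waveform_bytes(unsigned sample_rate_hz, unsigned seconds)
{
    return (size_t)sample_rate_hz * seconds * sizeof(int16_t);
}

/* Upper bound implied by the "less than twice the waveform" rule of
   thumb quoted in the feature list (hypothetical helper). */
size_t runtime_bound_bytes(unsigned sample_rate_hz, unsigned seconds)
{
    return 2 * waveform_bytes(sample_rate_hz, seconds);
}
```

Under these assumptions, a 10-second utterance at 16 kHz occupies 16000 × 10 × 2 = 320,000 bytes, so the bound is 640,000 bytes, which is consistent with the "under 1MB" claim. Longer utterances or higher sample rates would raise the bound proportionally.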

Website: github.com/festvox/flite
Developer: Alan W Black and contributors
License: Core code is published under a BSD-like copyright

CMU Flite is written in C.


Related Software

Speech Tools

  • Piper: Fast, local neural text to speech system
  • Tortoise: Multi-voice text-to-speech system trained with an emphasis on quality
  • Coqui TTS: Offers pretrained models in more than 1,100 different languages
  • Bark: Transformer-based text-to-audio model
  • Dia: 1.6B parameter text to speech model
  • Festival: General multi-lingual speech synthesis system
  • Praat (SpeechAnalyser): Software for speech analysis and synthesis
  • Speech Note: Speech to Text, Text to Speech and Machine Translation
  • Mimic 3: Lightweight Text to Speech engine
  • Orca (ScreenReader): Scriptable screen reader
  • MeloTTS: High-quality multi-lingual text-to-speech library
  • Parler-TTS: Lightweight text-to-speech (TTS) model
  • Flite: Small, fast run time text to speech synthesis engine
  • RHVoice: Gives the visually impaired a synthesis voice with their screen reader
  • eSpeak NG: Continuation of the eSpeak project
  • eSpeak: Speech synthesizer using a formant synthesis method
  • Orpheus-TTS-FastAPI: High-performance self-hosted text-to-speech server
  • Gespeaker: GTK-based frontend for eSpeak
  • VoiceGen: Simple text-to-speech application
  • Glate: Google Translator and Text To Speech Service
