Voice Recognition

Festival – framework for building speech synthesis systems

Festival offers a full text to speech system with various APIs, as well an environment for development and research of speech synthesis techniques. It includes a Scheme-based command interpreter.

Besides research into speech synthesis, festival is useful as a stand-alone speech synthesis program. It is capable of producing clearly understandable speech from text.

The system is written in C++ and uses the Edinburgh Speech Tools Library for low level architecture and has a Scheme (SIOD) based command interpreter for control. There are a number of graphical user interfaces that rely on Festival.

Key Features

  • Externally configurable language independent modules:
    • phonesets.
    • lexicons.
    • letter-to-sound rules.
    • tokenizing.
    • part of speech tagging.
    • intonation and duration .
  • Waveform synthesizers:
    • Multisyn unit selection engine.
    • HTS parametric synthesis engine.
    • Clustergen parametric synthesis engine.
    • Clunits unit selection engine.
    • diphone based: residual excited LPC (and PSOLA not for distribution).
    • MBROLA database support.
  • SABLE markup, Emacs, client/server, scripting interfaces.
  • HTS hidden Markov model based synthesis engine API integration.

Website: www.cstr.ed.ac.uk/projects/festival
Support: Documentation
Developer: Centre for Speech Technology Research (CSTR) of the University of Edinburgh
License: Free X11-type license

Festival is written in the C++. Learn C++ with our recommended free books and free tutorials.


Related Software

Speech Tools
PiperFast, local neural text to speech system
TortoiseMulti-voice text-to-speech system trained with an emphasis on quality
Coqui TTSOffers pretrained models in more than 1,100 different languages
BarkTransformer-based text-to-audio model.
Dia1.6B parameter text to speech model
FestivalGeneral multi-lingual speech synthesis system
PraatSpeechAnalyserSoftware for speech analysis and synthesis
Speech NoteSpeech to Text, Text to Speech and Machine Translation
Mimic 3Lightweight Text to Speech engine
OrcaScreenReaderScriptable screen reader
MeloTTSHigh-quality multi-lingual text-to-speech library
Parler-TTSLightweight text-to-speech (TTS) model
FliteSmall, fast run time text to speech synthesis engine
RHVoiceGives the visually impaired a synthesis voice with their screen reader
eSpeak NGContinuation of the eSpeak project
eSpeakSpeech synthesizer using a formant synthesis method
Orpheus-TTS-FastAPIHigh-performance self-hosted text-to-speech server
GespeakerGTK-based frontend for eSpeak
VoiceGenSimple text-to-speech application
GlateGoogle Translator and Text To Speech Service

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments