
RHVoice – multilingual speech synthesizer

In Operation

We can test the software by piping text to rhvoice.test, for example:

$ echo "text to be spoken" | rhvoice.test
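The same test can be wrapped in a small shell function that first checks the command is available. This is a minimal sketch, not part of RHVoice: the function name speak_text is hypothetical, and it assumes the command is exposed as rhvoice.test, as in the example above.

```shell
# Speak the given text with RHVoice's test client.
# speak_text is a hypothetical helper name, not something RHVoice provides.
# Returns 127 if rhvoice.test is not on PATH.
speak_text() {
    if ! command -v rhvoice.test >/dev/null 2>&1; then
        echo "rhvoice.test not found; is RHVoice installed?" >&2
        return 127
    fi
    # printf is used instead of echo so that backslashes and leading
    # dashes in the text are passed through unchanged
    printf '%s\n' "$*" | rhvoice.test
}
```

Once the function is defined in the current shell, speak_text "text to be spoken" behaves like the one-line pipe, but fails with a clear message when RHVoice is missing.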

Here’s example audio generated with RHVoice. The output has some popping and sounds somewhat robotic.

Compare that to the audio generated with Tortoise, software which leverages both an autoregressive decoder and a diffusion decoder.

Of course, it’s unfair to compare the audio quality of RHVoice and Tortoise, given how much longer Tortoise takes to generate the audio.

The real purpose of RHVoice is to provide the voice for a screen reader such as Orca. To do that, we need to manually connect RHVoice to Speech Dispatcher. Open a terminal and change to Speech Dispatcher’s modules directory:

$ cd /usr/lib/speech-dispatcher-modules/

Create a symbolic link to RHVoice’s module for Speech Dispatcher:

$ sudo ln -s /snap/rhvoice/current/bin/sd_rhvoice
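The two commands above can be sketched as one function that checks both paths before linking. This is a minimal sketch under stated assumptions: link_sd_module is a hypothetical name, and the paths you pass in are the ones shown above, which may differ on other distributions or with a non-snap install.

```shell
# Link a Speech Dispatcher module into a modules directory.
# link_sd_module is a hypothetical helper, not part of RHVoice.
#   $1: path to the module binary (e.g. RHVoice's sd_rhvoice)
#   $2: Speech Dispatcher's modules directory
link_sd_module() {
    src=$1
    dest_dir=$2
    if [ ! -e "$src" ]; then
        echo "module not found: $src" >&2
        return 1
    fi
    if [ ! -d "$dest_dir" ]; then
        echo "modules directory not found: $dest_dir" >&2
        return 1
    fi
    link="$dest_dir/$(basename "$src")"
    # Leave an existing file or link alone rather than overwriting it
    if [ -e "$link" ] || [ -L "$link" ]; then
        echo "already present: $link" >&2
        return 0
    fi
    ln -s "$src" "$link"
}
```

Run from a root shell, the call matching the commands above would be link_sd_module /snap/rhvoice/current/bin/sd_rhvoice /usr/lib/speech-dispatcher-modules. Naming the destination explicitly means the result does not depend on the current directory.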

After rebooting the system, start Orca.
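Before starting Orca, it can be worth checking that Speech Dispatcher actually sees the new module. The sketch below uses spd-say, the command-line client that ships with Speech Dispatcher. The function name has_rhvoice_module is hypothetical, and the module name rhvoice is an assumption based on the sd_rhvoice file name.

```shell
# Report whether Speech Dispatcher lists an output module matching "rhvoice".
# has_rhvoice_module is a hypothetical helper; the module name "rhvoice"
# is an assumption derived from the sd_rhvoice binary name.
# Returns 127 if spd-say is not on PATH.
has_rhvoice_module() {
    if ! command -v spd-say >/dev/null 2>&1; then
        echo "spd-say not found; is Speech Dispatcher installed?" >&2
        return 127
    fi
    # -O asks Speech Dispatcher to list its output modules
    spd-say -O | grep -qi 'rhvoice'
}
```

If the check succeeds, spd-say -o rhvoice "text to be spoken" should speak through RHVoice, assuming the module is registered under that name.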

Summary

RHVoice works well with a screen reader such as Orca.

It supports a variety of languages including American and Scottish English, Brazilian Portuguese, Esperanto, Georgian, Polish, and Russian.

Voices are built from recordings of natural speech. They have small footprints, because only statistical models are stored on users’ computers. And though the voices lack the naturalness of the synthesizers which generate speech by combining segments of the recordings themselves, they are still very intelligible and resemble the speakers who recorded the source material.

Website: rhvoice.org
Support: GitHub Code Repository
Developer: Olga Yakovleva
License: GNU General Public License v2.0

RHVoice is written in C++ and C.



Related Software

Speech Tools
Piper: Fast, local neural text to speech system
Tortoise: Multi-voice text-to-speech system trained with an emphasis on quality
Coqui TTS: Offers pretrained models in more than 1,100 different languages
Bark: Transformer-based text-to-audio model
Dia: 1.6B parameter text to speech model
Festival: General multi-lingual speech synthesis system
PraatSpeechAnalyser: Software for speech analysis and synthesis
Speech Note: Speech to Text, Text to Speech and Machine Translation
Mimic 3: Lightweight Text to Speech engine
OrcaScreenReader: Scriptable screen reader
MeloTTS: High-quality multi-lingual text-to-speech library
Parler-TTS: Lightweight text-to-speech (TTS) model
Flite: Small, fast run time text to speech synthesis engine
RHVoice: Gives the visually impaired a synthesis voice with their screen reader
eSpeak NG: Continuation of the eSpeak project
eSpeak: Speech synthesizer using a formant synthesis method
Orpheus-TTS-FastAPI: High-performance self-hosted text-to-speech server
Gespeaker: GTK-based frontend for eSpeak
VoiceGen: Simple text-to-speech application
Glate: Google Translator and Text To Speech Service


