text to speech Archives

Glate – Google Translator and Text To Speech Service on Linux Desktop

Glate is a desktop application for Linux that provides text translation and text-to-speech capabilities using Google’s translation services.

Parler-TTS is a lightweight text-to-speech model and library for generating high-quality natural-sounding speech from text.

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework.

Orpheus-FastAPI is a high-performance self-hosted text-to-speech server built with FastAPI.

Dia is a text to speech model capable of generating ultra-realistic dialogue in a single pass.

sherpa-onnx is software for Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime

RHVoice is a multilingual speech synthesizer. The aim is to give visually impaired people access to a synthesis voice with a screen reader.

Gespeaker is a GTK-based frontend for eSpeak. Like eSpeak, Gespeaker is free and open source software.

eSpeak NG (Next Generation) is a continuation of the eSpeak with more feedback from native speakers. It supports more than 100 languages and accents.

Coqui TTS (TTS) is a library for advanced Text-to-Speech generation. It offers pretrained models in more than 1,100 different languages.

Tortoise TTS is a multi-voice text-to-speech system trained with an emphasis on quality.