Parler-TTS is a lightweight text-to-speech model and library for generating high-quality, natural-sounding speech from text.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework.
Orpheus-FastAPI is a high-performance self-hosted text-to-speech server built with FastAPI.
Dia is a text-to-speech model capable of generating ultra-realistic dialogue in a single pass.
sherpa-onnx is software for speech-to-text, text-to-speech, speaker diarization, and voice activity detection (VAD) using next-gen Kaldi with onnxruntime.
RHVoice is a multilingual speech synthesizer. It aims to give visually impaired people a synthetic voice they can use with a screen reader.
Gespeaker is a GTK-based frontend for eSpeak. Like eSpeak, Gespeaker is free and open source software.
eSpeak NG (Next Generation) is a continuation of eSpeak that incorporates more feedback from native speakers. It supports more than 100 languages and accents.
Coqui TTS (TTS) is a library for advanced text-to-speech generation. It offers pretrained models in more than 1,100 languages.
Tortoise TTS is a multi-voice text-to-speech system trained with an emphasis on quality.