this post was submitted on 25 Feb 2025
39 points (93.3% liked)

Open Source

33339 readers
101 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 5 years ago
MODERATORS
 

I would be plus if it has a simple CLI or GUI.

top 15 comments
sorted by: hot top controversial new old
[–] sonalder@lemmy.ml 1 points 1 hour ago

I think Bark from Suno is quite good : https://github.com/suno-ai/bark

[–] sp3ctre@feddit.org 4 points 12 hours ago

F5-TTS. Only needs 15 seconds of reference audio and you're good to go.

[–] Guenther_Amanita@slrpnk.net 8 points 15 hours ago (1 children)
[–] Trent@lemmy.ml 6 points 16 hours ago (1 children)

I use piper TTS. Probably not as good as the fancy AI APIs, but it's all local and runs from command line and is good enough for my purposes. YMMV.

[–] Neptr@lemmy.blahaj.zone 2 points 14 hours ago (1 children)
[–] Tundra@lemmy.ml 1 points 13 hours ago (1 children)

I was disappointed with this at first, until I loaded the "Cori" voiceset. It outshines the others

[–] Neptr@lemmy.blahaj.zone 1 points 12 hours ago

The ones I liked the most was Kusal and Lessac.

[–] Xanza@lemm.ee 2 points 13 hours ago (1 children)

Depends on your setup, but generally I recommend: https://github.com/SYSTRAN/faster-whisper

If you have an available GPU for processing it's insanely quick and better than OpenAI's whisper.

[–] octochamp@lemmy.ml 7 points 10 hours ago

this is speech-to-text! OP is looking for text-to-speech.

[–] BuboScandiacus@mander.xyz 2 points 15 hours ago
[–] ililiililiililiilili@lemm.ee 1 points 14 hours ago
[–] spikesforeyes@lemmy.ml 2 points 17 hours ago

There’s zonos, and I heard of another one called GPTsovit or something like that, but I haven’t tried that one. Zonos is pretty easy to setup and run though. Another one is Kokoro, search for Kokoro TTS to find it on google.