this post was submitted on 25 Feb 2025
42 points (93.8% liked)

Open Source

33367 readers
99 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 5 years ago
MODERATORS
 

I would be plus if it has a simple CLI or GUI.

top 15 comments
sorted by: hot top controversial new old
[–] sonalder@lemmy.ml 3 points 5 hours ago

I think Bark from Suno is quite good : https://github.com/suno-ai/bark

[–] sp3ctre@feddit.org 4 points 16 hours ago

F5-TTS. Only needs 15 seconds of reference audio and you're good to go.

[–] Guenther_Amanita@slrpnk.net 8 points 19 hours ago (1 children)
[–] Trent@lemmy.ml 6 points 20 hours ago (1 children)

I use piper TTS. Probably not as good as the fancy AI APIs, but it's all local and runs from command line and is good enough for my purposes. YMMV.

[–] Neptr@lemmy.blahaj.zone 2 points 18 hours ago (1 children)
[–] Tundra@lemmy.ml 1 points 17 hours ago (1 children)

I was disappointed with this at first, until I loaded the "Cori" voiceset. It outshines the others

[–] Neptr@lemmy.blahaj.zone 1 points 16 hours ago

The ones I liked the most was Kusal and Lessac.

[–] Xanza@lemm.ee 2 points 17 hours ago (1 children)

Depends on your setup, but generally I recommend: https://github.com/SYSTRAN/faster-whisper

If you have an available GPU for processing it's insanely quick and better than OpenAI's whisper.

[–] octochamp@lemmy.ml 8 points 14 hours ago

this is speech-to-text! OP is looking for text-to-speech.

[–] BuboScandiacus@mander.xyz 2 points 19 hours ago
[–] ililiililiililiilili@lemm.ee 1 points 17 hours ago
[–] spikesforeyes@lemmy.ml 2 points 21 hours ago

There’s zonos, and I heard of another one called GPTsovit or something like that, but I haven’t tried that one. Zonos is pretty easy to setup and run though. Another one is Kokoro, search for Kokoro TTS to find it on google.