this post was submitted on 13 Aug 2025
52 points (100.0% liked)

Linux

12577 readers
42 users here now

Welcome to c/linux!

Welcome to our thriving Linux community! Whether you're a seasoned Linux enthusiast or just starting your journey, we're excited to have you here. Explore, learn, and collaborate with like-minded individuals who share a passion for open-source software and the endless possibilities it offers. Together, let's dive into the world of Linux and embrace the power of freedom, customization, and innovation. Enjoy your stay and feel free to join the vibrant discussions that await you!

Rules:

  1. Stay on topic: Posts and discussions should be related to Linux, open source software, and related technologies.

  2. Be respectful: Treat fellow community members with respect and courtesy.

  3. Quality over quantity: Share informative and thought-provoking content.

  4. No spam or self-promotion: Avoid excessive self-promotion or spamming.

  5. No NSFW adult content

  6. Follow general lemmy guidelines.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] infjarchninja@lemmy.ml 5 points 1 day ago (4 children)

I have used Open-whisper and Fast-whisper to do subtitles.

Open whisper is easy to set up and install locally. I tried various models.

Recently I tried to do the French series En Therapie (In Therapy) which has 35 short episodes.

https://www.arte.tv/fr/videos/RC-020578/en-therapie/

Each episode is only 20 minutes long, so I thought that open whisper would be great to translate from French to English.

However. It failed dismally. Constant, regurgitation of repeated sentences. Throughout entire episodes open whisper used "him" instead of "her" and many other instances of misspelling. It would fail if there was music playing in the background.

I extracted the audio from the videos into small .wav format and .mp3 format but both failed.

I spent over a week trying to create suitable subtitles to no avail.

[–] MysteriousSophon21@lemmy.world 2 points 1 day ago (1 children)

Whisper struggles with non-english languages and background noise - you might get better results using the larger models (medium/large) with a lower temperature setting to reduce the hallucinations and repetitions your experiencing.

[–] infjarchninja@lemmy.ml 1 points 23 hours ago

hey MysteriousSophon21

I did use the larger and medium models with Open whisper and Fast whisper.

I did not consider the lower termperature settings

Thank you

load more comments (2 replies)