this post was submitted on 12 Jan 2025
Technology
The technology is nowhere near good, though. Maybe on synthetic tests, or on the data it was trained and tweaked on, I don't know.
I co-run an event where we invite speakers from all over the world, and we tried every way to generate subtitles. All of them perform at the level of YouTube's autogenerated ones. It's better than nothing, but you can't really rely on it.
Really? This is the opposite of my experience with (distil-)whisper - I use it to generate subtitles for stuff like podcasts and was stunned at first by how high-quality the results are. I typically use distil-whisper/distil-large-v3, locally. Was it among the models you tried?
Unfortunately I don't know the specific names of the models. I'll follow up if I remember to ask the people who spun up the models themselves.
The difference might be live vs. recorded content, I don't know.
Is your goal to rely on it, or to have it as a backup?
For my purpose of having a backup, nearly anything is better than nothing.
When you do live streaming there is no time for a backup: it either works or it doesn't. Better than nothing, that's for sure, but also maybe only marginally better than whatever we had 10 years ago.
No, but I think it would be super helpful to synchronize subtitles that are not aligned to the video.
This is already trivial. Bazarr has been doing it for all my subtitles for almost a decade.
You haven't been able to test it yet, and you're already calling it nowhere near good 🤦🏻
Like, how would you know?!
Relax, they didn't write a new way of doing magic, they integrated a solution from the market.
I don't know what the new car BMW is introducing this year is capable of, but I know for a fact it can't fly.