this post was submitted on 13 Oct 2024
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/pheonis2 on 2024-10-13 03:08:41+00:00.


F5-TTS, a new state-of-the-art open-source text-to-speech model, was released just a few days ago! It has 335M parameters and is designed for English and Chinese speech synthesis. It was trained on a 95,000-hour dataset, using 8 A100 GPUs for more than a week.
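
For a rough sense of scale, here is a back-of-envelope sketch using only the figures quoted above. It is not from the F5-TTS authors. It assumes "more than a week" means at least 7 full days on all 8 GPUs, and the bytes-per-parameter values (4 for fp32, 2 for fp16) are assumptions about how the weights might be stored, not something the post states.

```python
# Figures quoted in the post.
PARAMS = 335_000_000      # 335M parameters
DATASET_HOURS = 95_000    # hours of training data
NUM_GPUS = 8              # A100 GPUs

# Assumption: "more than a week" is treated as a 7-day lower bound.
MIN_TRAINING_DAYS = 7


def min_gpu_hours(num_gpus: int, days: int) -> int:
    """Lower bound on GPU-hours if every GPU ran for the whole period."""
    return num_gpus * days * 24


def weight_size_gb(params: int, bytes_per_param: int) -> float:
    """Approximate size of the raw weights in gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


gpu_hours = min_gpu_hours(NUM_GPUS, MIN_TRAINING_DAYS)
fp32_gb = weight_size_gb(PARAMS, 4)   # assumption: 4 bytes per parameter
fp16_gb = weight_size_gb(PARAMS, 2)   # assumption: 2 bytes per parameter
dataset_years = DATASET_HOURS / (24 * 365)

print(f"Training compute: at least {gpu_hours} A100 GPU-hours")
print(f"Raw weights: about {fp32_gb:.2f} GB (fp32) or {fp16_gb:.2f} GB (fp16)")
print(f"Dataset length: about {dataset_years:.1f} years of continuous audio")
```

So the quoted setup works out to at least 1344 A100 GPU-hours. Actual checkpoint sizes can differ from the raw-weight estimate depending on the file format and anything extra saved alongside the weights.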

HF Space:

Github:

Demo: https://swivid.github.io/F5-TTS/

Weights: https://huggingface.co/SWivid/F5-TTS
