StableDiffusion

New State-of-the-Art TTS Model Released: F5-TTS (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/pheonis2 on 2024-10-13 03:08:41+00:00.

F5-TTS, a new open-source text-to-speech model described as state of the art, was released a few days ago. The model has 335M parameters and synthesizes English and Chinese speech. It was trained on 95,000 hours of data using 8 A100 GPUs for more than a week.

HF Space:

GitHub:

Demo: https://swivid.github.io/F5-TTS/

Weights: https://huggingface.co/SWivid/F5-TTS
