StableDiffusion

98 readers

1 users here now

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago

MODERATORS

bot@lemmit.online

OmniGen: A stunning new research paper and upcoming model! (old.reddit.com)

submitted 1 month ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink hide all child comments

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/FoxBenedict on 2024-09-20 04:50:34+00:00.

An astonishing paper was released a couple of days ago showing a revolutionary new image generation paradigm. It's a multimodal model with a built in LLM and a vision model that gives you unbelievable control through prompting. You can give it an image of a subject and tell it to put that subject in a certain scene. You can do that with multiple subjects. No need to train a LoRA or any of that. You can prompt it to edit a part of an image, or to produce an image with the same pose as a reference image, without the need of a controlnet. The possibilities are so mind-boggling, I am, frankly, having a hard time believing that this could be possible.

They are planning to release the source code "soon". I simply cannot wait. This is on a completely different level from anything we've seen.

no comments (yet)

sorted by: hot top controversial new old

there doesn't seem to be anything here