This week in Stable Diffusion - all the major developments in a nutshell (old.reddit.com)

submitted 2 weeks ago by bot@lemmit.online to c/stablediffusion@lemmit.online

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-10-01 09:22:59+00:00.

Interesting find of the week: Kat, an engineer who built a tool to visualize time-based media with gestures.
Flux updates:
- Outpainting: ControlNet Outpainting using FLUX.1 Dev in ComfyUI demonstrated, with workflows provided for implementation.
- Fine-tuning: Flux fine-tuning can now be performed with 10GB of VRAM, making it more accessible to users with mid-range GPUs.
- Quantized model: Flux-Dev-Q5_1.gguf quantized model significantly improves performance on GPUs with 12GB VRAM, such as the NVIDIA RTX 3060.
- New ControlNet models: Depth, upscaler, and surface normals models released for image enhancement in Flux.
- CLIP and Long-CLIP models: Fine-tuned versions of CLIP-L and Long-CLIP models now fully integrated with the HuggingFace Diffusers pipeline.
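
To give a rough sense of why a Q5_1 quantization matters on a 12GB card, here is a back-of-the-envelope estimate of weight memory. It is a sketch under stated assumptions, not a measurement: the parameter count (about 12 billion for the FLUX.1 Dev transformer) is an assumption not stated in the post, the bits-per-weight figure is derived from the ggml Q5_1 block layout (32 weights per block, 5 bits each, plus an fp16 scale and an fp16 minimum), and text encoders, VAE, and activations are ignored.

```python
# Rough weight-memory estimate for a quantized model (illustrative only).

BLOCK_SIZE = 32  # weights per Q5_1 block in the ggml layout


def q5_1_bits_per_weight() -> float:
    """Effective bits per weight for Q5_1, including per-block overhead."""
    payload_bits = BLOCK_SIZE * 5   # 5-bit quantized values
    overhead_bits = 16 + 16         # fp16 scale + fp16 minimum per block
    return (payload_bits + overhead_bits) / BLOCK_SIZE


def weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Memory needed for the weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


# Assumption: ~12B parameters for the FLUX.1 Dev transformer.
N_PARAMS = 12e9

fp16_gib = weight_gib(N_PARAMS, 16)
q5_1_gib = weight_gib(N_PARAMS, q5_1_bits_per_weight())

print(f"fp16 weights: {fp16_gib:.1f} GiB")
print(f"Q5_1 weights: {q5_1_gib:.1f} GiB")
```

Under these assumptions the transformer weights drop from roughly 22 GiB at fp16 to roughly 8.4 GiB at Q5_1, which is why the quantized file can sit within a 12GB budget. Actual VRAM use will be higher once the other pipeline components and activations are loaded.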
James Cameron joins Stability AI: Renowned filmmaker James Cameron has joined Stability AI's Board of Directors, bringing his expertise in merging cutting-edge technology with storytelling to the AI company.
Put This On Your Radar:
- MIMO: Controllable character video synthesis model for creating realistic character videos with controllable attributes.
- Google's Zero-Shot Voice Cloning: New technique that can clone voices using just a few seconds of audio sample.
- Leonardo AI's Image Upscaling Tool: New high-definition image enlargement feature rivaling existing tools like Magnific.
- PortraitGen: AI portrait video editing tool enabling multi-modal portrait editing, including text-based and image-based effects.
- FaceFusion 3.0.0: Advanced face swapping and editing tool with new features like "Pixel Boost" and face editor.
- CogVideoX-I2V Workflow Update: Improved image-to-video generation in ComfyUI with better output quality and efficiency.
- Ctrl-X: New tool for image generation with structure and appearance control, without requiring additional training or guidance.
- Invoke AI 5.0: Major update to open-source image generation tool with new features like Control Canvas and Flux model support.
- JoyCaption: Free and open uncensored vision-language model (Alpha One Release) for captioning images used to train diffusion models.
- ComfyUI-Roboflow: Custom node for image analysis in ComfyUI, integrating Roboflow's capabilities.
- Tiled Diffusion with ControlNet Upscaling: Workflow for generating high-resolution images with fine control over details in ComfyUI.
- I2VEdit: Video editing tool that transforms entire videos by editing just the first frame.
- Flux LoRA showcase: New FLUX LoRA models including Simple Vector Flux, How2Draw, Coloring Book, Amateur Photography v5, Retro Comic Book, and RealFlux 1.0b.

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.
