StableDiffusion

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

501
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/PixarCEO on 2024-09-14 08:19:27+00:00.

502
 
 
The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-14 02:04:54+00:00.

503
 
 
The original was posted on /r/stablediffusion by /u/DiienOfficial on 2024-09-14 01:42:36+00:00.

504
 
 
The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-13 19:27:53+00:00.

505
 
 
The original was posted on /r/stablediffusion by /u/xbcm1037 on 2024-09-13 18:41:37+00:00.

506
 
 
The original was posted on /r/stablediffusion by /u/ectoblob on 2024-09-13 18:52:05+00:00.

507
 
 
The original was posted on /r/stablediffusion by /u/vmandic on 2024-09-13 17:02:38+00:00.


SD.Next Release 2024-09-13

Just under two weeks since the last SD.Next release, here's another update!

Highlights

Major refactor of FLUX.1 support:

  • Full ControlNet support, better LoRA support, full prompt attention implementation
  • Faster execution, more flexible loading, additional quantization options, and more...
  • Added image-to-image, inpaint, outpaint, hires modes
  • Added workflow where FLUX can be used as refiner for other models
  • Since both Optimum-Quanto and BitsAndBytes libraries are limited in their platform support matrix, try enabling NNCF for quantization/compression on-the-fly!

A few image-related goodies...

  • Context-aware resize that allows for img2img/inpaint even at massively different aspect ratios without distortions!
  • LUT color grading: apply professional color grading to your images using industry-standard .cube LUTs!
  • Auto HDR image creation for SD and SDXL, with both 16-ch true-HDR and 8-ch HDR-effect images ;)
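
To make the LUT bullet concrete, here is a minimal sketch of how a 3D .cube LUT can be read and applied to a single pixel. It is not SD.Next's implementation: the function names `parse_cube` and `apply_lut` and the sample `INVERT_CUBE` table are made up for illustration, and nearest-neighbour lookup is used where real tools typically interpolate. It assumes the usual .cube layout, in which a `LUT_3D_SIZE N` line is followed by N^3 rows of `r g b` values with the red index changing fastest.

```python
def parse_cube(text):
    """Parse a minimal 3D .cube LUT; return (size, list of (r, g, b) tuples)."""
    size = None
    table = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank line or comment
        parts = line.split()
        if parts[0] == "LUT_3D_SIZE":
            size = int(parts[1])
        elif parts[0][0].isalpha():
            continue  # other keywords such as TITLE or DOMAIN_MIN are ignored here
        else:
            table.append(tuple(float(p) for p in parts[:3]))
    if size is None or len(table) != size ** 3:
        raise ValueError("not a complete 3D LUT")
    return size, table


def apply_lut(pixel, size, table):
    """Map one (r, g, b) pixel with components in [0, 1] by nearest-neighbour lookup."""
    r, g, b = (min(size - 1, max(0, round(c * (size - 1)))) for c in pixel)
    # Red varies fastest in the table, then green, then blue.
    return table[r + size * g + size * size * b]


# Hypothetical 2x2x2 LUT that inverts every channel.
INVERT_CUBE = "TITLE \"invert\"\nLUT_3D_SIZE 2\n" + "\n".join(
    f"{1 - r} {1 - g} {1 - b}"
    for b in (0, 1) for g in (0, 1) for r in (0, 1)
)

size, table = parse_cube(INVERT_CUBE)
graded = apply_lut((1.0, 0.0, 0.0), size, table)  # pure red in, cyan out
```

A production grader would interpolate between neighbouring table entries (trilinear or tetrahedral) and process whole image arrays rather than one pixel at a time.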

A few video-related goodies...

  • CogVideoX 2b and 5b variants with support for text-to-video and video-to-video!
  • AnimateDiff prompt travel and long context windows! Create videos that travel between different prompts, at long video lengths!

And a few other updates...

  • Built-in prompt-enhancer, TAESD optimizations, new DC-Solver scheduler, global XYZ grid management, etc.
  • Updates to ZLUDA, IPEX, OpenVINO...

Plus tons of other items and fixes!

For more details see: Changelog | ReadMe | Wiki | Discord

508
 
 
The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-13 17:40:01+00:00.

509
 
 
The original was posted on /r/stablediffusion by /u/blackmixture on 2024-09-13 12:42:48+00:00.

510
 
 
The original was posted on /r/stablediffusion by /u/redd9it on 2024-09-13 09:36:37+00:00.


I was struggling to find a tool that could handle multiple image cropping, caption file generation, and zip downloads all in one go. Most tools I came across either didn’t support these features together or were cluttered with ads that made the experience frustrating.

The result is BatchCropper—a free, ad-free tool designed to streamline your workflow and make preparing FLUX LoRA training datasets as easy as possible.
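
For readers who want to script the same three steps, here is a rough standard-library sketch: compute a centred crop box, pair each image with a caption file, and bundle everything into a zip. It is not BatchCropper's code. The helper names, the sample caption, and the convention of a same-named `.txt` caption next to each image are assumptions for illustration, and the actual pixel cropping (for example with an imaging library) is left out.

```python
import io
import zipfile


def center_crop_box(width, height, target_ratio=1.0):
    """Return (left, top, right, bottom) of the largest centred box
    whose width/height ratio equals target_ratio."""
    if width / height > target_ratio:
        new_w, new_h = int(height * target_ratio), height  # too wide: trim the sides
    else:
        new_w, new_h = width, int(width / target_ratio)  # too tall: trim top and bottom
    left = (width - new_w) // 2
    top = (height - new_h) // 2
    return left, top, left + new_w, top + new_h


def build_dataset_zip(images, caption):
    """images maps filename -> already-cropped image bytes.
    Returns zip bytes holding each image plus a same-named .txt caption."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        for name, data in images.items():
            zf.writestr(name, data)
            stem = name.rsplit(".", 1)[0]
            zf.writestr(stem + ".txt", caption)
    return buf.getvalue()


box = center_crop_box(1920, 1080)  # square crop of a landscape frame
archive = build_dataset_zip({"a.png": b"placeholder bytes"}, "photo of a person")
```

The crop box uses the (left, top, right, bottom) ordering that most imaging libraries accept, so it can be passed straight to a crop call before the bytes go into the zip.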

I used Cursor with v0.dev to generate the complete website in a few hours. Give it a try and let me know how it goes.

Link to the tool: batchcropper.com

511
 
 
The original was posted on /r/stablediffusion by /u/kopasz7 on 2024-09-13 15:04:19+00:00.

512
 
 
The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-09-13 15:00:59+00:00.

Original Title: Tried Expressions with FLUX LoRA training with my new training dataset (includes expressions and used 256 images (image 19) as experiment) - even learnt body shape perfectly - prompts, workflow and more information at the oldest comment

513
 
 
The original was posted on /r/stablediffusion by /u/TerryCrewsHasacrew on 2024-09-13 12:56:44+00:00.

514
 
 
The original was posted on /r/stablediffusion by /u/ZALIA_BALTA on 2024-09-13 12:03:51+00:00.

515
 
 
The original was posted on /r/stablediffusion by /u/haofanw on 2024-09-13 11:23:29+00:00.

516
 
 
The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-13 09:22:22+00:00.


  • Open-source release of Qwen2-VL (VLM) coming soon (GITHUB) via NielsRogge on X
  • FineVideo: 66M words across 43K videos spanning 3.4K hours - CC-BY licensed video understanding dataset. It enables advanced video understanding, focusing on mood analysis, storytelling, and media editing in multimodal settings (HUGGING FACE)
  • Fluxgym Update: automatically generates sample images during training; use ANY resolution, not just 512 or 1024 (for example 712, etc.) via cocktailpeanut on X (creator)
  • Fish Speech 1.4: text to speech model trained on 700K hours of speech, multilingual (8 languages); voice cloning; low latency; ~1GB model weights (OPEN WEIGHTS) (HUGGING FACE SPACES)
  • Out of Focus v1.0: uses diffusion inversion for prompt-based image manipulation using Gradio UI, requires a high-end GPU for optimal performance (GITHUB)
  • Google NotebookLM launches "Audio Overview" feature: can turn any document into a podcast conversation. Once you upload the document and hit the generate button, two AI moderators will kick off a conversation-like discussion, diving deep into the main takeaways from the document (LINK)
  • Video Model is coming to Adobe Firefly via icreatelife on X
  • Midjourney is pioneering a new 3D exploration format for images, led by Alex Evans, innovator behind Dreams' graphics via MartinNebelong on X
  • FBRC & AWS present Culver Cup GenAI film competition at LA Tech Week via me :) on X
  • Coming soon: Vchitect 2.0 - A new text-to-video and Image-to-video model.
  • UVR5 UI: Ultimate Vocal Remover with Gradio UI (GITHUB)
  • Vidu AI Update: new "Reference to Video" feature, you can now apply consistency to anything—whether real or fictional (LINK)
  • Vchitect 2.0: new image2video/text2video model soon (LINK)
  • and slightly unrelated, but special mention: 🍓!

Wednesday's updates - link

Last week's updates - link

517
 
 
The original was posted on /r/stablediffusion by /u/hackerzcity on 2024-09-13 00:40:59+00:00.


You can now create your own LoRAs using FluxGym, which is very easy to install, either with the one-click installer or manually.

This step-by-step guide covers installation, configuration, and training your own LoRA models with ease. Learn to generate and fine-tune images with advanced prompts, perfect for personal or professional use in ComfyUI. Create your own AI-powered artwork today!

Just follow the steps to create your own LoRAs. Best of luck!

518
 
 
The original was posted on /r/stablediffusion by /u/DiienOfficial on 2024-09-13 06:43:10+00:00.

519
 
 
The original was posted on /r/stablediffusion by /u/deadlyorobot on 2024-09-13 04:51:32+00:00.

520
 
 
The original was posted on /r/stablediffusion by /u/hudsonreaders on 2024-09-13 03:23:38+00:00.

521
 
 
The original was posted on /r/stablediffusion by /u/theroom_ai on 2024-09-12 18:11:57+00:00.

522
 
 
The original was posted on /r/stablediffusion by /u/ChristinaTreasure on 2024-09-12 16:32:54+00:00.

523
 
 
The original was posted on /r/stablediffusion by /u/phr00t_ on 2024-09-12 23:57:28+00:00.


A commit yesterday to the CogVideo repo added Image2Video support!

Merge pull request #272 from THUDM/CogVideoX_dev · THUDM/CogVideo@87ad61b (github.com)

I added a feature request on the ComfyUI wrapper:

Image2Video Support (CogVideo recent update) · Issue #54 · kijai/ComfyUI-CogVideoXWrapper (github.com)

EDIT: This isn't Image2Video yet, it is work towards supporting Image2Video. The developer said it will be released within the month:

hope for image to video · Issue #270 · THUDM/CogVideo (github.com)

524
 
 
The original was posted on /r/stablediffusion by /u/wonderflex on 2024-09-12 21:40:22+00:00.

525
 
 
The original was posted on /r/stablediffusion by /u/Z3ROCOOL22 on 2024-09-12 21:18:04+00:00.

