StableDiffusion

I love the new CogVideoX-5b model and think it's great that we finally have a strong competitor in the open-source space, rivaling Kling, Runway, and others. However, I believe the community's demand for an image-to-video (img2vid) feature is evident.

A fine-tuned image-to-video version of the current text-to-video model exists but has not been released

After doing some research on GitHub, I found that the authors have stated they have no plans to open-source their current Image-to-Video model, which I find disappointing. I hope they reconsider in the future.

I believe that the first person or team to fine-tune the current model to handle image-to-video (which I know is no small task) and open-source it will gain a lot while also becoming a community legend. Alternatively, if someone develops a software solution, similar to inpainting I guess, that allows setting the first latent image, they would also be eligible for that recognition.

Keeping my fingers crossed for any of the above.

Links:

Authors' response to the image-to-video request on their GitHub

kijai mentions it in a reply on his ComfyUI wrapper node

Gradio sends IP address telemetry by default (old.reddit.com)

submitted 1 month ago by bot@lemmit.online to c/stablediffusion@lemmit.online

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/campingtroll on 2024-09-01 09:52:55+00:00.

Apologies ahead of time for the long post, but it's all info I feel is important to be aware of, since it is likely happening on your PC right now.

I understand that telemetry can be necessary for developers to improve their apps, but I find this to be pretty unacceptable when location information is sent without clear communication. You might want to consider opting out of telemetry if you value your privacy, or if you are making personal NSFW AI content, for example, and don't want it tied to you personally or to get sued by some celebrity in the future.

I didn't know this until yesterday, but Gradio sends your actual IP address by default. You can paste that code link from their repo into ChatGPT-4o if you like. Gradio telemetry is on by default unless you opt out. Search for ip_address.

So if you are using Gradio-based apps, they are sending out your actual IP. I'm still trying to figure out whether the "Context.ip_address" they use bypasses a VPN, but I doubt it; it just looks like the public IP is sent.

Luckily they have the decency to filter out "str" and "dict" values and set them to None, since those could carry sensitive info like prompts when using kwargs. But there is nothing stopping someone from modifying that code and redirecting the telemetry with a custom Gradio build.
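To make the filtering described above concrete, here is a minimal Python sketch of that kind of scrubbing. It is not Gradio's actual code: the function name scrub_kwargs and the sample payload are made up for illustration, and the only behaviour taken from the post is that str and dict values get replaced with None before anything is sent.

```python
def scrub_kwargs(kwargs):
    """Return a copy of kwargs with every str or dict value replaced by None.

    Hypothetical helper, not part of Gradio. It only mirrors the behaviour
    described in the post: free-form values that could hold prompts are dropped.
    """
    return {
        key: None if isinstance(value, (str, dict)) else value
        for key, value in kwargs.items()
    }


# Made-up payload for illustration only.
payload = {"prompt": "a cat in a hat", "steps": 30, "settings": {"seed": 1}, "live": True}
scrubbed = scrub_kwargs(payload)
print(scrubbed)
```

Numbers and booleans pass through unchanged, and the original payload is left untouched. A modified build that skips a step like this would forward the strings as well, which is the scenario the post is worried about.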

This has already been done and tested. I was talking to a person on Discord, and he tested this with me yesterday.

I used a junk laptop, of course. I pasted in some modified telemetry code, and he was able to recreate what I had generated by inferring things from the redirected telemetry info. It wasn't exactly what I made, but it was still disturbing and too much info imo. I think he is a security researcher, but I'm unsure. I've been talking to him for a while now, and he basically has Kling running locally via ComfyUI, so that was impressive to see. Anyway, he said he had opened an issue, but Gradio has a ton of requirements for security issue submissions and he didn't have time.

I'm all for helping developers with some telemetry info here and there, but not if it exposes your IP and exact location...

With that being said, the Gradio telemetry code in analytics.py is fairly hard for me to decipher, and ChatGPT doesn't have the context of the other files (I am about to switch to that new Cursor AI app everyone is raving about). Without knowing the inner workings of Gradio and following the imports, I'm unsure exactly what it sends, but it definitely sends your IP. Some of the data looks like it is about Gradio blocks (the Gradio HTML stuff, not AI model blocks), along with a bunch of other things about the model you are using. All of that can easily be modified using kwargs and then redirected if a custom Gradio is swapped in or requirements.txt is adjusted.

The IP address telemetry code should not be there imo, to at least make this more difficult to do. I am not sure how a guy on Discord could infer what I was doing from telemetry alone. Because he knew what model I was using, and knew the difference in blocks, I suppose? I believe he mentioned weight and bias differences.

OPTING OUT: Opting out of telemetry on Windows can be more difficult, as every app that uses a venv is its own little virtual environment, while on Linux or Linux Mint it's more universal. If you add the lines below to activate.bat (under /venv/scripts/ in your AI app's folder) on Windows, you should be good, apart from Windows' own telemetry. Add them to any activate.bat and to your main Python PATH environment as well, just to be sure. Note that export is shell syntax for Linux; in a Windows .bat file the equivalent is set NAME=value (for example, set GRADIO_ANALYTICS_ENABLED=False):

export GRADIO_ANALYTICS_ENABLED="False"

export HF_HUB_OFFLINE=1

export TRANSFORMERS_OFFLINE=1

export DISABLE_TELEMETRY=1

export DO_NOT_TRACK=1

export HF_HUB_DISABLE_IMPLICIT_TOKEN=1

This opts out of both Gradio and Hugging Face telemetry. Hugging Face also sends quite a bit of info without you really knowing, and even sends out some info on what you have trained on. Check hub.py and hf_api.py with ChatGPT for confirmation. This applies if diffusers is being used or imported.
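If editing activate.bat or .bashrc feels uncertain, one alternative is to set the same variables from Python at the very top of a launch script, before gradio, diffusers, or huggingface_hub are imported. This is only a sketch: the variable names are the ones listed above, apply_opt_out is a hypothetical helper name, and it assumes the libraries read these variables at import time or later. Keep in mind that HF_HUB_OFFLINE and TRANSFORMERS_OFFLINE also block model downloads, so leave those two out if the app still needs to fetch weights.

```python
import os

# Variable names and values copied from the opt-out list above.
OPT_OUT_VARS = {
    "GRADIO_ANALYTICS_ENABLED": "False",
    "HF_HUB_OFFLINE": "1",
    "TRANSFORMERS_OFFLINE": "1",
    "DISABLE_TELEMETRY": "1",
    "DO_NOT_TRACK": "1",
    "HF_HUB_DISABLE_IMPLICIT_TOKEN": "1",
}


def apply_opt_out(env=os.environ):
    """Write every opt-out variable into env, overriding existing values."""
    for name, value in OPT_OUT_VARS.items():
        env[name] = value
    return env


# Call this before importing gradio, diffusers, or huggingface_hub.
apply_opt_out()
```

Setting the variables inside the process means they apply no matter which venv or shell launched the script, at the cost of having to edit each app's entry point.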

So the CogVideoX you just installed, which required you to pip install diffusers, is likely sending telemetry right now. Hopefully you add the opt-out code on the right line, though. Even being what I would consider fairly deep into this AI stuff, I am still unsure whether I added it in the right spots, and ChatGPT contradicts itself when I ask.

But yes, I put all of this in activate.bat on the Windows PC and I'm still not completely sure. Nobody's going to tell us exactly how to do it, so we have to figure it out ourselves.
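One way to reduce the guesswork is to check, from inside the app's own environment, whether the variables actually arrived. The sketch below only reads environment variables and makes no network calls. missing_opt_outs is a hypothetical helper name, and the expected values are the ones from the list above. It confirms that the variables are visible to Python, not that any particular library honours them.

```python
import os

# Expected values, copied from the opt-out list in this post.
EXPECTED = {
    "GRADIO_ANALYTICS_ENABLED": "False",
    "HF_HUB_OFFLINE": "1",
    "TRANSFORMERS_OFFLINE": "1",
    "DISABLE_TELEMETRY": "1",
    "DO_NOT_TRACK": "1",
    "HF_HUB_DISABLE_IMPLICIT_TOKEN": "1",
}


def missing_opt_outs(env=None):
    """Return the variable names whose value differs from the expected one."""
    env = os.environ if env is None else env
    return [
        name
        for name, value in EXPECTED.items()
        if env.get(name, "").lower() != value.lower()
    ]


missing = missing_opt_outs()
if missing:
    print("Not set as expected:", ", ".join(missing))
else:
    print("All opt-out variables are set.")
```

Run it with the venv's own python after activating, so it sees the same environment the app would.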

I hate to keep this post going, sorry guys, apologies again, but I feel this info is important. The only reason I confirmed Gradio was sending out telemetry is that the guy I talked to had me install Portmaster (GitHub), and I saw outgoing connections popping up to "amazonaws.com", which is what Gradio telemetry uses if you check that code. That domain is also used by many other things, so I wouldn't have known otherwise. Windows Firewall doesn't have the ability to monitor in real time like these apps do.

I would recommend running something like Portmaster or WFN firewall (buggy; use 2.6 on Win11), both from GitHub, to monitor your incoming and outgoing traffic, or even Wireshark to analyze packets if you really want to get into it.

I am an identity theft victim and have been scammed in the past, so I am very cautious, as you can see... and I see customers of mine get hacked all the time.

These apps have popups that let you block traffic on incoming and outgoing ports in real time, which gives more control. It sort of reminds me of the old-school days of the ZoneAlarm app, in a way.

Linux opt-out: Linux Mint users who want to opt out can add the code to their .bashrc file, but tbh I'm still unsure if it's working... I don't see any popups now, though.

Ok last thing I promise! Lol.

To me, this AI stuff is sort of a hi-res extension of your mind, just like a phone is (although a phone is a low-bandwidth, very slow connection to your mind, of course). It's a private space, not far off from your mind, so I want to keep out the worms that are trying to sell me stuff, track me, fingerprint my browser, and make me think I shouldn't care about this while they keep tracking me.

There is always the risk of scammers modifying legitimate code like in the example here, but it should not be made easier by the IP address code (btw, the guy I talk to is not a scammer).

TL;DR: it should not be so difficult to opt out of AI-related telemetry imo, and your personal IP address should never be sent. Hope this is useful to someone.

LORA Block weight for Flux (inspire pack in comfyui) : face (old.reddit.com)

The original was posted on /r/stablediffusion by /u/Excellent-Attempt-40 on 2024-09-01 09:22:51+00:00.

Hello! I am just a regular user without any knowledge of what's going on under the hood, but I wanted to share something about LoRA Block Weight: I think I found the face layer.

Since this topic came up, I had been thinking about this node pack and wanted to run some tests. First, I checked that the LoRA was actually rendering the face correctly before testing. Then I loaded the node and used the first presets. I generated one picture with each of these presets: DBL-ALL; DB-FRONT-7; DBL MID-6; DBL-TAIL-6. I used both a fixed seed and a random seed in the LoRA block weight node and got the same results.

Here are the tests made with a Gigachad LoRA, but I tried other LoRAs with the same results. I won't post every picture I made, but you can try it yourself and maybe discover other things :)

Without the last block, NO Gigachad

1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1 : Gigachad is here

About the single

Without DB but with ALL the singles: Gigachad is back

I tried the other single presets but I don't get his face anymore, so this is the point where I can't explain what's happening and I will need to do... more tests (removing one block at a time). I will update when that is done :)
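The "removing one block at a time" test can be scripted so the weight strings don't have to be typed by hand. This is a rough sketch with assumptions: it reads the vector in this post as one leading value followed by 19 block weights (20 numbers in total), and block_vector and leave_one_out are hypothetical helper names, not part of the Inspire Pack. Change n_blocks if the node expects a different length, for example when the single blocks are included.

```python
def block_vector(n_blocks, active, lead=1):
    """Build a comma-separated weight string: lead value, then one 0/1 per block."""
    weights = [lead] + [1 if i in active else 0 for i in range(n_blocks)]
    return ",".join(str(w) for w in weights)


def leave_one_out(n_blocks):
    """Yield (removed_index, vector) with every block on except one."""
    for removed in range(n_blocks):
        active = set(range(n_blocks)) - {removed}
        yield removed, block_vector(n_blocks, active)


# Only the last of 19 blocks enabled, the pattern reported to keep the face.
face_only = block_vector(19, {18})
print(face_only)

# One vector per run, each with a single block switched off.
for removed, vector in leave_one_out(19):
    print(removed, vector)
```

Each printed line can be pasted into the block weight field for one generation with a fixed seed, which makes it easier to see which removed block makes the face disappear.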

So, what's the point? It needs more tests, but it could be interesting when you want to combine LoRAs (for example, a style and a character), or if the LoRA was trained on specific clothes you don't want to get (for example, a dataset with only bikinis).

TL;DR: use 1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1 with this node pack in ComfyUI and you will get the face your LoRA was trained on. The single part needs more tests, and I don't know yet if it can be useful.

I think there is more to find with this, so if you try it and get something interesting, please share it here!

Thanks for reading

Flux is a whole new level bruh 🤯 (i.redd.it)

The original was posted on /r/stablediffusion by /u/theAstroBruh on 2024-09-01 09:20:26+00:00.

Flux finetunes of my cat (last 3 images real) (www.reddit.com)

The original was posted on /r/stablediffusion by /u/Puzzleheaded_Cow2257 on 2024-09-01 06:58:38+00:00.

Latest version of ComfyUI now includes native support for upgraded Union Controlnet Pro (pose/depth/canny/tile/+others) - workflow included (i.redd.it)

The original was posted on /r/stablediffusion by /u/eesahe on 2024-09-01 08:46:40+00:00.

This week in r/StableDiffusion - all the major developments in a nutshell (old.reddit.com)

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-01 08:41:29+00:00.

⚓ FLUX UPDATES: Various improvements and insights for FLUX model usage shared:
- Joy Caption tool updated with batching support and optimizations
- New insights on FLUX's semantic understanding and training techniques
- Techniques for generating more realistic images using FLUX
⚓ California AI Image Ban: Proposed bill AB 3211 could significantly impact AI image generation:
- Requires robust watermarking technology for AI-generated images
- Potential to effectively ban most existing AI image generation tools in California
- Supported by major tech companies, raising concerns about regulatory capture
⚓ Juggernaut XI: Enhanced SDXL model released with improved features:
- Better prompt adherence and expanded dataset
- Enhanced style control options
- Now available for public use
⚓ FLUX.1 ai-toolkit UI: New Gradio interface for easier FLUX usage:
- Drag and drop image functionality
- AI caption generation option
- No code/yaml required
⚓ CogVideoX-5B: New open-source text-to-video model released:
- Generates 6-second, 720x480 videos at 8 FPS
- Handles complex prompts up to 226 tokens
- Optimized for consumer GPUs
⚓ Melyn's 3D Render: New SDXL LoRA model for 3D-style renders:
- Trained on creator's personal 3D artwork
- Compatible with SDXL
- Future FLUX Dev version planned
⚓ FluxForge v0.1: Tool for searching Flux LoRAs updated:
- Searches Civitai and Hugging Face repositories
- Updates every 2 hours
- Plans to add platform filtering
⚓ Regional Prompt Support: New Photoshop extension for ComfyUI integration:
- Custom nodes for Photoshop integration
- Text layer support for regional prompting
- Compatible with dense diffusion and ComfyUI's masked condition
⚓ GenWarp: AI model for generating new viewpoints from a single image:
- Works on both in-domain and out-of-domain images
- Uses diffusion model to learn geometric relationships
- Can be used for 3D reconstructions
⚓ Flux Latent Detailer Workflow: Experimental ComfyUI workflow shared:
- Enhances fine details using latent interpolation
- Option to vary images while maintaining quality
- Uses FLUX dev version and specific safetensors
⚓ FLUX LoRA Showcase: Various new LoRAs highlighted:
- Convenience Store CCTV style
- Moody Photography style
- PHLUX (Extreme Realism)
- PS1/PS2 style
- TTRPG Maps
- Naoki Urasawa Manga Style

Loving Vincent - Flux Lora (www.reddit.com)

The original was posted on /r/stablediffusion by /u/dal_mac on 2024-09-01 03:22:00+00:00.

Language-Free Training of Text-to-Image Generation Using CLIP and a Vector-Quantized Diffusion Model (i.redd.it)

The original was posted on /r/stablediffusion by /u/NunyaBuzor on 2024-08-31 23:10:30+00:00.