This lora is meant to be used as a helper for Wan's under-trained genitals. It was trained with the T2V model and diffusion-pipe on a dataset of 50 images and 30 videos with the faces cropped out for 20k steps, it's quite over trained but works well with I2V. You may need to reduce strength to 0.6-0.8 for best results.
The dataset uses 2 models, one with an innie pussy (small labia) and one with an outie (large labia), for both consistency and variety.
The Lora was trained on nude poses in the following positions:
Standing front and rear.
Sitting/Lying with legs spread open.
Bending over or on all fours from behind.
So it should help with genitals in any of those positions. The trigger words are Innie vagina or Outie vagina, depending on your preference.
I've planned this lora for quite a while with a very carefully selected dataset, and I'm excited to release it here to the community. Please upload your creations and tag this model so they are featured in the gallery! I'll buzz and feature the best gens.
For other trainers interested in the details, I set up this lora so that it would train on high res images and low res videos at the same time, the images for the detail and the videos for the movement. This seems like an effective technique for Wan as it's good with motion so the images don't degrade the lora much and provide great detail where needed. Using low res videos also greatly speeds up the training. Learning rate was 2e-5.
Description
Initial version wan T2V/I2V
FAQ
Comments (88)
Oh, this looks really useful. Thanks for this, I'll check it out and post some results later.
This looks great, I think another pain point is genital interactions like pulling out
Thanks! Yeah that is true, like it doesn't know where the end of the penis is, though I've seen more penis/sex focused loras than vagina ones so that's why I wanted to train this one. It would probably be harder to train but maybe a similar technique could help for hardcore/penetration vids.
Thank you! I will definitely consider it, I have already been asked a few times. I don't use 1.3b much personally but with Vace I can see the value. Plus I've done all the hard work so it would just be the compute time. Give me a week or 2 minimum!
@definitelynotadog Thank you very much for your feedback. I'm very happy with this news!! We will wait anxiously! The 1.3B WAN is great and very fast for generating videos on modest computers, but it needs great quality loras like yours to generate nsfw content. I'm very happy to see producers taking an interest in making loras for version 1.3b!
hands down the best main preview video ever. The move from lying down to right in the cooter is hilarious
Haha yep as soon as it gen'd I knew it would be the main video 😂
This lora gives me much better and consistent results than the other wan pussy lora, which gives me weird body horror results.
Hi ! Nice work according to the examples, I can't wait to test it. However, I don't understand the difference between innie pussy and outie pussy.
It has less effect with I2V than T2V, I guess because the input image has an influence over prompt, but it's basically small labia or large labia to match the 2 types my dataset was trained on. Just use innie or outie as trigger words.
@definitelynotadog Thanks for your reply.
Do you train locally on your computer or in the cloud?
If locally, what GPU do you have?
What are you using for training:
musubi-tuner-gui:
https://github.com/bmaltais/musubi-tuner-gui
musubi-tuner-wan-gui:
https://github.com/Kvento/musubi-tuner-wan-gui
ai-toolkit:
https://github.com/ostris/ai-toolkit
Could you share some example prompts you used for your video samples?
By the way, I'm a bit surprised that almost no creators include their prompts.
The full workflow including prompts should be included in the metadata if you download the videos. It was trained using diffusion-pipe on a 5090 on the cloud.
Thanks for reply, How to read this metadata ? best way ?
@CyberAImania Drag and drop the mp4 into comfyui.
Holy shit this looks amazing, i have no idea how you did it, but i i am you did
Thanks for this and the training deets. How low res is "low res"? Like 250ish? And how many images/vids?
Yeah about 256px. Images were 1280px. 50 images/30 vids.
Nice. 250ish seems like a sweet spot for efficient movement training. 1280 is high! Maybe makes sense though since the part of the image you care about for this lora is pretty small. I've been doing 720x720 for images, maybe I should try higher.
@logenninefingers888 Yeah the images train really fast even at that res, I think it's worth doing. If you use professionally shot images you get way more detail than even a 1080p video frame.
I need to ask, when you train multiple videos, how do you use that for the training? Raw video? Or extract the frames, train for X amount of epochs, Swap dataset for the next video's frames, resume training? Or how is the dataset setup for you for doing multiple videos? As i got an idea for a video model, and will require 10's of few second clips.
A dataset is a folder with all the videos you want to train. In my case this was 30 mp4's each 5 seconds long. The lora trainer then caches those videos into latents which it trains on. An epoch represents 1 training pass of your whole dataset. You don't need to swap anything or pause training, If you have a specific concept you want to train 10-20 videos is usually fine. I trained on the same images and videos for 219 epochs.
@definitelynotadog Gotcha. So what if i'm going to train long fights that can be several minutes long, then i guess it's best to split those up to as long as i have video memory to cache? And do you extract frames? Or raw mp4?
@Therma Yeah you have to split them up, each video needs to be about 5 seconds long otherwise you would need huge amounts of VRAM. Use some editing software to extract the clips you want as mp4s. The number of videos doesn't increase VRAM usage, just the length and resolution of each individual video.
@definitelynotadog Gotcha. And if the video clip i want to train is longer than 5 seconds, just repeat the other text file for the split half for it's own text file? Or train as if it's a wan prompt using punctuation to end one sequence before starting the next?
@Therma Try and split up longer videos into multiple, each with their own caption file. Usually you set in your config how many frames of each video to use, so you would set it to 81 frames for example to use the first 81 frames of each video in your dataset, if you had longer videos it would ignore anything after that. Wan works best at 5sec/81 frames so I don't think there's much point training longer. If you're willing to rent out a H100 with 96GB of VRAM you could try longer videos but you probably don't want to do that for your first lora!
@definitelynotadog Gotcha. So i if i for instance used ffmpeg to export all frames as png's, and named the different videos as "frame000, frames000, framer000" as an example which would allow me to easily cache and process 1 frame at a time, would that work? As i'm 1 epoch away from finishing 25 epocks of 1730 frames from 3 clips, some over a minute each. would that then not work? And wan's requirement is "absolute", whether it's extracted as frames or as a video, each "segment" can't be longer than 5 seconds?
If so, if there's a motion i wish to train being longer than 5 sec, how would i train that? Just repeat the same dataset text file for both of the half split videos? Or are my wan loras going to be limited to only get to use 5 sec max clips regardless if split or not?
Just so i understand the core limitations/restrictions the wan training requires and works with.
@definitelynotadog Ohh, so that's what the frame numbers are for in the config. It's being told how many frames each video is? And how can it tell which is which? Cause if 2 clips are equally long, say 120 seconds, won't that mix up with other clips also being 120 frames long and mess up the sequence of split clips?
And iirc i read that it's faster/easier on the memory to make each frame into png's instead? Or is video files best for consistency and full-er context?
@Therma It's telling the trainer how many frames to use from each video. I don't think exporting as pngs will work, I use mp4s encoded with H264. Wan works at 16fps, so 81 frames = 5 seconds. So for example using diffusion-pipe trainer, if you had a video 10 seconds long @ 24fps in your dataset, it would first convert the video to 16fps then extract the first 81 frames from it as a latent, discarding the rest of the video. So yes if you want to train a 10 second video, you would split it into two 5 second videos and caption each one separately. Note frames to extract must satisfy the equation n * 4 + 1, hence 81 in this case.
@definitelynotadog Ah ok. Gotcha. I shall experiment with multi second videos by splitting them at half each +1 on each (same with clips longer than 10 sec, split to 81+1 frame clips, and then try first with 1 text file duplicated for each split segment describing the same for all for that split clip, and a second training where i try to describe what happens in clip 1 part 1, then a new with what happens in part 2, and so on.
So now that clip assembly will be tested, for the dataset text files, did you write the text for each clip as if describing everything that happens down to the smallest detail if i'd want it to be as dynamic as possible? Or simply just state
"golden labrador, runs towards a skateboard and jumps on it, street at sunrise"?
Instead of a larger description like this.
"widescreen shot of a golden labrador standing in front of a skateboard. The golden rabrador starts to run towards the skateboard, and jumps gracefully on top. It achievines a perfect landing, and camera follows the dog as the labrador rolls down the street."
thanks for elaborating on your training process @definitelynotadog
definitelynotadog Long random realization from my side i think, so if i want to train something that might be 20 seconds long, all i need to do is either to extract each frame as png for the entire dataset, or split into 1-5 seconds videos if doing video files as far as my vram goes, and let it rip?
The men are complaining! We also have genitals :-)
Have you not seen my other wan lora? 😁
@definitelynotadog Ah... you mean this one here?
https://civitai.com/models/1368557/masturbation-cumshot-for-wan-i2v-480p
@CyclopsGER Yes! But you make a good point I didn't specify male/female. I just thought there was enough penis related loras out there already and it probably wouldn't work as well to mix the two into one lora.
does this make genitals better for undressing loras ? cheers
how good does it work with I2V?
I think it works well, the samples I posted are all I2V
I've found a couple of things that help, firstly you want to downscale your image to your wan resolution using a lanczos downscaler, I use KJNodes Resize Image. Then you want to make sure teacache is not set too aggressively, I set mine to 0.2 strength starting at 20%. But the more you try and speed up Wan the more the consistency will suffer in my experience.
To add to this because I just did a test, using a higher CFG also degrades input image, CFG 3 was much better quality than CFG 5
@definitelynotadog Thanks. I wil try.
Anyone have good example prompts for this? Seems like none of the posts include prompt info.
确实下载了,没效果,没有提示词案列
I've added the prompts to all my sample videos, the comfyui workflow was attached but now you can quickly look at the prompt in civitai.
I hope the next version will be optimized and the faces will be cropped when training the pictures. Cuzafter I added this lora, the faces will change and the consistency of the characters will decrease.
The faces were cropped out of all the pictures and videos in my dataset. It didn't change consistency of the faces when I tested it and it seems fine looking at the videos people have posted so far.
@definitelynotadog The face of a person will change. Even if it is made with the last frame, the face will become more and more different. And if there is a light or an obstruction in front of the face, such as a microphone, the face of the finished product will change during the first production. It will not become ugly, but it will not look like the original character.
@fctq9169 Is that I2V or T2V? it's probably over-trained T2V so maybe that could happen. But I2V should be fine and looking through the videos posted here the faces seem fine. I think there might have been 1 or 2 faces visible in my entire dataset of nearly 100, where I couldn't avoid it, so it really shouldn't be a problem with my lora. If you have any examples or a workflow where this happens send it to me and I'll take a look.
I'm not saying it's impossible but looking at everyone's posts with this lora and my own tests I've not seen the issue yet.
@fctq9169 is there any way to keep original face consistent?
@definitelynotadog I2V
Maybe it's because the character has bangs or wears glasses? Or she made a change in head pose (for example, she raised her head) so the face changed
@fctq9169 the original face always change to another specific face
I am using promts similar to your promt "She spreads open her legs revealing her outie vagina" but only like 1/5 times she actually moves the legs in a way that you see her vagina. Any idea on how to make this more consistent?
Similarly, I tested 10 images. Only one had legs spread open. But she was wearing underwear.
One image of a person sitting did not successfully spread legs. But she lifted up her pajamas and her vagina was seen.
Then I did the same thing twice. The last one even had a vagina. One of them suddenly turned around with her hips facing me. In the rest of the images, the person did not move. So I also want to know how to trigger the correct opening of legs.
Maybe. The author's description of the posture, "Sitting/Lying with legs spread open", you need to describe it separately.
1. Describing her posture with a subject
2. Legs spread open with a subject
EX: [A nude woman] lying on a bed. [she] legs spread open.
I used to use "A nude woman lying on a bed with her legs spread." didn't work because "legs spread open" has no subject.
You can try my way of saying it.
@fctq9169 Yes, I usually prompt the position the woman is currently in, followed by the pose I want her to move to. But it can be very dependent on the starting image, often that can influence the movement as much as the prompt.
I'm sorry,Sir. Maybe I'm blind or too stupid.
I tested 10 images before, but I didn't find that I had a language grammar error.
After adjustment, all of them were successful. But I have one thing to ask.
How to make the screen perspective not move? I added "The perspective of the video does not change from the beginning to the end. You can see the woman's whole body". But not every time I succeeded in not moving the POV
I noticed the camera zooms in more with the Innie vagina trigger word, so it might have confused "innie" for zoom in, so try outie vagina if you're having that problem.
@definitelynotadog
Today I tested two images. The same prompt words. The difference is Innie vagina or Outie vagina. As a result, both images have a close-up. I don't know if it's because the proportion of the person in the image is relatively large, that is, the close-up. But it is true that after the other images were changed from Innie vagina to Outie vagina, there was no problem of the camera zooming in.
Do you have anything like this in the pipeline for male models/genitals? It's severely missing and often in my scenes containing nude male models, the penis transforms into a hand or other crazy stuff! It should be all kind of penis types, big/small, erect/flaccid, etc.
Report again
"sitting with legs spread open" for the posture of having one's legs crossed/one leg over the other
or sitting sideways with only one leg showing. Or holding a pillow in her hand (she only raised her hands and stretched). There is a high chance of failure. Similarly, if it is a lying position, there will be similar situations.
Also, I would like to ask if this model can make the vagina spray or peeing
Awesome, Lora! Fingers crossed there's a breats helper too!
Great model, does exactly what it says! It's also helpful for breasts/nipples as Vanilla Wan isn't great at that. A suggestion for future training - get a diversity of breasts/nipples as a lot of them look the same!
Thanks! You're right, because it was trained on images/videos of only 2 women for consistency, the breasts and nipples are good quality but limited. It would be worth training a separate lora for breasts with more variety.
is it pubic hair or razor burn that shows up? It's like purple discolored above. Working on prompting it out now, lmk if you know how
I2V If the LoRA weight is set too high, Asian faces will turn into Western faces.
How you guys manage to keep the face consistent.Sometimes the character's face just change.What's going on?
Dayum, it looks like somebody tried to open a can of corned-beef with a firecracker.
I absolutely know what you mean.....or a cancerous growth, often in dark reddish or even pink colors. If someone knows what (setting) causes this, let us know
OP, this lora is 100% critically important. Excellent work!
Is there any consideration of a V2 at some point?
BTW: Nice choice with Maria R as the donor innie! Right up there with Mary Q.
do you know 90% of women only shave their pussies when having sex ? why didn't you include hairy ones ? shame
Because if I mix some hairy ones in nearly every gen became hairy, even when correctly captioned in the dataset, I already tried it. It would need to be a separate lora.
I'm not against doing that but this lora was time consuming and expensive to train, so another version is not high on my list right now.
Where can I find a workflow to show where this hooks in? The official guides obviously don't show it.
Ive used "LoraLoaderModelonly" for both, and hooked them into the "ModelSamplingSD3"
This shit has so much white pussy it's making black pussy white. Add some colored women to this dataset please!
lol can you share some failed experiments?
No thanks.
Does it work okay on a pc with 16GB of ram?
hey bro do you mean video memory? RAM doesnt matter much in these, its actually VRAM that comes into play. (the ammount of memory you gpu has 16gb's of vram is more than enough, depending on your workflow and precision 8-10 gb is more than enough already. (Mine has 12 and its reasonably enough for almost everything)
Is it possible to have consistency in the vagina type ? I need this for an AI model.
It's good, but I use it at 0.35 weight which is enough to fix pussies. Anything above 0.6 and up it acts like a breast enlarger.
great result, did you run the training locally? if yes, could you share some details what it takes to train such model in terms of resources and how time consuming it was? thanks for the model by the way))
Great work! Can you please make it compatible with the on-site I2V wan? For some reason it doesn't appear in "add" option - i.e. in the list of available options compatible with I2V... Many thanks in advance
@definitelynotadog needs to upload the same file again as i2v, there is no setting to otherwise add it at the moment.
Hello, the program doesn't work for you, please fix it
Details
Files
genitals_helper_v1.0_e219.safetensors
Mirrors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
i2v_t2v_genitals_helper.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
highnoise.safetensors
genitals_helper_v1.0_e219.safetensors
wan21_i2v_Innie-vagina.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
145_genitals_helper.safetensors
LoRA_GH.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
GenitalsHelper.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
fen_gen_helper.safetensors
Female_Genitals_Helper_v1.0_I2V__e219.safetensors
Female_Genitals_Helper_v1.0_T2V__e219.safetensors
genitals_helper_v1.0_e219.safetensors
升职器H.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
g-helper.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
展示yindaoH.safetensors
genitals_helper_v1.0_e219 (1).safetensors
LoRA_GH.safetensors
genitals_helper_v1.0_e219.safetensors
WAN2.2genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
Wan22_Gen_helper.safetensors
g_helper.safetensors
helper_v1.0.safetensors
fen_gen_helper.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helpe.safetensors
Wan_genitals_helper_lora.safetensors
wan_show_genitals.safetensors
pussy_helper.safetensors
genitals_helper_v1.0_e219 (3).safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
genitals_helper_v1.0_e219.safetensors
完美阴户.safetensors
genitals_helper_v1.0_e219.safetensors