(ALL previews are I2V. V2 previews made from lazy Chroma outputs and some others I quickly found on CivitAI and Booru images. Most previews are single-shot gens but some were selected as the best of 2~3 generated samples)
This LoRA is mostly meant to animate good sexual input images (I2V) with basic sex animations using very simple and short prompts and/or to assist other motion-specific loras.
This LoRA wants sex. It wants it so much that sometimes it gives you 'ok' animations even without a prompt. Its extremely biased in that regard so its really not good for advanced use cases such as changing characters positions or scenes.
If you give it a sexual scene input image that you can tell with just a glance 'what kind of movement the characters should be doing' - than this LoRA should be able to figure that movement on its own with just the correct trigger words.
Audio: not trained at all. It contains audio layers from other loras merged together then injected into this one.
V1: may be better at anime
V2: better at realism, less over-fitting overall and broader knowledge
Terminology: penis/pussy/fellatio/insert instead of dick/cock/vagina/blowjob/penetrate
Trigger words with notes (colored by how good the model is at it in a I2V context):
having sex (this one is powerful due to its versatility)
having sex in the cowgirl/reverse cowgirl/congress/reverse congress/mating press/missionary/doggystyle position (if only 1 of these scenes exists in the input image than you should use this instead of just 'having sex')
having sex while lifted in the air (for scenes where the female's legs are in the air and should dangle around during sex)
performing fellatio (not very good. you should probably pair it with a fellatio-exclusive lora)
performing a handjob with her right/left hand
performing a handjob with his right/left hand on his own penis (male masturbation. V2 exclusive)
masturbating with a dildo
masturbating with the fingers of her right/left hand
thrusting in and out of her pussy (helpful to complement sex and masturbation scenes)
causing a stomach bulge with each thrust (only knows a little bit of this so if you want this concept you should pair it with an actual stomach bulge lora)
spreading her pussy with both hands // the fingers of her right/left hand (not good at this either but it does have some knowledge)
pubic hair // shaved pubic hair (this is useful if you pair it in T2V mode with other loras. you can add it to the negative prompt on workflows with CFG higher than 1 to significantly reduce the chance of it appearing)
horsedong (not much but some e621 data is in my datasets and I used this word instead of 'penis' in cases where it made sense. it can help in furry animations that contain this kind of genital)
inserts into her pussy (this was used for penis, fingers, dildos and other objects)
comes out of her pussy (did not use this for penis removal but it may help with that. it was mostly for fully/half inserted big objects coming out from a pussy - to teach the model pussy physics of such animations)
Extra trigger words (mostly for negative prompts in CFG>1 workflows):
static image, no movement. (used on images during training)
the camera is slowly panning left and right // up and down // in circular motions
Downsides/Weaknesses:
Too much data to fit in a rank 128 LoRA, causing over-fitting. I believe what I tried to accomplish may be possible at rank 256 with batch_size = 16 training but that's something I can't do.
In I2V mode this lora will ignore most of your prompt if its not something in the available trigger words. It highly depends on the input image and prompt but this can be heavily mitigated by not using distillation or reducing it and using CFG>1.
Camera panning bias. It likes to move the camera around a little even when not prompted for it. Can be mitigated with negative prompting - see 'Extra trigger words' section.
Breast movement bias so strong it may cause flat chests to move when they should not to the point it may even affect males depending on the input image and your prompt. May be slightly mitigated with negative prompting + CFG>1.
Cowgirl sex scenes with only the woman moving are very difficult to generate due to the lora's 'having sex' bias which in turn means 'man thrusting a penis inside a pussy' - as far as the lora understands it. This happens even without mentioning 'having sex' in the prompt.
Trained on very, very little anal sex data so it may hurt anal animations a bit. V1 probably does this more than V2.
Cant draw good genitals from scratch (T2V).
Recommended workflow info:
Option A (RECOMMENDED): 3-stage workflow with input image proportionally resized to a max dimension of 512.
Option B: 2-stage workflow with input image proportionally resized to a max dimension of 1024.
For input images with very complex backgrounds: use Option B, otherwise stick to Option A. Doing Option B helps avoiding unwanted motion of background objects and stuff because LTX sometimes does that if it does not fully understand the input. But this lora performs the best motion in 512px buckets so avoid Option B.
For better I2V results avoid distillation or set it to just 25% + CFG=1.5 + 12~16 1st stage steps (2nd stage steps should be 75% of stage1 and 3rd stage should be 50%)
Description
Initial release
FAQ
Comments (21)
letss gooooooooo
gay?
Sorry no. Focuses 100% on females from various races and hentai in many different styles - from grayscale sketch all the way to full anime style. But the LoRA struggles immensely with drawing genitals from nothing so if you just use it as a motion assistant and you have a good starting point (image) you can still animate it properly even if both characters are male - I think.
@huj0ps1t6 ok, thanks. If you do an update though please remember us gays
@ForeverNecessary737716 you will not be left behind
you are
work perfect. Better then others. I tested 20 video
Thank you for the kind words. Still its far from perfect. There are many concepts in my datasets that the LoRA simply refuses to learn and despite the fact I changed my datasets to enhance genital learning - it still struggles to learn both pussy and penis at the same time. I barely tested it myself so I do not know how good or bad it truly is - but from the little testing that I did I can say there are so many flaws with it if I started writing about them all this comment would turn into a LTX-style prompt
But it best at the moment.
I have to agree, easiest to use and just "works". relative to the others I have tried anyways.
Very interesting, your samples look like Live2D animations, which is an interesting effect.. any plans for a V1.1?
Yes I'm planning on making a new version with major changes but it will take me about a week - all things considered. Plus, I won't release it if somehow it turns worse than this. No promises but I'll try
@huj0ps1t6 awesome, looking forward to it.
Its been 9 days and I said it would only take a week so I feel like I should update on this.
I continued training on top of the released version and came to realize it contains heavy biases and is very overfit. Not possible/ideal to even attempt to fix that so I'm training again from scratch and this time I'm using a much lower learning rate. Which means it will take much longer than v1 did.
On top of that I'm gonna experiment with trying to train single-concepts loras on top of 'lora v2 - base' (once that's done) and merge them wisely into the base lora based on which blocks are affected more by each concept. All in all this is really gonna take some time and I can't even say with confidence it will be worth it - but I'll do my best.
@huj0ps1t6 No worries, there's so much stuff on Civitai to play around, we can keep ourselves entertained in the meantime ;) Best of luck for the new training, and thanks for your hard work!
Thank you for your efforts. I would like to ask: how much training data did you use in total during the training process?
This will be a bit long.
Ballpark of 300 images though they were mostly for teaching genitals - not sex positions. Though to be honest it doesn't seem like they were of much help. The model struggles to learn both penis and pussy at the same time and it didn't help that I was also teaching 'puffy pussy' and 'spreading pussy' in the mix.
For videos it was roughly 250 T2V videos with WAN-like captions then I duplicated those into a separate dataset and simplified their captions to be just mention 'motion/what happens', ex: 'having sex in the cowgirl position'. Those with short captions were trained exclusively in I2V mode while the others exclusively in T2V mode. For this to be possible I had to add a few lines of code into 3 files of the Musubi LTX-2 branch files.
I kept downloading more videos but I was lazy with the captions so I added the extra ones just to my I2V datasets.
At the end I still had ~250 T2V and ~400 I2V videos. 121 frames 24 fps - all of them.
Trained in multiple stages, first at LR=0.00008 gradient_accumulation_steps=1 for roughly 4 to 6 thousand steps initially using only [512,512] buckets for videos and both [512,512] and [1024,1024] for images. I ALWAYS train videos on FULL frames. I don't trust whatever Musubi is doing with other 'frame_extraction' modes.
I interrupted this run halfway to manually test it and found out that when used with the upscaler -> the upscaled video would look very bad. Kinda seemed to add 'noise'. Figured it was because I was not training the videos at higher res but with my 16GPU I'm kinda limited. So what I did was create 49 frames 24 fps clips out of all of my videos that could be used for [1024,1024], [1280,720] and [720,1280] buckets. Did this to both the T2V and I2V datasets and ofc I had to re-caption many of them because suddenly some parts of the captions no longer happened in the shorter clips. Added those to my training configuration at the mentioned buckets based on their aspect ratio and once again trained on full frames. This solved the upscaling problem.
At some point, I classified my videos in 'tiers' from 1 to 4 based on how visually pleasing they are to me and how good their animation is. I added extra separate datasets for tiers 2, 3 and 4 to serve as extra repeats for those 'better-looking' videos.
Also at some point, I wrote a custom scheduler that would multiply the base LR of a sample based on either the filename or dataset directory name. I2V datasets had higher multipliers and the files of tiers above 1 also had higher multipliers as their tier increases - something soft ofc - the max a multiplier could be was '2.0'. This was kinda like 'increasing repeats' for those files but by increasing their learning rate directly.
Finally, trained further with LR=0.00005 gradient_accumulation_steps=2 for an additional ~2000 steps (where a step here is actual 2 micro-steps cause Musubi counts a 'step' only when the weights updates - which with GAS=2 is every 2 actual steps). I wanted to train it for much longer but even this took me days on my 16Gb GPU.
Important: I used '--lora_target_preset v2v' which is mentioned in the musubi ltx-2 branch to be for 'video-to-video/IC LoRA'. Good thing I ignored that and went along with my guts because I'm pretty sure for NSFW the regular 't2v' mode wouldn't cut it. I know that from experimenting - during training breaks - with weights filtering and comparing other loras that people released on this site. My overall impression was that: every LoRA that contained the extra weights used in 'v2v' mode performed much better than those who did not.
That was even longer than I thought lol and I think I'm forgetting a bunch of things but this basically covers at least 80% of what I did.
@huj0ps1t6 Thank you so much for your detailed reply and explanation. It has been extremely helpful to me.❤
So far this is the best NSFW lora!!!
I am getting amazing motion at 0.50 using it on I2V. It generates the motion without manipulating the image. Somehow, this lora just knows what I want! Great job with this!
Yeah its way more overfitted than I thought.
May sound crazy but I only noticed that yesterday after training for another whole day and testing the results. I released it without realizing - had I noticed these biases earlier I may not had enough confidence to release.
There is very heavy cowgirl and breast bias in its current state - prompting for 'small breasts/flat chest' in T2V mode will still give you at least medium sized ones or it will be completely ignored.
Now to make a new version I'll probably have to either start over from scratch with my newly modified and more balanced datasets or use a very early checkpoint -.-
Details
Files
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
Mirrors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
helper.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1-mid_2527064-vid_2840118.safetensors
ltx23_nsfw_helper_multi_concept_lora_v1-mid_2527064-vid_2840118.safetensors