This LoRA can only do I2V of Frieren.
Jk.
Multi-concept experimental LoRA. Work in progress. Or not?. Don't expect it to become better because I'm already hitting diminishing returns.
(previews are not cherry picked and use super basic captions because part of the experiment is to see how well LTX-2 adapts to short captions - so yeah don't complain unless you can do better EDIT: kinda forgot to mention cause its extremely obvious but all previews were done in I2V mode - as recommended for this LoRA. Also it can do more than just hentai ofc)
Trained without audio but had audio weights (from other loras) injected at low, balanced ratios - post training. Mostly for automated female moans though that pretty much doubled the size of the file. Whatever.
Separate T2V datasets (with WAN-like captions) and I2V datasets (with very short, movement-focused captions) were used with modified training code to ensure I2V mode exclusive and 100% for I2V datasets.
Meant for 2/3-stage I2V workflows. Body horror with T2V - most of the time. Input should be proportionally resized with width = 512 for 3-stage workflows and 1024 for 2-stage workflows.
Terminology: penis/pussy/fellatio/insert instead of dick/vagina/blowjob/penetrate
Main triggers:
- having sex in the missionary/cowgirl/reverse cowgirl/doggystyle position
- penis thrusting in and out (forgot to include this in the previews lol)
Many concepts haven't been captured yet - you can see in the previews that masturbation and handjobs are not working at all depending on the scene and sometimes it works even without prompting for it. Also it sucks at fellatio.
Description
Initial release
FAQ
Comments (21)
letss gooooooooo
gay?
Sorry no. Focuses 100% on females from various races and hentai in many different styles - from grayscale sketch all the way to full anime style. But the LoRA struggles immensely with drawing genitals from nothing so if you just use it as a motion assistant and you have a good starting point (image) you can still animate it properly even if both characters are male - I think.
@huj0ps1t6 ok, thanks. If you do an update though please remember us gays
@ForeverNecessary737716 you will not be left behind
you are
work perfect. Better then others. I tested 20 video
Thank you for the kind words. Still its far from perfect. There are many concepts in my datasets that the LoRA simply refuses to learn and despite the fact I changed my datasets to enhance genital learning - it still struggles to learn both pussy and penis at the same time. I barely tested it myself so I do not know how good or bad it truly is - but from the little testing that I did I can say there are so many flaws with it if I started writing about them all this comment would turn into a LTX-style prompt
But it best at the moment.
I have to agree, easiest to use and just "works". relative to the others I have tried anyways.
Very interesting, your samples look like Live2D animations, which is an interesting effect.. any plans for a V1.1?
Yes I'm planning on making a new version with major changes but it will take me about a week - all things considered. Plus, I won't release it if somehow it turns worse than this. No promises but I'll try
@huj0ps1t6 awesome, looking forward to it.
Its been 9 days and I said it would only take a week so I feel like I should update on this.
I continued training on top of the released version and came to realize it contains heavy biases and is very overfit. Not possible/ideal to even attempt to fix that so I'm training again from scratch and this time I'm using a much lower learning rate. Which means it will take much longer than v1 did.
On top of that I'm gonna experiment with trying to train single-concepts loras on top of 'lora v2 - base' (once that's done) and merge them wisely into the base lora based on which blocks are affected more by each concept. All in all this is really gonna take some time and I can't even say with confidence it will be worth it - but I'll do my best.
@huj0ps1t6 No worries, there's so much stuff on Civitai to play around, we can keep ourselves entertained in the meantime ;) Best of luck for the new training, and thanks for your hard work!
Thank you for your efforts. I would like to ask: how much training data did you use in total during the training process?
This will be a bit long.
Ballpark of 300 images though they were mostly for teaching genitals - not sex positions. Though to be honest it doesn't seem like they were of much help. The model struggles to learn both penis and pussy at the same time and it didn't help that I was also teaching 'puffy pussy' and 'spreading pussy' in the mix.
For videos it was roughly 250 T2V videos with WAN-like captions then I duplicated those into a separate dataset and simplified their captions to be just mention 'motion/what happens', ex: 'having sex in the cowgirl position'. Those with short captions were trained exclusively in I2V mode while the others exclusively in T2V mode. For this to be possible I had to add a few lines of code into 3 files of the Musubi LTX-2 branch files.
I kept downloading more videos but I was lazy with the captions so I added the extra ones just to my I2V datasets.
At the end I still had ~250 T2V and ~400 I2V videos. 121 frames 24 fps - all of them.
Trained in multiple stages, first at LR=0.00008 gradient_accumulation_steps=1 for roughly 4 to 6 thousand steps initially using only [512,512] buckets for videos and both [512,512] and [1024,1024] for images. I ALWAYS train videos on FULL frames. I don't trust whatever Musubi is doing with other 'frame_extraction' modes.
I interrupted this run halfway to manually test it and found out that when used with the upscaler -> the upscaled video would look very bad. Kinda seemed to add 'noise'. Figured it was because I was not training the videos at higher res but with my 16GPU I'm kinda limited. So what I did was create 49 frames 24 fps clips out of all of my videos that could be used for [1024,1024], [1280,720] and [720,1280] buckets. Did this to both the T2V and I2V datasets and ofc I had to re-caption many of them because suddenly some parts of the captions no longer happened in the shorter clips. Added those to my training configuration at the mentioned buckets based on their aspect ratio and once again trained on full frames. This solved the upscaling problem.
At some point, I classified my videos in 'tiers' from 1 to 4 based on how visually pleasing they are to me and how good their animation is. I added extra separate datasets for tiers 2, 3 and 4 to serve as extra repeats for those 'better-looking' videos.
Also at some point, I wrote a custom scheduler that would multiply the base LR of a sample based on either the filename or dataset directory name. I2V datasets had higher multipliers and the files of tiers above 1 also had higher multipliers as their tier increases - something soft ofc - the max a multiplier could be was '2.0'. This was kinda like 'increasing repeats' for those files but by increasing their learning rate directly.
Finally, trained further with LR=0.00005 gradient_accumulation_steps=2 for an additional ~2000 steps (where a step here is actual 2 micro-steps cause Musubi counts a 'step' only when the weights updates - which with GAS=2 is every 2 actual steps). I wanted to train it for much longer but even this took me days on my 16Gb GPU.
Important: I used '--lora_target_preset v2v' which is mentioned in the musubi ltx-2 branch to be for 'video-to-video/IC LoRA'. Good thing I ignored that and went along with my guts because I'm pretty sure for NSFW the regular 't2v' mode wouldn't cut it. I know that from experimenting - during training breaks - with weights filtering and comparing other loras that people released on this site. My overall impression was that: every LoRA that contained the extra weights used in 'v2v' mode performed much better than those who did not.
That was even longer than I thought lol and I think I'm forgetting a bunch of things but this basically covers at least 80% of what I did.
@huj0ps1t6 Thank you so much for your detailed reply and explanation. It has been extremely helpful to me.❤
So far this is the best NSFW lora!!!
I am getting amazing motion at 0.50 using it on I2V. It generates the motion without manipulating the image. Somehow, this lora just knows what I want! Great job with this!
Yeah its way more overfitted than I thought.
May sound crazy but I only noticed that yesterday after training for another whole day and testing the results. I released it without realizing - had I noticed these biases earlier I may not had enough confidence to release.
There is very heavy cowgirl and breast bias in its current state - prompting for 'small breasts/flat chest' in T2V mode will still give you at least medium sized ones or it will be completely ignored.
Now to make a new version I'll probably have to either start over from scratch with my newly modified and more balanced datasets or use a very early checkpoint -.-