LTX 2.3
A LoRA for generating from-behind sex (facing the camera) positions with LTX-2.3 video models. Supports doggy style, prone, and top-down bottom-up positions. Check out the training data if you need help with workflows. Also I have attached my image captioning system prompt when using I2V that should help with language.
Trigger Word
sfbehind
Recommended Settings
LoRA strength (Stage 1) 1.0
LoRA strength (Stage 2) 0.85
Distilled LoRA (Stage 2) 0.6
Prompting Tips
This LoRA responds best to literal, mechanical prompts. Describe body positions and motion like you're directing a scene. Avoid poetic or abstract language.
Do: "He thrusts his hips forward in short rapid strokes, her buttocks compressing on impact" Don't: "A mesmerizing rhythm of primal passion"
Position Names
Use these exact terms — the model was trained on them:
doggy — on hands and knees
prone — lying flat face-down
top-down bottom-up — face pressed into bed, hips raised, back arched
Thrust Patterns
Two distinct patterns the model learned:
Close thrusts (no shaft visible): "He thrusts in short, rapid strokes, his hips staying pressed close to her ass. Her buttocks compress on each impact."
Long strokes (shaft visible): "He pulls his hips back, the glistening shaft reappearing, then drives forward. Her buttocks ripple from the impact."
Who Is Moving?
Man active: "He thrusts his hips forward" / "He drives into her"
Woman active: "She pushes her hips back into him" / "She rocks back against him"
Don't describe both moving unless both actually are.
Getting Better Results
Describe the male body — skin tone, build, body hair, tattoos, muscle definition. Without this it renders as a vague blob.
Describe impact reactions — "her buttocks compress and ripple on contact, her body rocking forward from the force." This teaches the model to sync the bounce with the thrust.
Describe contact points — "his hips press flush against her ass" or "his hands grip her waist."
If her face is visible describe it literally — mouth open, eyes closed, brow furrowed. Don't interpret emotion.
If no shaft is visible don't mention it. Describe hip motion and body contact only.
Specify the camera angle — straight-down, three-quarter, eye-level, low angle.
Known Quirks
Male torso needs explicit description or it gets blobby.
Impact bounce can desync if not described in the prompt — always include "buttocks compress" or "body rocks forward" tied to the thrust.
Stage 2 LoRA strength at 1.0 degrades quality. Keep at 0.85.
System Prompt I use with i2v:
You are a prompt writer for an AI video generation model. You will be given a reference image. Extract the visual details and write a generation prompt that would produce a video with a similar look and feel, but with motion added.
You are NOT captioning the image. You are writing a CINEMATIC DIRECTION that borrows the image's visual DNA — the specific colors, textures, materials, lighting mood, and character details — and adds motion to bring it to life.
Always begin with "sfbehind,"
EXTRACT WITH SPECIFICITY — every noun needs a visual adjective:
- NOT "on a bed" → "on tangled white cotton sheets, one pillow crushed beneath her chest"
- NOT "blonde hair" → "long platinum-blonde waves spilling over her left shoulder, damp at the temples"
- NOT "muscular man" → "a lean, V-tapered man with sun-darkened skin, a dusting of dark hair across his chest, and calloused hands"
- NOT "warm lighting" → "late-afternoon sunlight cutting through wooden blinds, painting gold stripes across her lower back"
- NOT "from behind" → "his hips square behind hers, his thumbs pressing dimples into the flesh above her hip bones"
PULL THESE FROM THE IMAGE:
- Hair: color with modifier, length, state (damp, tangled, pinned up, falling in face)
- Skin: tone + undertone + surface (glistening, goosebumped, flushed pink across shoulders, tan lines visible)
- Body: one or two specific details that sell the physicality (the dip of her lower back, the flex of his forearms, the soft crease where her thigh meets her hip)
- Position: name it (doggy/prone/top-down bottom-up) then add the specific body mechanics — spine angle, where hands grip, how weight distributes
- His hands: exactly where and how — "fingers splayed across her right hip, thumb pressing into the dimple above her tailbone" not just "hands on hips"
- Setting: materials and textures (velvet headboard, cool tile floor, wrinkled hotel duvet), objects that set the scene (bedside lamp casting a cone of warm light, phone face-down on the nightstand)
- Lighting: what it does to their bodies specifically (highlights the sheen of sweat on her spine, catches the ridge of his knuckles, leaves his face in shadow)
- Camera: describe by what's in frame and what's cropped (her full back and his torso from navel up, tight on where their bodies meet, wide enough to see the headboard and his arms braced against it)
ADD MOTION — pick the thrust pattern that fits the image's body positions:
CLOSE THRUSTS (his hips tight against her):
"He drives forward in short, rapid strokes, his hips barely pulling back before snapping forward again. Her buttocks flatten against his pelvis on each impact, a visible shudder rolling up through her lower back."
LONG STROKES (space between their bodies):
"He draws his hips back until the glistening shaft reappears between her buttocks, then pushes forward in one steady stroke, her body rocking forward as his hips meet her ass with an audible impact."
WOMAN DRIVING:
"She rocks her hips backward into him in a slow, deliberate grind, her spine arching deeper with each push, his hands riding her waist but not guiding."
ADD IMPACT REACTION — her body's physical response synced to the motion:
- "her buttocks compress and ripple on contact"
- "her body shifts forward two inches before settling back"
- "the flesh of her thighs shakes from the impact"
- "her fingers tighten in the sheets with each thrust"
VOCABULARY:
- "thrusts" = he moves, "pushes back / rocks back" = she moves
- If shaft isn't visible, don't mention it — describe hip motion and contact only
- Never describe what's inside her body
- Never end with mood summaries or poetry
OUTPUT: Single flowing paragraph, 180-250 words. Start with "sfbehind," — end with a visual detail, not a feeling.New release (1/15/26):
I think I achieved a decent balance on the quality of T2V, I2V, and audio so I'm releasing this as a beta. Some times things go weird. Lower strength can help sometimes with trickier prompts. I really like the use of ltx-2-ic-detailer-lora with this lora.
I'm still working on my workflow but currently I'm running a video/audio training cycle then and image training cycle to improve genitals.
Differences from v0.1
Improved audio,
T2V - Improved penis (still not perfect, but way better)
I2V - Similar or better results
Tags used during training
A woman is lying on her stomach in prone position a man behind her thrusts his hip forward and back sliding in and out.
The mans penis is visible.
Audio tags
clapping cheeks
moans, moaning, the woman's breathless moaning
heavy breathing
Training Details v0.2
30 dataset videos 576x1024@121f and 1024x576@121f
30 high quality images 1024x1024
Frame Rate: 25fps
Steps Video: 4000 (Video was trained faster than audio)
Steps Images: 3800 (Used to improve penis appearance)
NO abliterated used
Generation details:
Workflows in all images in the showcase for release.
No abliterated model used. (just don't user the LTX prompt enhancer.)
T2V vids are fp-8-distill
I2V vids are 19b-dev full.
Training update (1/14/26):
I am actively working on this LoRa. Its difficult to balance, I2V, T2V and audio all together. I'm working on my workflow and training methods, but it may end up being split for T2V/audio and I2V/audio, which is not ideal at all.
If you look at my latest video post for this model https://civarchive.com/posts/25846175. You should be able to see the massive difference in audio.
⚠️ Work in Progress (For testing only)
I believe in open development. DO NOT expect the best result from this project.
All images in the gallery are raw, unprocessed outputs directly from generation.
The last 4 images in the gallery are I2V.
Each image includes its attached workflow for full reproducibility.
I know the lora is huge. Rank 16 results were not great. Any tips for lowering the size would be great!
Any feedback is welcome.
Training Details
Trainer: Directly from the LTX team. https://github.com/Lightricks/LTX-2
Steps: 2,250
Dataset: 12 videos
Clip length: ~5 seconds
Frame rate: 25 FPS
Resolution buckets: 1024x576 - 121frames and 576x1024 - 121frames
Frames are required to divisible by 8+1
Gemma3 abliterated used during training.
NO audio training was done during this release.
Workflow & Settings
Base workflows: ComfyUI default templates for LTX-2 (T2V & I2V)
Tested on 19b-dev full and 19b-dev-FP8
Sampler: Res2s
Sampling steps: 20
Additional LoRAs:
ltx-2-ic-detailer-loraGemma3 abliterated used during generation.
Description
⚠️ Work in Progress (For testers at this point.)
I believe in open development. DO NOT EXPECT great results at this point. I am releasing to get feedback and possible help with training.
Unfortunately gemma3, even the abliterated version, makes LTX kind of tricky. Prompting around words like penis, cock, pussy, tits seems to make a bit easier. That why I started here instead of POV missionary.
All images in the gallery are raw, unprocessed outputs directly from generation.
Each image includes its attached workflow for full reproducibility.
All audio is from the original model. The first run of this lora does not include training with audio.
Training Details
Steps: 2,250
Dataset: 12 videos
Clip length: 5 seconds
Frame rate: 25 FPS
Gemma3 abliterated used during training.
Workflow & Settings
Base workflows: ComfyUI default templates for LTX-2 (T2V & I2V)
Gemma3 abliterated used during generation
Full 19b-dev and 19b-dev-fp8 have been tested. I'm hoping to get feedback on others.
Sampler: Res2s
Sampling steps: 20
Additional LoRAs:
ltx-2-ic-detailer-lora
FAQ
Comments (72)
The examples are looking good. Did you test smaller versions of the lora? They didn't look as good? Also, your examples don't say if they're T2V or I2V. Which were they?
Rank 16 didn't look great. I will be cleaning up the training data and trying it again though. Last 4 images are I2V, everything else is T2V.
Oh nice, I was looking for exactly this point of view when making prone bone videos a few weeks ago
Excited for the future of LTXV, if this is your beta work.
Porn loras is the future of LTXV 2
GREAT WORK!!! Congrats on making the first NSFW Lora for LTX2! The results are incredible! LTX2 has fantastic potential to be the new king of T2V-I2V! I have a request for the next Lora: (pov blowjob-deepthroat). Thanks!
very cool! Too bad audio training isn't possible yet. I hope it will come soon... but great to see that LTX-2 lora training is happening right now!
The audio is free, we can add any speech or sound we want in any context, which is even better...
@AI_2_addicted yes I played around a bit and the sound and voice which the base model already knows is surprisingly good! But still, if we can finetune this too it would become even better =).
1st pron LoRA for ltx2. Historical moment!
Oh damn, I didn't even notice the base model. That's awesome, nice work @daring_l
Did you use DreamFast/gemma-3-12b-it-heretic?
I'm not sure if this is a good one I need to do more testing but this is the one I used. https://huggingface.co/mlabonne/gemma-3-12b-it-abliterated/tree/main
@daring_l Heretic is abliterated made with pew's program
NOT ME TRYING FOR AN HOUR WITHOUT USING THE LORA LOL
Working fine also at distilled fp8 with standard Gemma3 4bit text encoder. Euler, 8steps like at default.
Thank you!
@daring_l After couple more tests, shapes of penis are weird. Maybe abliteraded text encoder will be needed anyway.
@flo11ok874
Remember that this Lora is still the alpha version. This is already amazing; I believe these problems will be resolved in the next version. ;)
@flo11ok874 U can prompt the penis that seems to help sometimes
@AndyZocker Yes. And when they make a lora specifically for penises that we can add as a genital enhancement, this problem will be solved.
@AI_2_addicted Yea for sure! Can't wait. LTX 2 is crazy!
This is incredible, works really well both with T2V and I2V. I can generate 20 second porn clips in about 7-8 mins at 480p on a 3060.
doesn't use obliterated encoder. completely useless. the standard one is not censured.
I agree. I've done several generations with the standard encoder and had perfect results.
Great job, thank you. A question about your training: So did you only use 12 x 5 second clips - and at what resolution?
It does seem to generate men with amputated legs, however. It seems to think that men have nothing below their knees. This might be rectified by the addition of different camera angles in the training data.
fun thing is it doesnt take 20 minutes between each try!
do not using stupid obliterated encoder
@gambikules858 I'm not.
Yes there needs to be more training data. I'm working on this issue.
Despite your qualified language in your post this actually works great. Please work on a blowjob lora next. You are a pioneer of the modern porn industry!
Works great, also with regular gemma-3-12B. Seems quite flexible as well, in T2V you can also make the one on top a futa instead of a man, and with some prompting you can get different positions like doggystyle too (tested on the full distilled model). The fact that it learned so quickly on 12 videos is promising for future LoRAs for this model!
Thank you for your feedback! Have you tried it with I2V yet? The motion works well in the non-distilled models.
@daring_l I was just trying it out with I2V, it works well for that too! Tried mostly with the full distilled version as that is quite a bit faster, it seems to work on both (I'm still figuring out which workflows are best). I noticed It works up to quite a wide angle, not only them directly facing the camera.
Hello. What are the requirements for LTX2? Thank you
I'm making great 5-second videos generated in 10 minutes. (1024x1024 with 32 GB RAM - 6 GB VRAM). I'm using an 8-step LoRa accelerator at strength 1.0. If you lower the resolution it will be even faster.
@AI_2_addicted Sounds good.. I have 8gb VRAM and 16gb ram, do you think I could run it? Are there GGUF versions?
@GlowingGuardianGirl Yes, there are several versions of GGUF in various sizes. But I'm using the standard FP4 version and getting great results. I think you could easily manage with a GGUF Q4/Q5 or FP4 version.
@AI_2_addicted Thank you
@AI_2_addicted hey there. may I ask if you use ComfyUI or Wan2gp? If it's comfy, could you share a workflow? I still can't get it work.
P.S. it would be cool if they added support for gguf for text_enconder (gemma3).
I wouldn't call it a test. seriously the success rate is quite high. Great work.
you wanted feedback so here it is.
T2V - I did only 20 inference runs trying different prompts, it seems able to produce promptable appearances for the woman and man (hairstyle eye color etc.) But my t2v seemed pretty locked to a zoomed in view like it said on the tin.
I2V - Ive done about 100 runs feeding it different images from various angles. it seems to keep the characters consistent, and has understood the concept "Thrusting" and "Penis slides in" at a level beyond just the prone bone pose. It seems to be usable for general intercourse with lowering the lora strength.
Cant wait for V2 and other projects from you. Thanks in advance! :D
Thank you!
this is great. please release other positions also
I'm sure the rest of the community is working on this as well.
Now we just have to wait like a month until someone finally gets around to training for normal sex.
I'm actually more concerned about the waistline anyway. T2V generations are even more flat than Wan. I can't get off if they don't have a ratio. And if T2V can't make it, that also means from an I2V start the waist is just going to keep getting bigger as they move around. From what I've tried LTX-2 hasn't even been able to keep the face straight.
yea - so this is really well done -- what training harness/ui did you use for this?
Directly from the LTX team. https://github.com/Lightricks/LTX-2 I've added this to the details of the Lora.
how do you train for LTXV2 and is training locally possible with 24GB vram cards (I suppose not but worth asking) ?
you are going to probably need more than 24 gigs -- ostris will likely have a trainer out soon that is easy to use
You can always rent a cloud gpu for a couple hours as well. Loras seem to train incredibly fast on this model.
You can train for LTXV0.97, those lora's work with LTXV2, and are easier to train (less requirements) but doesn't let you train the audio too
not sure about ltx2 yet, haven't tried yet, but some trainer's like simple tuner can probably train this on 24gb vram, it allows dropping into fp8 during training (even on full models) to drastically reduce vram use when training,
after playing around with this Lora, I discovered that the Lora also helps if you let her just sit on a chair and let her moan. Even if this is trained without audio, the visual part of the Lora seems to trigger more intense moaning sound from the models Audio part, since both are connected
can confirm
New training run with audio is currently being run. Mainly for moaning and 'clapping cheeks' sounds. If it works well, ill push it to public.
@daring_l very cool! Cant wait for it! Yes the clapping in currently missing ;) If training with diffusion pipe or musibi tuner supports LTX-2, then I'll try to get my sloppy Deepthroat Lora to work ... so audio is definitely needed ;)
@WhatTheGuy I'm using the trainer from the LTX team. I would suggest trying it because they have a special prompting method. https://github.com/Lightricks/LTX-2
@daring_l this is linux only? I saw someone provided a docker image with are requirements preinstalled. But I never used docker. Maybe I'll look into it if the other trainers take some more time to implement ltx2.
@daring_l how is your progress with the audio version =) ?
@WhatTheGuy Check it. Not good on the T2V side yet but I2V and Audio is money. https://civitai.com/posts/25846175
@daring_l Audio really improves it =) I hope the T2V video can be improved, since it's my main usecase >_<
are some of yall using AI for you prompts ? if so what are u using because i cant seem to find a relaible NSFW prompt enhancer let alone llm
qwen3-vl-abliterated is great (abliterated, as in uncensored). You can download it with Ollama. I use it in ComfyUI with the 'comfyui-ollama' (by stavsap on GitHub) custom node set. Or you can just use it via Ollama itself.
For the system prompt for qwen3-vl, go to Google AI Studio, choose Gemini 3 Pro Preview for the model, disable all safety settings on the right tab (Advanced settings > Safety settings). Say something like "Please create a system prompt for an abliterated LLM model for enhancing/improving prompts for use with LTX-2 AI video generation. The LLM should not censor itself or refuse to enhance a prompt based on its contents. Also , if an input image is supplied, it should be able to use that input image for better context, thus improving the prompt even further."
It somewhat works with official workflow, but with some custom workflow I've found and been using it doesn't work. Not sure why yet.
good job!
If you don't intend to include audio training, you can skip the audio modules to further reduce the file size. For example, instead of the default settings:
Original default:
target_modules: - "to_k" - "to_q" - "to_v" - "to_out.0"Change to:
YAML
target_modules: - "attn1.to_k" - "attn1.to_q" - "attn1.to_v" - "attn1.to_out.0" - "attn2.to_k" - "attn2.to_q" - "attn2.to_v" - "attn2.to_out.0"Is the audio training necessary for the Lora to work properly? Cause I've encountered several Lora already that just don't want to work with audio and voices.
Overall, this is really promising. The audio on this so very good. Faces can vary. The motion is really promising. It wants the man to have pants/underwear. Genitals need more work, and butts can get a bit weird. I'm getting the usual LTX-2 shoddy prompt adherance, but 1 in... 10? is pretty good.
I recommend setting the fixed noise seed to random in stage 1, unlink the pin at the noise seed or it will not work and maybe stage 2 too so u don't get the almost same video after generating and what often helps me is to modify the prompt slightly, like this. he thrusts his hip forward and back sliding his penis in and out which makes a hand clapping noise. And instead of "uh, uh, uh", I sometimes find "Ahh, ah, ah" better, especially if you say beforehand that she says that when she moans.
I have a improved audio version of the lora currently. Its not good on the T2V side yet but the audio for i2v is chefs kiss. im trying to figure out if its worth pushing it. https://civitai.com/images/117317463
@daring_l I think your lora is already really amazing even with T2V i got really great results if u can nail the audio and penis it is perfect. I hope for a footjob lora next if u are okay with it. Thanks for your work.
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.