CivArchive
    Wan Multiscene Photoshoot: Softcore Edition - Wan 2.2 i2v High Noise
    NSFW

    This lora creates a longform video (257 frames at once) in the style of a fast-cut edit photoshoot with multiple angles. Have a look at the examples to understand what it does.

    I very strongly recommend using the workflow I made specifically for this model, otherwise you're not going to get similar results. The workflow helps you create the starting i2v image, which is crucial for the lora to function correctly. The workflow also does two phases: The first phase creates 257 frames at a low resolution, making it easier to run on low Vram and to quickly find a seed you like. The second phase then splits the video into 4 parts, which gets denoised at high resolution. The first phase won't have good likeness to the subject, but the second phase will fix that, especially if you use a high resolution like 1920x1280. > > Click here to download the workflow < <

    The quality of your output will depend a lot on the quality of your image inputs!

    The lora is trained at 2x speed, which is why interpolation is needed to bring it back to normal speed. The reason for this was to fit more action into the 257 frames, which then become 513 frames. People with low system RAM might have some difficulty processing so many frames at full res. I'm open to suggestions on how to handle this better in the workflow.

    Support me so I can make more models, faster: https://ko-fi.com/the_cook

    These distillation loras work best with this model:
    Wan_2_2_I2V_A14B_HIGH_lightx2v_4step_lora_v1030_rank_64_bf16 on the high noise

    lightx2v_I2V_14B_480p_cfg_step_distill_rank256_bf16 on the low noise

    Size guide:

    Low Resolution 1st phase size options:

    192 x 128
    384 x 256
    480 x 320
    576 x 384

    High Resolution 2nd phase size options:

    768 x 512
    960 x 640
    1152 x 768
    1344 x 896
    1536 x 1024
    1728 x 1152
    1920 x 1280

    Higher resolution= better likeness, so see what you can manage to fit into VRAM

    Description

    Wan Multiscene Photoshoot: Softcore Edition

    FAQ

    Comments (133)

    exogmpJan 17, 2026
    CivitAI

    Have you tried SVI 2.0 Pro?

    The_Cook
    Author
    Jan 17, 2026

    Yes it's great. I trained this LoRA before SVI 2.0 pro was released.

    exogmpJan 17, 2026

    @The_Cook great, currently testing with my SVI wf and try to use your promt to see if it works in batches.

    erdelman73267Jan 17, 2026
    CivitAI

    got out of vram memory on high res pass with 16gb vram, but low res looks magnificent. Trying again with 720p on high res...

    erdelman73267Jan 17, 2026· 1 reaction

    btw it works great with distilled 1022 lightx2v lora on low...

    The_Cook
    Author
    Jan 17, 2026· 1 reaction

    Yes, in the workflow I have a list of recommended resolutions for lower vram. Glad you're getting good results.

    CharlieBrown0115Jan 17, 2026

    @The_Cook I CANT SEE THE WORKFLOW! :(

    _xxxBigMemerxxx_Jan 17, 2026· 1 reaction

    @CharlieBrown0115 Link works fine for me after providing buzz to the legend who made this

    erdelman73267Jan 18, 2026

    @The_Cook spectacular results, the higher u can go with high res the more stunning it looks as long as u pick the right photos and properly crop as explained in wf. Some issues with how genitals looks, I think I could try wan2.1 female genitals helper, it would be perfect if the innies of that lora could be reproduced in these videos, will try later, it took around 30 minutes 1536 x 1024 to finish with sage attention and fp accumulation on 5060 ti 16gb. trying higher res I get out of vram with Q4 KM model.

    xG00N3RxJan 17, 2026· 7 reactions
    CivitAI

    Genius idea, now if only we can get the same with a montage of sex acts!

    Anyone who is looking to fit the criteria for the images requested, I'd definitely recommend looking into Nano Banana Pro to make the necessary shots of your character to feed into the initial images!

    The_Cook
    Author
    Jan 18, 2026· 2 reactions

    Yes, I have a list of these types of loras that I'm building datasets for.

    And yes, Nano Banana pro can get you the angles if you don't have the data, but just remember that it isn't going to be "ground truth" data, it will be an approximation. That is fine for fictional or animated character consistency, but if you know what the subject should look like, it's best to feed it the real data.

    GayLizardSpyJan 18, 2026· 6 reactions
    CivitAI

    Genius. You are the Heisenberg of AI video 🧪⚗️

    ShinyLillyJan 18, 2026· 2 reactions
    CivitAI

    Fantastic workflow, was just missing four sets of get W & H on the width and height of the stage two, easy fix!

    _xxxBigMemerxxx_Jan 18, 2026

    I wasn't getting RES4LYF generating anything on the 2nd stage and just hanging until I defined these WxH values in the Upscale nodes, thanks for mentioning this! It was either this or lowering the High Rez Pass WxH +(AND) connecting to those undescribed WxH values that I got past the hangup and ClownSharK sampler started actually working haha

    The_Cook
    Author
    Jan 18, 2026· 1 reaction

    Thanks, I updated the workflow with the fix.

    joesixpaqJan 18, 2026
    CivitAI

    Wait, do I need to provide 4 images for the workflow to work?

    The_Cook
    Author
    Jan 18, 2026· 1 reaction

    The quality of your output depends on you input. You can do a single image input, but your output won't have good likeness, just like all other i2v models. Having more angles of the subject boosts the likeness a lot.

    joesixpaqJan 18, 2026

    @The_Cook I guest I got it now. Additionally to the starting frame, I provide the guidance for her face and full body

    AishaAIJan 18, 2026· 4 reactions
    CivitAI

    For some reason, using both models causes a huge loss in image quality during transitions. Hair looks like straw, noise appears in the image, and so on...

    However, removing the Low Noise model solves a good part of the problem, it practically corrects it, and there's an explanation for that:

    The High Noise model is responsible for building the progression of the scene, so it's what handles transitions, movements, objects, physics, space, etc.

    The Low Noise model is responsible for building the video details, such as texture, shadows, colors, faces, and other more specific details that are not directly related to movement but rather to the level of visual detail (it's like a refiner for the frame generated by High Noise).

    The idea of ​​your LoRA is to teach the hardcut, which is this abrupt cut and different poses in a short period of time. This is an exclusive job of High Noise, so it's what will learn this concept. The Low Noise model didn't need training; after all, you're not teaching it new textures, just camera cuts, angles, and poses. Because you trained the Low Noise model, it couldn't quite understand why it was receiving those images, and perhaps your dataset wasn't very high quality, so it thought reducing image quality was part of the training.

    I know your example images are perfect, but that's because you're using two completely different models in High and Low Noise, to the point that your LoRA has almost no effect on the Low Noise model.

    I'll post 2 videos in the comments soon using High + Low and just High, and you'll see a difference in detail quality.

    AishaAIJan 18, 2026

    High + Low = Poor quality
    https://civitai.com/posts/25959349

    AishaAIJan 18, 2026

    Only High = Good quality
    https://civitai.com/posts/25959403

    red0815Jan 18, 2026

    @AishaAI Would you post your workflow? The original poster's link to his workflow isn't available for me and your second result looks amazing!

    GayLizardSpyJan 18, 2026· 2 reactions

    I'm pretty sure the low noise contains information about what vaginas and nipples look like, etc. But if you're using with other genital loras or a custom merge of Wan 2.2 I guess you don't need it.

    Lora_AddictJan 18, 2026

    If i only use the high noise, the face completely changes.

    I use the ultimate anus and pussy lora for the low one and face consistency is acceptable and the genitals are WAY better then with the original low noise lora.

    fronyaxJan 18, 2026
    CivitAI

    Try clear vram or ram node and place it after the last vae decode before combining all of those interpolated frames to avoid OOM.

    WarmKnowledge6820692Jan 18, 2026
    CivitAI

    Seems good but with your workflow I keep getting stuck in the first Video Encode during the High Res pass. 5070ti, 64gb ram.

    The_Cook
    Author
    Jan 18, 2026· 1 reaction

    Get the updated workflow, and try lowering the high res to one of these values until it fits your vram:
    768 x 512
    960 x 640
    1152 x 768
    1344 x 896
    1536 x 1024
    1728 x 1152
    1920 x 1280

    white1147Jan 18, 2026
    CivitAI

    Using Model Patch Torch Settings and Patch Sage Attention KJ may result in image blurriness, and is Sage Attention with enable_fp16_accumulation=true unavailable?

    erdelman73267Jan 18, 2026

    I've added fp16 accu and sageattention to wf and it works just fine no blurriness...

    white1147Jan 18, 2026

    @erdelman73267 I don't run into this issue in other native wan2.2 workflows, but it just doesn't work with this one created by the author. I'm using an RTX 5090, with torch 2.7-cp310-cp310 and CUDA 12.8.ComfyUI 0.9.2. It might be an environment-related problem.

    erdelman73267Jan 18, 2026

    @white1147 I am on cuda 13 on freshly installed UmeAirt comfy https://huggingface.co/UmeAiRT/ComfyUI-Auto_installer

    JibXorzJan 18, 2026
    CivitAI

    got a good video on interpolation for part 1. Part 2 it got very noisy, and by part 4 it is a black video. any thoughts?

    Lora_AddictJan 18, 2026· 14 reactions
    CivitAI

    Do you have any other place where you offer your Loras? Because there is a high chance civit will nuke this one.

    forfreelsd368Jan 18, 2026· 4 reactions

    I hope he has. -_-

    BretChampagneJan 18, 2026
    CivitAI

    Just playing with this lora right now...
    Cool concept and very handy when I get lazy XD
    Like your "posing nude" banger, seems very solid about female anatomy ;-)
    Just a downside : I m using my own wan 2.2 img2vid workflow, with the "PainterI2V" node and I get very "accelerated/fast forward" outputs... Guess I will "cheat" when editing them with Premiere...
    Thanks dude for this cool new toy ;-)

    The_Cook
    Author
    Jan 18, 2026· 2 reactions

    The training was accelerated to fit more motion into the frames. So just add an interpolation node and 2x the frames. It should then be closer to normal speed

    BretChampagneJan 18, 2026

    @The_Cook Thanks for the tips !
    Maybe I should just try your recommended workflow. You ve warned us... Sorry If my comment looked like a complain, it wasn t ;-)
    Was reading a comment here about multiple images input, seems very interesting too...

    BretChampagneJan 18, 2026

    @The_Cook just finished a 3rd render... I tried to mix with a slop more natural bouncing lora, but it s still very "NSFW all in one" amusing bouncing...
    my prompt about it was : "her large breasts sway naturally. she s posing nude, teasing the viewer." > nope... did not work as expected...
    Still don t know If too much "bouncing" is that bad XD

    HenkeiJan 18, 2026· 19 reactions
    CivitAI

    appreciate the low cost unlike some who ask 20k for shit that doesn't even work properly

    Lora_AddictJan 18, 2026· 7 reactions
    CivitAI

    Oh and btw, this Lora is great. Your WF too. But you can also use it with "normal" workflows, extend the length a bit and prompt what you want to see at which second. The possibilities are almost unlimited.

    iPiKoJan 18, 2026· 5 reactions
    CivitAI

    This is insanely good

    JabGalJan 18, 2026· 5 reactions
    CivitAI

    Awesome LORA man. One of the best Wan Lora of this style.

    Congratulations!

    sirbazzrickJan 18, 2026
    CivitAI

    excellent work, works really well with a normal work flow, use high1 and low 0.9 to stop the face changing and the higher the resolution the better.... well done

    Ragamuffin20Jan 18, 2026
    CivitAI

    this is nothing short of amazing but im going for an LTX-version because 30 min aint it

    LetTheBassDropJan 20, 2026

    LTX is terrible. Doesn't adhere to prompts at all.

    Ragamuffin20Jan 20, 2026· 1 reaction

    @LetTheBassDrop neither dose wan withought the right loras

    7180347Jan 20, 2026

    what's the time difference ratio and what your vram?

    Ragamuffin20Jan 20, 2026

    @olivereads38255 omg dont even get me started. i can do a 10 second 1920x1088 video on ltx in 3 min. wan is a good 7-8 min for a 720x512 video. i got 24 gb of vram

    7180347Jan 21, 2026

    @Ragamuffin20 damnn. how's the prompt adherence? do you need about 2x the attempts vs wan or less??

    Ragamuffin20Jan 21, 2026

    @olivereads38255 I mean honestly everyone's experience is different and i haven't run any professional tests with it but from what I've used it for I've come to discover to fairly accurate. as much as wan or more provided you give it an appropriate amount of time for your prompt. like you don't want to prompt something that will take more time play out that how much time you set your generation for. you can also use your own audio which is a HUGE help in that aspect. the REAL problem is though that its heavily censored. people are making loras and text encoders that bypass them perfectly but withought the loras you cant really generate anything NSFW and my god do they take long to train. i have on right now thats been in runpod for the past 2 days.

    7180347Jan 22, 2026

    @Ragamuffin20 ahh, i see. some decent tradeoffs. how long did a wan lora take for you in comparison?

    Ragamuffin20Jan 22, 2026

    @olivereads38255 20 min

    7180347Jan 22, 2026

    @Ragamuffin20 sorry, i meant for training

    Ragamuffin20Jan 22, 2026

    @olivereads38255 i actually never train for wan but i did train for z-image and that only took 3 hours for 5000 steps. this one was like 3000 steps

    7180347Jan 22, 2026· 1 reaction

    @Ragamuffin20 i see, whelp looks like i have a lot to experiment with this weekend! thank muffin

    Ragamuffin20Jan 22, 2026

    @olivereads38255 gl man and it if helps AI toolkit https://github.com/ostris/ai-toolkit has wan2.2 and ltx-2 which makes training 1000x easier and Aitrepreneur https://www.patreon.com/c/aitrepreneur/posts has a batch file for an easy runpod set up. its only like $5 to get in i think

    7180347Jan 25, 2026· 1 reaction

    @Ragamuffin20 thanks man!

    Ragamuffin20Jan 18, 2026
    CivitAI

    also for combining all the videos, just combine the 2 at time. combine the first two videos and combine that video with the 3rd one and vice versa.

    OddDimension99Mar 23, 2026

    how to turn on combine videos? I enable nodes but they dont start

    qdr1enJan 18, 2026
    CivitAI

    Good nice lora., original concept. Could replace Qwen Edit for reproducing a character in various poses. Regarding speed, I prefer interpolating at x2 and set the FPS at 24.

    Mka123Jan 19, 2026
    CivitAI

    Is it normal for the first image of each video to be the grid of the 4 preview images?

    The_Cook
    Author
    Jan 19, 2026

    for the 1st phase, yes, for the 2nd phase the workflow is supposed to delete the first frame after denoising

    Mka123Jan 20, 2026

    @The_Cook It doesn't always delete the first frame, even with the 1.1 WF.

    TheAIAddictFeb 3, 2026

    Having the same issue here. The preview image is sometimes flashing in the beginning of the other phases. seems a bit random.

    Mka123Feb 3, 2026

    Yeah, it's still happening to me for every generation. Not sure why.

    TheAIAddictFeb 3, 2026

    I fixed my issue. If you use the FL Image Batch Node to combine the frames, you need to have the Select Every Nth Image node strip the first frame before sending it to the FL Image Batch Node. So for instance, its: Select Every Nth Image Node > FL Image Batch Node (input image_1) then do the same for (input image 2), etc.

    Mka123Feb 3, 2026

    @TheAIAddict I've actually always had this bypassed at the end because I saw too many say it caused OOM errors, so I just combine the 4 separate outputs with ffmpeg later. But I do have Select Every Nth Image node active between the Video Combine node and the FL RIFE Interpolation nodes. These came this way by default and haven't been moved or turned off.

    ROXBOTJan 19, 2026· 14 reactions
    CivitAI

    back it up on Huggyface...The clock is ticking. :/

    ramnnvJan 21, 2026

    yeah, that really sucks.

    mulbrasydJan 21, 2026

    ???

    happylittleteapotJan 21, 2026

    is there a reference

    jsst1Jan 22, 2026

    What is happening? please link info

    jynkz41Jan 20, 2026· 1 reaction
    CivitAI

    I'm not using your workflow at all as I use wan2gp. This is IMPRESSIVE! congrats on this.

    forfreelsd368Jan 20, 2026
    CivitAI

    "Good luck recombining all frames without running out of system RAM. If somebody finds a way to do this, let me know."
    Well, you just need to add clean vram nodes (I use vram debug to find it where) after some parts of wf.

    4070TiS 16gb vram, 64 ram (oh, and full fp16 model of wan2.2 i2v), 768x512 - first pass, 960x640 second (think I can up this, I'm lower it because of testing - want to see if it combines frames after all) and... 25 mins -_- Pretty heavy, but even first pass looking good.

    I want to try this lora with some custom wf's and see what happens.

    forfreelsd368Jan 21, 2026

    So, after some tests, I think the main problem of OOM error is comfy memory leaks (or, it just offloads everything from vram to ram, cleans vram, but not cleans ram, and finally gets OOM at some VAE decode, like I got recently at 2nd pass at 4th clip - almost at the end -_-).

    I divided authors wf into three separate wf's (1st pass, 2nd pass and final concatenating into one video) and it worked better then one wf (at least, now in 2nd pass there's no loaded at ram high noise model). So, final by now resolutions are 768x512 (that's +- max I can afford for 257 frames on my sys) for 1st pass, 1344x896 for 2nd pass (tried 1536 x 1024 on whole wf, but got OOM on that last 4th clip at VAE decode stage, not tried with separate wf's because...)

    Because it takes about an hour... ^_^ I did some simple wfs for prompts like "She turns herself straight to 90 degrees and stands still." or "She turns straight to camera and stands still." for preparing side view or front face view pics if there is no such picture (ofcourse most of a time there is no such picture). Some of them run then through SeedVR2 to add some details (anyways the quality will be doomed when concatenate to one 1344x896 picture, but better than nothing). Also, I use for side body shape view authors previous lora "posing v2", because from what I saw, it somewhy do better shapes for "shape" picture, than if it does this lora if "shape" are not properly... indicated.

    forfreelsd368Jan 23, 2026

    No, it's still about 40 minutes.
    Also, find out that for face consistency sometimes helpful to do just 1pass render, than do faceswap with reactor of whole clip, than run in through SeedVR2 for upscaling to 1080p - faster for me, better face at the end, but... reactor isn't perfect, if you tried it before - you know weak spots (sometimes can't swap face at some angles, and when half of face at the frame), but works mostly good as the face almost every time at the frame and facing at working angles. Also, tried to use 2pass after reactor, but 0.65 denoising ruins all the work, something like .12 might keep produced with reactor face and blend it deep into clip judging on previous experiments, but not tried that here.

    MMOFanJan 21, 2026
    CivitAI

    This works pretty well on workflows with SVI pro, it starts to lose the face by the third video though

    tyokan_fight641Jan 21, 2026
    CivitAI

    Hello. I’m having trouble running the portrait version of the workflow.
    It fails when concatenating four images (concat right) because their heights do not match.

    I also have a question about concatenating the four images:
    for the images used as the environment, is it better not to apply padding to them?
    In the wide version, it seems the images are being enlarged using resize, so I’m a bit unsure about the intended approach.

    ZealousidealEye1145728Jan 21, 2026· 1 reaction

    in image_concat_multi node set match_input_size to true

    The_Cook
    Author
    Jan 21, 2026· 2 reactions

    if the size doesn't match, just move the cropping boxes slightly and re-run those nodes. The problem is that the OLM nodes haven't updated their sizing when you change the High Res size, and then there is a chance that they crop 1-2 pixels differently. Re-drawing the crop box fixes this.

    There might be a better way for the portrait version, I was in a hurry when putting that together honestly

    ReikitsuneJan 21, 2026

    @The_Cook Do you split the Frames with Premiere or a video editor or am I missing something ?

    The_Cook
    Author
    Jan 21, 2026

    @Reikitsune The high resolution output gets saved as four separate video clips, because Comfyui gives oom System RAM errors if you try to re-combine 513 high resolution frames, so you can use an editor like Premiere to recombine the clips, or you can use an ffmpeg command to stitch the clips

    TWYXFeb 7, 2026

    I have found it is best to crop the images outside at 1:1 2:3 and 5:12 then they never fail. The only thing i can't figure out is what are the crop markers. Is it to the knees or to the finger tips for 5:12? Is it to the waist or to the gap for 2:3? how close is a closeup face shot, just neck and head, to the shoulders, or cut to the chin with no neck?

    AromaticPop76Jan 21, 2026· 6 reactions
    CivitAI

    Can you comment at all on how open to prompting this is? I'm curious what kind of freedom we have with prompting.

    robcoch93Jan 21, 2026
    CivitAI

    Seems very interesting !

    What total cumputing time do you get for a full video ? and with which gpu reference ?

    AromaticPop76Jan 21, 2026

    26 minutes on a 5090 for me, but you get a preview in just a few minutes, before it does all its refining.

    dzuodn9i303Jan 22, 2026

    about 1.5 hours for me on 4060ti 16GB

    That's at 896 x 1344

    Preview in 6mins or so

    eriocaJan 22, 2026

    @dzuodn9i303 possible to let me know how much RAM you having? i have 32gb ram with 5060ti 16gb vram, every time it finish render 1 part of 896x1344 will auto crash my comfy, so i need to restart comfy 4 times to get all the 4 parts :D

    dzuodn9i303Jan 22, 2026

    @erioca 64GB DDR5 it gets pretty full

    eriocaJan 22, 2026

    @dzuodn9i303 ohhh....thats double of mine hahahah, no wonder mine died after 1 video

    fifthcodmobile602Jan 23, 2026

    Just over 9 minutes for me, but thats because I'm only doing 2nd phase at 896x1334 and portrait mode.
    Using H200 SXM and 234gb of ram

    MMOFanJan 21, 2026
    CivitAI

    OK, I wasn’t sure I’d find this entertaining but I had to try and I didn’t want to use any complex workflows so I’m using the example SVI pro one. It does a very capable job even on a 15 second single video using that workflow yeah it’s a little fuzzy during transitions but not so much you’d be upset and it honors your starting image like a champion. Fun to be had by all.

    DoitDoitJan 22, 2026
    CivitAI

    Why is only noise output from the third sampler? The results from samplers 1 and 2 are normal.

    bishomilad1202Feb 7, 2026

    did you find any fix ?

    a0978750321122Jan 22, 2026· 3 reactions
    CivitAI

    This is really powerful. It can be used to generate many different photos, changing them to different scenes, such as snowfields or lakesides. It also works with multiple people; you just need to change the prompt from "one person" to "two people" or "more people."

    mafasdJan 22, 2026
    CivitAI

    I ran sometimes in an error: ImageConcatMulti Sizes of tensors must match except in dimension 2. Expected size... Playing around with the scaler does not really resolve anything for me, anybody an idea?

    Great Lora and Workflow, well done!

    The_Cook
    Author
    Jan 22, 2026· 1 reaction

    if the size doesn't match, just move the cropping boxes slightly and re-run those nodes. The problem is that the OLM nodes haven't updated their sizing when you change the High Res size, and then there is a chance that they crop 1-2 pixels differently. Re-drawing the crop box fixes this.

    WarTouristJan 22, 2026
    CivitAI

    So fun!

    redcreamsodaJan 22, 2026· 9 reactions
    CivitAI

    This is fantastic (used your workflow); you are a Golden God and a 5-Star man! Thanks so much!

    galaxytimemachineJan 23, 2026
    CivitAI

    Looks great! Any chance of an LTX2 version? Would be faster and have sound.

    The_Cook
    Author
    Jan 23, 2026· 3 reactions

    an LTX version is in the pipeline

    ClocksmithJan 23, 2026
    CivitAI

    I'm trying to use the portrait version of your workflow and it looks like you used HTML as input to nodes expecting INT values. This blocks me from trying it because I only have portrait input readily available.

    Prompt outputs failed validation:

    ImageResizeKJv2:

    - Failed to convert an input value to a INT value: per_batch, <tr><td>Output: </td><td><b>1</b> x <b>560</b> x <b>800 | 5.13MB</b></td></tr>, invalid literal for int() with base 10: '<tr><td>Output: </td><td><b>1</b> x <b>560</b> x <b>800 | 5.13MB</b></td></tr>'

    ImageResizeKJv2:

    - Failed to convert an input value to a INT value: per_batch, <tr><td>Output: </td><td><b>1</b> x <b>336</b> x <b>800 | 3.08MB</b></td></tr>, invalid literal for int() with base 10: '<tr><td>Output: </td><td><b>1</b> x <b>336</b> x <b>800 | 3.08MB</b></td></tr>'

    ImageResizeKJv2:

    - Failed to convert an input value to a INT value: per_batch, <tr><td>Output: </td><td><b>1</b> x <b>448</b> x <b>538 | 2.76MB</b></td></tr>, invalid literal for int() with base 10: '<tr><td>Output: </td><td><b>1</b> x <b>448</b> x <b>538 | 2.76MB</b></td></tr>'

    ImageResizeKJv2:

    - Failed to convert an input value to a INT value: per_batch, <tr><td>Output: </td><td><b>1</b> x <b>448</b> x <b>538 | 2.76MB</b></td></tr>, invalid literal for int() with base 10: '<tr><td>Output: </td><td><b>1</b> x <b>448</b> x <b>538 | 2.76MB</b></td></tr>'

    ImageResizeKJv2:

    - Failed to convert an input value to a INT value: per_batch, <tr><td>Output: </td><td><b>1</b> x <b>896</b> x <b>1334 | 13.68MB</b></td></tr>, invalid literal for int() with base 10: '<tr><td>Output: </td><td><b>1</b> x <b>896</b> x <b>1334 | 13.68MB</b></td></tr>'

    TompteJan 23, 2026
    CivitAI

    This is a fun model.

    jellywing414280Jan 24, 2026
    CivitAI

    Thank you for this great workflow.

    It is working great, but noticed first few frames of the vid is always image of concatenated picture frame. Is it just me?

    purger84Mar 18, 2026

    It's happening to me too. Does anyone have any ideas?

    The_Cook
    Author
    Mar 19, 2026· 1 reaction

    The first frame is removed on the interpolation phase, but it's not removed on the 2nd phase output. I have updated the workflow so that it's now removed on both

    jynkz41Jan 24, 2026· 2 reactions
    CivitAI

    I just want to say im not usung your worklow and everything is still coming out great!.. also works pretty well with other loras to get muti scene cuts/angles of the same action, if you know what i mean. Awesome! Its also great breaking past the 12 second barrier doing a sungle render, a lot of the times.

    MaxtorCivitaiJan 24, 2026· 8 reactions
    CivitAI

    Awesome! thank u! there is a hardcore version? could be nice too

    The_Cook
    Author
    Jan 24, 2026· 9 reactions

    Hardcore version is in the pipeline

    LetTheBassDropJan 24, 2026

    You can turn it into hardcore.

    MaxtorCivitaiJan 25, 2026

    @LetTheBassDrop Do you have any tips? I tried using some Blink LORAS, but I haven't had any success. Could you share the prompt and Loras you used?

    happylittleteapotJan 27, 2026

    @MaxtorCivitai using the supplied workflow would be trivial, simply provide it the hardcore source image and only add the loras the that particular sample step. Doing hardcore and the smooth transitions of this lora won't work as well or at all -- as you claim. I'm sure some significant prompt-work could help, but sampling more than once is the real solution

    ran2me3720Jan 25, 2026· 1 reaction
    CivitAI

    I didn't get it, if i need to supply the images, what does the lora do? if i had the images in advance, i can make videos of them with multiple loras

    playnproto266Jan 26, 2026

    That's if you're using the supplied workflow for max quality. If you don't want to do all that then just use it as a normal lora and prompt for scene changes. Personally, I already use a hard cut lora but still use the low noise of this lora as it maintains facial consistency a LOT better between cuts.

    DRZ3000Jan 25, 2026· 3 reactions
    CivitAI

    Great workflow! It seemed to me that dpmpp_2m \ simple - performs better in high resolution. The faces turn out more accurate, with DEIS it loses its original shape.

    DRZ3000Jan 25, 2026

    My vae encode and decode start to slow down periodically with no reason, even though I still have enough memory. I replaced them with the tiled versions, and it seems to have helped.

    Neo_fernuzJan 27, 2026· 1 reaction
    CivitAI

    It's a very good lora, but we need a simpler workflow, i use it with my normal workflow at 162 frames, and works very good, Thanks for sharing such a good model.

    happylittleteapotJan 27, 2026
    CivitAI

    Made me setup I2V again to test. Was worth it. one really cool "feature" is it can combine multiple characters into one, for fun combinations.

    hsholmeszeroJan 27, 2026
    CivitAI

    GOATED LORA 🐐🔥🔥🔥
    prolly second to none at maintaining facial and body consistency. i've also used the low-noise model to pair it with other LoRAs as other ones forcefully change the characters' ethnicity.

    jbear2447Jan 29, 2026
    CivitAI

    Buddy, this is amazing. Thank you for sharing <3

    blaFeb 6, 2026
    CivitAI

    yeah the low lora makes a LOT of difference

    bionovafood863Feb 13, 2026· 2 reactions
    CivitAI

    Excellent work! But your diffusion model eats up too much VRAM and produces errors at low resolutions. I used this author's model https://civitai.com/models/2053259?modelVersionId=2668710. And that solved the memory issue. On a 5080, it's 16GB. Maybe someone will find this useful. Your model is too heavy.

    ellessararagornt20Feb 19, 2026

    yes, my suggestion : ( this is what i did ) 1. use GGUF Q4 models, 2. use block swap ( i did 40 ) 3. use context options , 4. use tiled vae decode - 4.a use tiled encode for high rez pass, 5. i use 2 GPUs - a rtx4060ti 16gb as primary and a 12gb 3060 as secondary - i used multi gpu nodes to load the clip into secondary gpu ( this frees up 6GB ov vram on primary ) 56. finally use a purge memory node between high noise aand low noise pass - i usually use 3 steps for high noise and 5 steps for low noise - --- the work flow gave me excellent results - but with my 2 GPUs and a 64GB RAM - it took exactly 1 hour 21 minutes to generate a 257 frame video in 1536 x 1024 size

    bionovafood863Mar 2, 2026

    I personally wouldn't try to repeat such a feat a second time. Almost an hour and a half is too much for one video.

    Cor4411Apr 18, 2026

    SVI has been a disappointment for me until now(3090 24gb 64gb ram), so this one suits me much better. Can you share your workflow so we can try it? Thanks!

    i get IndexError: index 3 is out of bounds for dimension 0 with size 3 on batch 2 batch one is working any help ? i will try the model above ( 9070xt 16 giga vram / 32 giga ram )

    cyberpunx1980Feb 18, 2026· 3 reactions
    CivitAI

    absolute skinema! 🙌

    tresanti03182Feb 19, 2026· 12 reactions
    CivitAI

    awesome work! need a hardcore version now :)

    Ass51Mar 1, 2026· 3 reactions
    CivitAI

    Awesome! Need a LTX2 version to save that model!

    comprehensivemixpmvMar 4, 2026· 7 reactions
    CivitAI

    Bro dropped this banger and disappeared :(

    tbane526Mar 20, 2026

    Wake up, brother--for he has returned!

    dipi199201Mar 23, 2026· 4 reactions
    CivitAI

    this is one of the best lora and workflow i have come across , hope u can create multiscene stuff for different , sex positions

    rumbleskinMar 26, 2026· 2 reactions
    CivitAI

    Great workflow! Starting from a fresh copy of your workflow, I do see a weird anomaly though. After the 2nd pass when it creates the 4 clips, the clip with the starting image is switched with the second clip in terms of naming. Any idea why this happens?

    LORA
    Wan Video 2.2 I2V-A14B

    Details

    Downloads
    6,870
    Platform
    CivitAI
    Platform Status
    Available
    Created
    1/17/2026
    Updated
    6/14/2026
    Deleted
    -

    Files

    W22_Multiscene_Photoshoot_Softcore_i2v_HN.safetensors

    Mirrors

    HuggingFace (71 mirrors)