CivArchive
    【WAN2.2】IMG to VIDEO - v1.1
    NSFW
    Preview 91979444
    Preview 91979442

    WAN2.2 — Image to video — Simple Workflow

    A clean, all-in-one WAN image-to-video workflow built entirely with the UmeAiRT Toolkit for ComfyUI.
    Only 12 nodes. No spaghetti wires. Just load your model, write your prompt, and hit generate.


    ⚠️ IMPORTANT — Nodes 2.0 Required

    This workflow is built for the Nodes 2.0 (Vue) interface of ComfyUI. If you don't enable it, the workflow may have display problems.

    How to activate Nodes 2.0:

    1. Open ComfyUI

    2. Go to Settings (⚙️ icon, bottom-left)

    3. Find "Use Nodes V2 (Vue)" and toggle it ON

    4. Refresh the page

    5. Load the workflow

    If you prefer the classic interface, check out my Legacy version of this workflow instead (link).


    🎯 Features

    • Text-to-Image generation

    • Automatic download of models in auto version

    • Built-in SeedVR2 upscaler — high-quality tiled upscaling (toggleable on/off) Slower than a classic upscaler, but significantly better quality

    • Full metadata embedding — your images are saved with all generation parameters, ready for online publishing and remixing

    • 3 LoRA slots — with individual on/off toggles and strength control and you can connect as many other lora modules to each other for as many LoRA as you want.


    📦 Custom Node Required

    Only one custom node to install:

    👉 ComfyUI-UmeAiRT-Toolkit

    Install via ComfyUI Manager (search "UmeAiRT") or use the UmeAiRT Auto-Installer.
    The Toolkit packages everything internally — upscaler, face detailer, metadata saver. No other custom nodes needed.


    📂 Files you need (in manual version)

    For base version
    I2V Model : wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors and wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors
    In models/diffusion_models
    CLIP: umt5_xxl_fp8_e4m3fn_scaled.safetensors
    in models/clip

    For GGUF version
    I2V Quant Model : wan2.2_i2v_high_noise_14B_QX.gguf and wan2.2_i2v_low_noise_14B_QX.gguf
    In models/unet
    Quant CLIP: umt5-xxl-encoder-QX.gguf
    in models/clip

    For speed version
    lightning LoRA : Wan2.2-Lightning_I2V-A14B-4steps-lora_HIGH_fp16.safetensors and Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors

    For AIO version
    I2V FAST AIO Model : wan2.2-i2v-rapid-aio.safetensors
    In models/checkpoint

    VAE: wan_2.1_vae.safetensors
    in models/vae

    ANY upscale model:

    in models/upscale_models

    Description

    Fix for step calculation

    Add speed lora version

    FAQ

    Comments (48)

    spanking_cutieAug 2, 2025· 2 reactions
    CivitAI

    10/10

    hpak90161Aug 3, 2025· 1 reaction
    CivitAI

    Out of curiosity is there a reason you didn't include Sage Attention in the workflow like your 2.1 ones did?

    LootDudeAug 3, 2025

    I connected the Sage Attention node and it started patching for Sage Attention. But it was much slower with it. Dunno why. Using Normalized Attention worked better for me.

    UmeAiRT
    Author
    Aug 3, 2025· 3 reactions

    Because using this node is not recommended. Instead, use the "--use-sage-attention" launch parameter for comfyui

    MegasherruAug 3, 2025
    CivitAI

    Getting error "Cannot find a working triton installation" How do you install triton? O.o

    Your 2.1 Workflow works perfectly, just having this issue with 2.2

    MegasherruAug 3, 2025

    Okay seems I could bypass it by disabling three of them, but im not sure either since I do sometimes get it regardless,
    Normalized Attention
    TeaCache
    Torch Compile

    rkleolAug 7, 2025

    if you want to use triton for windows check this:
    https://github.com/woct0rdho/triton-windows

    oo4766229Aug 3, 2025· 1 reaction
    CivitAI

    Why do I keep getting ghost frames? Like two videos merge together?

    nafikovtt777Aug 3, 2025

    try 16 fps or 3 sec, for me 5 s shift/speed 8 16 fps 5 sec, normal frames

    garystylezAug 3, 2025
    CivitAI

    I've tried to batch this workflow exactly; but, getting "lora key not loaded" messages. Any ideas? To broad to diagnose?

    yajukunAug 4, 2025

    Not sure if we are all seeing the same issue when using Wan2.1 I2V loras with Wan2.2? Read this. https://www.reddit.com/r/comfyui/comments/1mgo0z3/wan_22_doesnt_load_certain_loras_i_got_it_working/

    xuanwoaAug 3, 2025
    CivitAI

    I've replicated a simplified Kijai version of the UmeAiRT workflow.
    https://civitai.com/posts/20442238

    smolushaAug 13, 2025

    Please tell me about Umeirt, my video is blurry, but everything is clear in your version. What could be the reason for this?

    xuanwoaAug 14, 2025

    smolusha You can find here(

    https://civitai.com/user/UmeAiRT

    )

    smolushaAug 14, 2025

    I'm sorry, I didn't put it right. In WF umeirt "base-lightx2V", my video is blurry, but in your WF everything is clear and at the right speed

    GriffsuAug 3, 2025· 1 reaction
    CivitAI

    Which model do you recommend for 4090?

    vladulidloAug 4, 2025· 1 reaction

    MSI 4090 Suprim X is not bad at all :-)

    JCD007Aug 5, 2025

    vladulidlo I don't have the answer, but I believe they were asking which .guff model to use on their 4090.

    yajukunAug 5, 2025

    I am a 4090 guy too. I am using the Q8 gguf files, 480x832, 5 seconds, 25 steps, 2 loras, no blockswap, hits 23GB, take 15mins to generate. The AIO is way faster and not as good, as you probably would expect.

    exodus2000Aug 3, 2025
    CivitAI

    Is it normal that in the workflow high runs before low? I thought it was the other way around, low runs first and then high?

    vladulidloAug 4, 2025

    High noise first then low noise.

    ViciousCalamariAug 3, 2025
    CivitAI

    Amazing work as usual, too scared to try anyone else's workflows... can we get a CFG/sampler editor for each model?

    I.e.

    High noise model to run without Lightx2v @ 30 steps, CFG 3.5

    Low noise with Lightx2v @ 4 steps, CFG 1.1

    yajukunAug 4, 2025
    CivitAI

    Hi, for the 4090 guys, what are you enabling vs disabling in Optimizations? Out of the box my 4090 is only using 19.xG VRAM and it seems very slow to render. Normally when using Wan2.1 I'm just about to hit 23G usage. To be fair I normally use 480x832 for my Wan2.1 stuff, not 768x1024 so maybe that is part of it. Is there something I can turn off and use more VRAM but will speed up my processing? Do we need to blockswap?

    vladulidloAug 4, 2025· 2 reactions

    Higher resolution = higher VRAM consumed. Be happy you have some VRAM free left. Use blockswap when you get OOM errors. I usually blocskwap always as I tend to render longer clips on higher resolution with low number of steps with lightx lora and I got OOM all the time without blockswap. I swap all blocks to RAM (i have 128 GB RAM so plenty of virtual VRAM). It costs less time then dealing with OOM issues and running workflow again.

    yajukunAug 4, 2025

    vladulidlo Thanks for the info!

    cnosgame916Aug 4, 2025
    CivitAI

    When using the version 1.2 workflow, I noticed that if there’s a lot of red in the image, the video gradually turns red toward the end — including the face. I'm not sure what’s causing this.

    yajukunAug 5, 2025

    I have generated over 20 videos using the V1.2 classic version (GGUF) of this workflow and I have not seen this any red tint occuring? Very odd. I noticed toward the end of the workflow there is a Color Correction doodah, I wonder if yours is borked somehow?

    cnosgame916Aug 6, 2025· 1 reaction

    yajukun The red tint appearing at the end seems to be caused by Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64. If the original image contains a large amount of red, this effect tends to show up in the final result.

    yajukunAug 6, 2025· 1 reaction

    cnosgame916 Ahh...interesting! I am going to start trying to use the lightx loras soon. I'll keep this in mind. I think they just came out with the Wan2.2 version for T2V right? Thanks.

    bulhae219Aug 7, 2025

    noticed the same phenomenon, for example if the subject is wearing a pure red shirt, by the last frame the character becomes largely red

    bonsi8282240Aug 4, 2025
    CivitAI

    i get this error:
    mat1 and mat2 shapes cannot be multiplied (385x768 and 4096x5120)

    sorry i am new to this. how can i fix it?

    vladulidloAug 4, 2025· 2 reactions

    Make sure input image width and height are divisible by number 16 or set in Input Resize node "divisible by" 16. This will make sure that all input image sizes with not supported resolutions are divisible by 16 and will be processed. At least make sure width and height are not odd numbers.

    gorodorkAug 5, 2025

    vladulidlo I just tried that in a 800x992 image and got the same error, both are divisible by 16 :(

    Actually I was using the wrong clip model, now I got the right one and a different error is showing.

    gorodorkAug 5, 2025· 2 reactions
    CivitAI

    Is anyone getting this error?

    KSamplerAdvanced

    Given groups=1, weight of size [5120, 36, 1, 2, 2], expected input[1, 32, 13, 128, 96] to have 36 channels, but got 32 channels instead

    Appreciate any help.

    yajukunAug 5, 2025

    I did not get this error but I found this on reddit, sounds like its a common wan 2.2 issue. I assume you updated Comy before doing anything? Also seems like some solved it by using the Wan 2.1 VAE? https://www.reddit.com/r/StableDiffusion/comments/1mc2gkq/given_groups1_weight_of_size_5120_36_1_2_2/

    bulhae219Aug 5, 2025
    CivitAI

    'lora key not loaded', though loras seem to work. why this message, will there be a fix one day?

    yajukunAug 17, 2025

    Are you using Wan 2.1 loras with Wan 2.2? There are multiple Reddit posts about this. However a good explaination was from Kijai himself. https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/909

    '2.2 I2V model dropped image cross_attn layers so it's normal to get key mismatch errors when using 2.1 I2V, it still works for the rest of the keys, and those layers never were that important anyway.'

    IntelGoreAug 6, 2025· 3 reactions
    CivitAI

    When running the gguf workflow with lightxV2; it runs, but I get a million errors like this:
    ERROR lora diffusion_model.blocks.2.cross_attn.o.weight shape '[5120, 5120]' is invalid for input of size 9437184

    ERROR lora diffusion_model.blocks.2.ffn.0.weight shape '[13824, 5120]' is invalid for input of size 44040192

    ERROR lora diffusion_model.blocks.2.ffn.2.weight shape '[5120, 13824]' is invalid for input of size 44040192

    ERROR lora diffusion_model.blocks.3.self_attn.q.weight shape '[5120, 5120]' is invalid for input of size 9437184

    ERROR lora diffusion_model.blocks.3.self_attn.k.weight shape

    Any idea what's going wrong?

    UmeAiRT
    Author
    Aug 17, 2025· 1 reaction

    I just released an update with a new lora

    vistramazedAug 11, 2025· 3 reactions
    CivitAI

    Just trying the GGUF version of 1.2 and looks like the wrong Clip node was used (not the GGUF version). Cheers.

    Appreciate the work!

    smolushaAug 13, 2025· 1 reaction
    CivitAI

    Hello. Such a problem. In "IMG to VIDEO (gguf-lightx2V)" everything works fine, the video is clear. But with the same parameters, but in "IMG to VIDEO (base-lightx2V)", the video turns out to be blurry. What could be the reason?

    af1kfb110Aug 19, 2025

    when blurry, adding more steps helps me (8 instead of 4)

    smolushaAug 19, 2025

    There, in the comments, user xuanwoa threw off a redesigned workflow based on "UmeAiRT". https://civitai.com/posts/20442238

    In this workflow, my basic model works well with steps - 4, cfg - 1, Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64 - 1. It works faster and better than the GGUF model. But unfortunately there is no StartAndFrame.

    hithatwhitham895Aug 15, 2025· 1 reaction
    CivitAI

    why do the video outputs flicker?

    firsakAug 16, 2025· 2 reactions
    CivitAI

    i get ghost/blurry video, only if I push the limits in terms of resolution and duration.

    for example,

    640x896, 8 second video is fine

    1024x768, 6 second video is fine

    1024x768, 8 second video - complete mess

    any idea why it might happen?

    af1kfb110Aug 19, 2025

    you probably need more steps

    RumabenAug 19, 2025

    Try Kijai's workflow. That fixed it for me. Maybe the ksampler is bad?

    firsakSep 2, 2025

    @Rumaben 

    switching to KJs models (and a workflow that supports them) indeed vastly improved quality of generated videos. on RTX 4090: 8 second videos, 848x1088, 8 total steps - as fast as 7 minutes with great quality! 720x960, 6 total steps - 3.8 minutes.

    UmeAiRT's workflow doesn't support KJs models properly, unfortunately.

    p.s.

    obviously using "lightning" loras for speed improvement

    Workflows
    Wan Video 14B i2v 720p

    Details

    Downloads
    448
    Platform
    CivitAI
    Platform Status
    Available
    Created
    8/2/2025
    Updated
    6/11/2026
    Deleted
    -

    Files

    WAN22IMGToVIDEO_v11.zip

    Mirrors

    HuggingFace (1 mirrors)
    CivitAI (1 mirrors)