Directions
Here’s version 2 of my V2V WAN 2.2 + VACE workflow. Here’s how it works at a high level
Upload subject reference (Node Name: ⭐️ Load Image of Reference Subject)
Upload video reference (Node Name: ⭐️ Media Selection (Reference Video)
Enter Prompt
Enter Manually in WanVideo TextEncode (Default)
Use SwissArmyKnife LLM nodes
Requires additional setup if using Qwen3-VL via LLM Studio (locally hosted model)
Requires API key if using Gemini API
Run Workflow
Notes
Subject/Character
Use a high quality subject reference image
Closeups work best from my testing
The background of your subject reference will influence your video output slightly
I haven’t figured exactly how to properly mask just the subject in a way that works with WAN VACE
Your subject’s identity will not be preserved perfectly due to the nature of VACE and other variables like seed
Best best is to use a subject/character lora is you need it to be consistent
You can turn down the fun reward lora strength if your generated video is too “shiny”
Lora Additions
You can add more loras to fine tune the generated video but try not to add too many because you end up having loras fighting each other and you will get a burned out looking generation
Prompting
Subject: Describe your main subject with clarity — who or what it is, what they’re doing, and how they appear.
Clothing: Focus on what the subject is wearing or how the outfit contributes to mood, texture, colour or story. Consider description of fabrics, accessories, era, and fit.
Movement: Elaborate on how the subject moves, how the camera moves, or any dynamic elements in the scene. Use cinematic language when helpful.
Scene: Define the environment: time of day, location, background/foreground elements, mood, composition and lighting.
Visual Style: Establish the look and feel: lighting, colour-grading, lens effects, film stock, level of realism vs stylised, any elements you don’t want (negative prompt awareness).
I have added prompt examples to the markdown file in the zip
NSFW
This works with NSFW assuming you have a NSFW lora + good prompt. I haven’t found a great high quality nsfw lora so that’s why it’s not included
Major changes from V1
Added image upload for subject reference
Fixed node mismatches with SwissArmyKnife custom nodes
Switched from Gemini to Qwen3 VL (running locally and exposes via Swiss Army Knife nodes)
Added a path to input prompt instead of relying on SwissArmyKnife LLM nodes
Overhauled & Simplified VACE Encoding nodes, now it just uses the subject reference and depth map
Roadmap
Figure how to mask character so subject’s identity is preserved better and the background of the reference image doesn’t influence the generated video too much
Need a better solution for upscaling and interpolation
Explore VACE’s First Frame Last Frame capabilities to generate longer videos
Dial in settings for NSFW loras
Model links
You can find all the models on Huggingface. I am running a Nvidia 3090TI w 24GB VRAM & 128 GB DDR4 RAM. The FP8_e5m2 work best for the 3000 series generation. Generations take about 300-500 seconds on my system
Diffusion model
WAN 2.2 T2V
Text encoder
VAE
LoRAs
High Noise
Low Noise
Model Storage Location
📂 ComfyUI/
├── 📂 models/
│ ├── 📂 diffusion_models/
│ │ ├── Wan2_2-T2V-A14B-HIGH_fp8_e5m2_scaled_KJ.safetensors
│ │ ├── Wan2_2-T2V-A14B-LOW_fp8_e5m2_scaled_KJ.safetensors
│ │ ├── Wan2_2_Fun_VACE_module_A14B_HIGH_fp8_e5m2_scaled_KJ.safetensors
│ │ └── Wan2_2_Fun_VACE_module_A14B_LOW_fp8_e5m2_scaled_KJ.safetensors
│ ├── 📂 vae/
│ │ └── Wan2.1_VAE.safetensors
│ ├── 📂 text_encoders/
│ │ └── umt5_xxl_fp16.safetensors
│ └── 📂 loras/
│ ├── Wan22_A14B_T2V_LOW_Lightning_4steps_lora_250928_rank64_fp16.safetensors
│ ├── Wan2.2-Fun-A14B-InP-HIGH-MPS_resized_dynamic_avg_rank_21_bf16.safetensors
│ ├── Wan2.2-Fun-A14B-InP-LOW-MPS_resized_dynamic_avg_rank_22_bf16.safetensors
│ ├── Instagirlv2.5-HIGH.safetensors
│ └── Instagirlv2.5-LOW.safetensors
Custom Nodes
ComfyUI-WanVideoWrapper - nightly
comfyui_controlnet_aux - v1.1.2
ComfyUI-Easy-Use - v1.3.4
ComfyUI-KJNodes - v1.1.7
ComfyUI-VideoHelperSuite - v1.7.7
ComfyUI-Frame-Interpolation - v.1.0.7
ComfyUI Video Depth Anything - nightly
CRT-Nodes - v1.8.2
Swiss Army Knife - v2.9.1
ComfyUI
ComfyUI - v0.3.65
ComfyUI_frontend - v1.27.10
Python - v3.12.3
Pytorch - 2.9.0+cu128
Description
Added image upload for subject reference
Fixed node mismatches with SwissArmyKnife custom nodes
Switched from Gemini to Qwen3 VL (running locally and exposes via Swiss Army Knife nodes)
Added a path to input prompt instead of relying on SwissArmyKnife LLM nodes
Overhauled & Simplified VACE Encoding nodes, now it just uses the subject reference and depth map
FAQ
Comments (33)
Hi guys, personnally i wants to thanks LamboBro for this workflow. Actually i can't manage to make a server for ask Qwen2.5-VL to analyze a video. So i'm using Qwen2.5-VL 7 Q_4_M.GGUF on LM Studio, spliting video into frame (one frame every 0.5 sec then ask Qwen2.5-VL to analyze theses frames using this prompt. "can you describe this video following the next prompt, Use smooth, coherent sentences to narrate the scene from start to finish, following the timeline. Don't give details about tatoos".
can u upload workflow please?
its in the zip my guy....the download
Hello, how do I prevent my model from copying the reference's body and clothing? The animation result is great, but my model adapts to the reference's body shape.
when i install the swissarmyknife node and restart comfy, it just stay stuck on starting up. when i remove the node comfy starts up
could you share the startup logs?
Happens to me as well. Stuck on "## Execute management script for '...custom_nodes/comfyUI-SwissArmyKnife'"
same here, something is wrong with that git repo
i waited for an hour or so and it finally installed
an u make another workflow without swiss nodes plz
what error are you running into when executing the workflow. the wf has swiss knife nodes disabled by default so you can just enter your prompt and go
@LamboBro it stuked at installing swiss nodes in teminal when we restart
@markhassain3712 Just dont install it. You can still run WF without it....or you can just delete those nodes.
@LamboBro ok ok brother i will let u know thnx for help
Freezes on loadup w/ swiss armyknife executing the management script
invalid literal for int() with base 10: 'none' in the aiohttp install for the management script
hmm okay, will add a version without it but just dont install it if its causing issues
@LamboBro [ComfyUI-Manager] Starting dependency installation/(de)activation for the extension
## ComfyUI-Manager: EXECUTE => ['D:\\ComfyUI_windows_portable_nvidia\\ComfyUI_windows_portable\\python_embeded\\python.exe', '-s', '-m', 'pip', 'install', 'google-genai']
## Execute management script for 'D:\ComfyUI_windows_portable_nvidia\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-SwissArmyKnife'
[!]
[!] [notice] A new release of pip is available: 25.2 -> 25.3
[!] [notice] To update, run: python.exe -m pip install --upgrade pip
## ComfyUI-Manager: EXECUTE => ['D:\\ComfyUI_windows_portable_nvidia\\ComfyUI_windows_portable\\python_embeded\\python.exe', '-s', '-m', 'pip', 'install', 'opencv-python']
## Execute management script for 'D:\ComfyUI_windows_portable_nvidia\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-SwissArmyKnife'
[!]
[!] [notice] A new release of pip is available: 25.2 -> 25.3
[!] [notice] To update, run: python.exe -m pip install --upgrade pip
[SKIP] Downgrading pip package isn't allowed: aiohttp (cur=3.13.1)
## ComfyUI-Manager: EXECUTE => ['D:\\ComfyUI_windows_portable_nvidia\\ComfyUI_windows_portable\\python_embeded\\python.exe', 'install.py']
## Execute management script for 'D:\ComfyUI_windows_portable_nvidia\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-SwissArmyKnife'
and here is where it hangs indefinitely
I can get up to the depthmap pass and the resized vid but nothing else shows after removing the swissarmy stuff
think i got it now
got the same problem of stuck in ## Execute management script for 'C:\AI\custom_nodes\ComfyUI-SwissArmyKnife' can't even start comfy now
same issue with swiss armyknife
Is there a way to use this without SageAttention?
is there a way to increase fps of output videos?
Interpolate, theres a subgraph in the WF to upscale and interpolate
Does the output always match the same exact body type/shape like in the examples?
just downloaded and install nodes and nothing is really connected?
I can't get it to work with an RTX 3090 and 32GB of RAM. I've tried block swap at 30-20-15-8 and can't get it to work. I'm running out of RAM/VRAM space
Failed to validate prompt for output 1178:
* WanVideoVACEEncode 1107:
- Return type mismatch between linked nodes: input_frames, received_type(DEPTHS) mismatch input_type(IMAGE)
Help
I'm having problems with some nodes that appear with unknown names. I think they are part of LLM, or a subgroup that encapsulates certain nodes. Has anyone found a solution?
still errors, now its about the depthmap, 1107 node and 1232, wont run
the same problem doesnt work any solution ?
⚠️⚠️⚠️⚠️
This thing is beyond broken and will break your comfyui due to outdated nodes.
DO NOT USE.