✨ WAN2.2 — Image to video with flux kontext — Simple Workflow
A clean, all-in-one WAN image-to-video workflow built entirely with the UmeAiRT Toolkit for ComfyUI.
It allows you to generate an initial image using FLUX kontext, and then create a video from that image.
For example, you can generate a new image of a person from a photo and then animate it.
Only 12 nodes. No spaghetti wires. Just load your model, write your prompt, and hit generate.
⚠️ IMPORTANT — Nodes 2.0 Required
This workflow is built for the Nodes 2.0 (Vue) interface of ComfyUI. If you don't enable it, the workflow may have display problems.
How to activate Nodes 2.0:
Open ComfyUI
Go to Settings (⚙️ icon, bottom-left)
Find "Use Nodes V2 (Vue)" and toggle it ON
Refresh the page
Load the workflow
🎯 Features
Text-to-Image generation
Automatic model download in the auto version
Built-in SeedVR2 upscaler: high-quality tiled upscaling (toggleable on/off). Slower than a classic upscaler, but significantly better quality.
Full metadata embedding — your images are saved with all generation parameters, ready for online publishing and remixing
6 LoRA slots with individual on/off toggles and strength control. You can also chain additional LoRA modules together to use as many LoRAs as you want.
📦 Custom Node Required
Only one custom node to install: the UmeAiRT Toolkit.
Install via ComfyUI Manager (search "UmeAiRT") or use the UmeAiRT Auto-Installer.
The Toolkit packages everything internally — upscaler, face detailer, metadata saver. No other custom nodes needed.
📂 Files you need (in manual version)
For base version
I2V Model : wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors and wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors
In models/diffusion_models
CLIP: umt5_xxl_fp8_e4m3fn_scaled.safetensors
in models/clip
FLUX_Model : flux1-kontext-dev-fp8-e4m3fn.safetensors
in ComfyUI\models\diffusion_models
FLUX_CLIP : t5xxl_fp8_e4m3fn_scaled.safetensors and clip_l.safetensors
in ComfyUI\models\clip
For GGUF version
I2V Quant Model : wan2.2_i2v_high_noise_14B_QX.gguf and wan2.2_i2v_low_noise_14B_QX.gguf
In models/unet
Quant CLIP: umt5-xxl-encoder-QX.gguf
in models/clip
FLUX_GGUF_Model : Q8, Q5, Q4
in ComfyUI\models\unet
FLUX_CLIP : Q8, Q6, Q5, Q4, Q3
in ComfyUI\models\clip
For speed version
lightning LoRA : Wan2.2-Lightning_I2V-A14B-4steps-lora_HIGH_fp16.safetensors and Wan2.2-Lightning_I2V-A14B-4steps-lora_LOW_fp16.safetensors
For AIO version
I2V FAST AIO Model : wan2.2-i2v-rapid-aio.safetensors
In models/checkpoints
FLUX Text encoder : ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF.safetensors
in ComfyUI\models\clip
VAE: wan_2.1_vae.safetensors
in models/vae
FLUX VAE : ae.safetensors
in ComfyUI\models\vae
ANY upscale model:
Realistic : RealESRGAN_x4plus.pth
Anime : RealESRGAN_x4plus_anime_6B.pth
in models/upscale_models
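Before loading the manual version, it can help to confirm that every file is in the folder the loaders expect. The script below is a minimal sketch for the base (fp8) version only, using the file names and folders from the list above. The `BASE_FILES` table, the `find_missing` helper, and the default `ComfyUI/models` path are illustrative names, not part of the Toolkit, and the sketch assumes the two VAE files are shared by every version.

```python
from pathlib import Path

# Files for the base (fp8) version, grouped by subfolder of ComfyUI/models.
# Names are copied from the list above; the VAE entries are assumed to be
# common to all versions of the workflow.
BASE_FILES = {
    "diffusion_models": [
        "wan2.2_i2v_high_noise_14B_fp8_scaled.safetensors",
        "wan2.2_i2v_low_noise_14B_fp8_scaled.safetensors",
        "flux1-kontext-dev-fp8-e4m3fn.safetensors",
    ],
    "clip": [
        "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
        "t5xxl_fp8_e4m3fn_scaled.safetensors",
        "clip_l.safetensors",
    ],
    "vae": [
        "wan_2.1_vae.safetensors",
        "ae.safetensors",
    ],
}


def find_missing(models_dir, expected=BASE_FILES):
    """Return 'folder/file' entries that are not present under models_dir."""
    root = Path(models_dir)
    missing = []
    for folder, names in expected.items():
        for name in names:
            if not (root / folder / name).is_file():
                missing.append(f"{folder}/{name}")
    return missing


if __name__ == "__main__":
    # Hypothetical default location; adjust to your own install path.
    missing = find_missing("ComfyUI/models")
    if missing:
        print("Missing files:")
        for entry in missing:
            print("  " + entry)
    else:
        print("All base-version files found.")
```

For the GGUF, speed, or AIO versions, the same helper can be reused by passing a different dictionary as `expected`.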
Comments (22)
First off, thanks for your work. Impressive stuff.
I'm not even sure why I got this workflow, I have all the files but what exactly is the purpose of this workflow?
This workflow allows you to reproduce a face in a video
Is this a Flux PuLID wf that feeds into an i2v WAN wf?
Yes
OK, I have this working now. However, I enter a Caucasian woman and the workflow generates an Asian woman. I thought the idea of this was to keep the face the same?
With the 1.1 version? As you can see in my example, I use a Caucasian girl and get a Caucasian woman in the video.
Is the preview image an Asian girl too?
@UmeAiRT Ok, yes, I had the old version. I updated to 1.1, and now it's working nicely. Thanks for the effort on this. The resemblance isn't perfect, but it is pretty good overall. I will definitely be using the workflow.
@trashkollector175 It is possible to play a little with the settings in the PuLID node for a better resemblance
not working fine...
Great workflow! Any plans to add WAN lora?
You can easily add LoRAs by following the example of my two other WAN workflows.
I have been using this for a few days now.. this workflow is great.. kudos to the author.
Thanks !
Thank you for this, and thank you even more for your scripts. I was struggling to install the missing nodes in Comfy, but your script resolved it for me and the guide was easy to follow. I'll upload some videos after playing around with it for a bit.
Thanks for your feedback.
@UmeAiRT I like how you have these workflows set up, but for some reason after I resize the image I can never get it set back to 720, I always have to use 512x768. Is it possible to keep what you have but also add the option to type the exact size if needed?
Sometimes I get blurry teeth as well, I'm going to try and change the upscaler model to see if that helps fix the issue, but it might end up needing some kind of facedetailer node.
If for some reason a facedetailer node is needed is it worth also adding one for the hands as well?
@synalon973 To choose the size in exact numbers, simply double-click on the number and an editor will open.
@UmeAiRT When I double-click, it doesn't give me an editor, but it's a minor issue anyway.
RuntimeError: Can't access transformer_options, this requires ComfyUI nightly version from Mar 14, 2025 or later

