Pusa is a finetune for Wan 2.2 in LoRA format. It improves video quality.
Requirements: Kijai's WanVideoWrapper nodes https://github.com/kijai/ComfyUI-WanVideoWrapper
Note (9/3/2025): the Pusa nodes for ComfyUI were updated yesterday, so this should be a little easier to use now.
Various tasks in one model, all supporting 4-step inference with LightX2V: Image-to-Video, Start-End Frames, Video Completion, Video Extension, Text-to-Video, Video Transition, and more.
What Wan 2.2 Pusa Does
Video Generation Tasks: It can perform text-to-video, image-to-video (turning an image into a moving scene), video extension, transitions, and even more complex tasks like mapping both a starting and ending frame to a video seamlessly.
Multi-Input, Multi-Task: Unlike older models that needed a separate model or training for each task, Wan 2.2 Pusa’s unique design (the “Pusa paradigm”) allows it to generalize to different video tasks with one unified system and minimal additional training.
Fast and Efficient: Thanks to its “LightX2V” acceleration and “vectorized timestep adaptation” (VTA), it achieves high-quality results in very few inference steps, drastically reducing the computing power and time needed for video generation.
How Does It Work?
Vectorized Timestep Adaptation (VTA): The core innovation is replacing the single timestep shared by the whole clip with a vector of timesteps, one per video frame, so each frame can sit at its own noise level. Frames supplied as conditioning (a start image, an end frame, an existing clip) can stay clean while the remaining frames are denoised. This gives better temporal smoothness and creative control without disturbing the original strengths of the base video model.
Mixture of Experts (MoE): Wan 2.2 uses two specialized sub-models: a “high noise” expert for the early, noisy denoising steps (overall layout and motion) and a “low noise” expert for the later steps (fine detail). The sampler hands off from the first to the second partway through denoising rather than running both at once.
LoRA Fine-Tuning: The Pusa model uses a lightweight, “surgical” adaptation technique (LoRA) to inject new capabilities (like image-to-video) without overwriting what the base model already does well, preserving text-to-video powers while adding new features efficiently.
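To make the first two ideas above concrete, here is a minimal Python sketch. It is not Pusa's or the wrapper's actual code: `frame_timesteps`, `pick_expert`, and the `boundary` value of 0.9 are hypothetical names and numbers chosen for illustration, and treating conditioning frames as exactly clean (timestep 0.0) is a simplification.

```python
from typing import List


def frame_timesteps(num_frames: int, t: float, clean_frames: List[int]) -> List[float]:
    """Return one timestep per frame instead of a single shared scalar.

    Frames listed in clean_frames are treated as given conditioning
    (timestep 0.0, i.e. no noise); every other frame sits at noise level t.
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must be in [0, 1]")
    clean = set(clean_frames)
    return [0.0 if i in clean else t for i in range(num_frames)]


def pick_expert(t: float, boundary: float = 0.9) -> str:
    """Pick which expert handles the current noise level.

    boundary is an assumed switch point for illustration only; the real
    value depends on the model and the sampler settings.
    """
    return "high_noise" if t >= boundary else "low_noise"


# Image-to-video: only the first frame is given.
print(frame_timesteps(5, 0.8, clean_frames=[0]))     # [0.0, 0.8, 0.8, 0.8, 0.8]
# Start-end frames: first and last frames are given.
print(frame_timesteps(5, 0.8, clean_frames=[0, 4]))  # [0.0, 0.8, 0.8, 0.8, 0.0]
print(pick_expert(0.95), pick_expert(0.3))           # high_noise low_noise
```

The only thing that changes between tasks in this sketch is which frame indices are marked clean, which is the sense in which one model covers several tasks.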
Wan 2.2 Pusa
https://github.com/Yaofang-Liu/Pusa-VidGen
https://huggingface.co/RaphaelLiu/Pusa-Wan2.2-V1
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Pusa
These are rank 256 LoRAs, which is why the files are so large. They are meant to enhance the video/image quality of Wan 2.2. In my own use, I find the effect is often subtle, but it always improves the video. I don't know if it was trained on NSFW, but it works fine with it.
According to the author's page, they are supposed to be used at around 1.4-1.5 weight. You can use them with lightx2v at 4-8 steps, or without it at a normal step count.
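For intuition on what the weight setting does, here is a small sketch of the usual LoRA convention, where the low-rank update is scaled by `strength * alpha / rank` before being added to the base weight. This assumes the common formulation and is not taken from the wrapper's source; `apply_lora` and the tiny matrices are made up for illustration.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain nested-list matrix product, enough for a toy example."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def apply_lora(weight: Matrix, lora_b: Matrix, lora_a: Matrix,
               alpha: float, strength: float) -> Matrix:
    """Return weight + strength * (alpha / rank) * (lora_b @ lora_a).

    lora_a has shape (rank, in_features); lora_b has shape (out_features, rank).
    """
    rank = len(lora_a)
    scale = strength * alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


base = [[1.0, 0.0], [0.0, 1.0]]
a = [[1.0, 2.0]]        # rank 1, in_features 2
b = [[1.0], [0.0]]      # out_features 2, rank 1

print(apply_lora(base, b, a, alpha=1.0, strength=0.0))  # [[1.0, 0.0], [0.0, 1.0]]
print(apply_lora(base, b, a, alpha=1.0, strength=1.5))  # [[2.5, 3.0], [0.0, 1.0]]
```

A strength of 0 leaves the base weights untouched, so comparing a run at 0 with a run at 1.5 on the same seed is a quick way to see whether the LoRA is being applied at all.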
Workflow Example: https://civarchive.com/models/1923086?modelVersionId=2176586
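If the LoRA seems to have no effect, it can help to look at which tensor names the file actually contains and compare them with what your loader reports in its log. The sketch below reads only the JSON header of a `.safetensors` file (an 8-byte little-endian length followed by that many bytes of JSON), so it needs no extra packages. `list_safetensors_keys` and `count_by_suffix` are hypothetical helper names, and the `.lora_B.weight` suffix is an assumption about how the file is laid out.

```python
import json
import struct
from typing import Dict, List, Sequence


def list_safetensors_keys(path: str) -> List[str]:
    """Return the tensor names stored in a .safetensors file, sorted."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned little-endian length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    return sorted(k for k in header if k != "__metadata__")


def count_by_suffix(
    keys: Sequence[str],
    suffixes: Sequence[str] = (".alpha", ".lora_A.weight", ".lora_B.weight"),
) -> Dict[str, int]:
    """Count how many tensor names end with each suffix."""
    return {s: sum(1 for k in keys if k.endswith(s)) for s in suffixes}


# Usage (the path is a placeholder for wherever you saved the LoRA):
# keys = list_safetensors_keys("path/to/pusa_lora.safetensors")
# print(keys[:5])
# print(count_by_suffix(keys))
```

If the names printed here look like the ones your log lists as not loaded, the loader you are using most likely does not recognize this key layout, and the file itself is probably fine.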
Comments (49)
Here is the link to the creators HuggingFace for those who want to read more.
Interesting. So this is sort of like a lora(?) that is intended to lower the amount of steps needed to produce a quality result like the Wan lightning lora? https://huggingface.co/lightx2v/Wan2.2-Lightning/tree/main
I wonder about its stance on NSFW material
@gumpbubba721291 It's a LoRA that's supposed to enhance the video/image quality of Wan 2.2. In my own use, I find the effect is often subtle, but it always improves the video. I don't know if it was trained on NSFW, but it works fine with it. You can use it with lightx2v, and that is recommended.
@Cyph3r how did you test it? comfyui is not loading the model.
@Cyph3r In my tests, all of my videos were better without it, using lightx2v on its own.
@bielzimdag346 i've attached an example workflow in the lora description.
@playtime_ai verify you're actually using it. this seems to be a frequent issue with Pusa - it requires a custom node in ComfyUI to work. If you can't tell a difference, it's very likely not even loading. It makes quite a noticeable difference.
@Cyph3r I will test your workflow. Before, it gave me several errors in the log about the lora not loading.
@Cyph3r So it does not work with normal nodes? Should it only be used with a kijai wrapper setup, or can it be used with Power Lora Loader?
Any suggested settings to get the most out of this? Strength? Scheduler/sampler? steps?
According to the author's page, they are supposed to be used at around 1.4-1.5 weight. You can use them with lightx2v at 4-8 steps, or without it at a normal step count. I like res_2s/bong_tangent or euler/beta personally.
@Cyph3r do you have a workflow you can post? the loras are not getting loaded in my usual workflow
This is insanely good - it for example doesn't distort faces in gens when used with other loras, very impressive work - thanks for sharing
can someone make the rank64 one?? the size is so huge..😥😥
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Pusa
wan 2.2 versions are less than 1 gb each.
@zorbon2 Uploaded those here. thx for the tip
tyvm, but i got the lora weight error using the 900mb one
@fronyax It doesn't work bro. Wait for updates.
It worked with kijai wanwrapper nodes for me. They also updated the nodes for pusa yesterday, but that update caused my 5090 to go to 100% and get stuck there repeatedly. So I went back to the previous version. It works but I don't think it's worth the hassle. It supposedly does I2V with the T2V model, but in terms of consistency I2V is still much superior imo.
I tried generating the same video with and without the LoRA, and the result is exactly the same. Not even a single pixel difference. I used both LoRAs with a weight of 1.5 as recommended and the same seed for both tests. I am using LightX2V, 4 steps in total.
same. i tried the lora here and kijais rescaled lora. absolutely zero difference
alright just saw that comfy isn't loading the loras at all. they might not work with the fp8/gguf models:
lora key not loaded: blocks.0.cross_attn.k.alpha
lora key not loaded: blocks.0.cross_attn.k.lora_A.weight
@crombobular You're right. I just realized that I have the same error messages (I'm using GGUF models)
I can't get this lora to load either. Tried both T2V and I2V. Not using GGUF.
It doesn't work:
lora key not loaded: blocks.0.cross_attn.k.alpha
lora key not loaded: blocks.0.cross_attn.k.lora_A.weight
@crombobular I downloaded the files from the huggingface page and it loaded. I did not get any errors. Differences were subtle and I preferred the generation without it 100% of the time.
check out the workflow i've added to the description. there's a node you need for this lora to work.
@Cyph3r The sample workflow is a Wan 2.1 I2V workflow and there are no nodes in there that I did not already have installed... So what is the custom node you are referring to? What is it called?
@playtime_ai typical that there is still no reply on how to magically make this work
@Fealow Why are you directing this comment at me? This isn't my model and I didn't post it.
can someone explain what it actually does? their huggingface sure doesn't explain anything. i'm guessing it's a general quality improver?
"their huggingface sure doesn't explain anything"
yea - it does not, but https://github.com/Yaofang-Liu/Pusa-VidGen/blob/main/PusaV1/README.md does
They claim it improves quality but I didn't see it in my testing. I preferred every generation without it over the one with it.
From my own experience, it "enhances" motion. Videos that don't have much movement will have more movement if you enable Pusa. This can be good, or bad, depending on the type of video you are generating, but generally it helps.
@Mu5hr00moO it actually doesn't explain anything in that link either, it's just showing lightx2v and talks about a custom node that includes a noise slider, which by itself DOES improve quality at higher steps, at the cost of distorting the first frame, but you don't need this lora to do that.
I'll have to try this later!
The name is kinda funny though. Sounds like it would be some sort of swear word in a different language. 😆
"Hey put some pusa in your workflow!"
"Hah? What did you just say about my mother?"
"In Da Pussa!!"
Giant Mexican Laugh erupts.
how do you use the multi-input in comfy, or is it not possible?
It doesn't even work with Comfyui. Waste of time.👎🏻👎🏻👎🏻
That's not true. It is brand new so it's a little difficult to use. But it works and you have to have kijai node in comfy for it to work. Please read the instructions.
let's try using both CineScale and Pusa together. It might lead to amazing results.
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/CineScale https://huggingface.co/Kijai/WanVideo_comfy/tree/main/Pusa
What does CineScale do??
@fronyax I'm not entirely sure, but somehow it boosts the quality — mainly the resolution.
@fronyax i used it a few times. it was mostly suited to movie-like cinematic scenes, where the camera work looks more dynamic. it also brings a different visual tone to the videos.
The pictures and description are funny, like if I waxed my car and then told everyone I made it.
I thought the same thing :D
I don't know about you guys, but when making videos at 700x740 or those 'smaller HD videos', prompting quality tags at the end like "The atmosphere is warm, tropical, and authentic, full of photorealistic details, amateur cellphone quality, heavy HDR glow, amateur photo, blown-out highlight from the lamp, deeply crushed shadows", EVEN INDOORS, PUSA v1 shines better!
PUSA might not like 'blue' light and tags like (photorealistic 4K details, natural color, cinematic lighting, crisp textures, clean edges, no watercolor, temporal consistency, fine surface detail) OR (amateur cellphone quality, visible sensor noise, heavy HDR glow)
The camera moves like crazy, losing the focus of the prompt (characters or events get out of the shot), from 0.5 to 2 to 5.6 on HIGH, any hints?
Details
Files
Wan22_PusaV1_lora_HIGH_resized_dynamic_avg_rank_98_bf16.safetensors


