【WAN2.1】IMG to VIDEO - 🔴 v1.7

NSFW

Description:

This workflow allows you to generate video from a base image and a text.

You will find a step-by-step guide to using this workflow here: link

My other workflows for WAN: link

Resources you need:

📂Files :

For base version
I2V Model : 480p or 720p
In models/diffusion_models

For GGUF version
I2V Quant Model :
- 720p : Q8, Q5, Q3
- 480p : Q8, Q5, Q3
In models/unet

Common files :
CLIP: umt5_xxl_fp8_e4m3fn_scaled.safetensors
in models/clip
CLIP-VISION: clip_vision_h.safetensors
in models/clip_vision
VAE: wan_2.1_vae.safetensors
in models/vae
Speed LoRA: 480p, 720p
in models/loras

ANY upscale model:

Realistic : RealESRGAN_x4plus.pth
Anime : RealESRGAN_x4plus_anime_6B.pth

in models/upscale_models

📦Custom Nodes :

Description

Added user setting for interpolation ratio
Added new auto-prompt feature

Complete patch notes here : link

FAQ

Comments (41)

Devilday666Mar 28, 2025

CivitAI

I have a new issue on V7 When I want to input my own prompt it doesn't allow me but gives me a value box anyway way around this.

https://i.imgur.com/h6uOj4x.jpeg

https://imgur.com/a/a9Ql6gE

UmeAiRT

Author

Mar 28, 2025· 2 reactions

I just made a hotfix, can you re-upload the workflow?

LatteLeopardMar 29, 2025· 3 reactions

CivitAI

I converted your script to a python script so it can be used universally across OS platforms and on servers using Jupyter(like vast.ai)

I can send it over to you if you would like to add it to your description.

UmeAiRT

Author

Mar 29, 2025

You can send it to me by message and it will be provided alongside the Windows version of the script.

CasualGamingMar 29, 2025

CivitAI

This is odd, where the Size, Steps, Frames, etc were supposed to appear sliders and such, for me it's blank, so I can't change anything on this workflow. Anyone knows what could it be?

uggiugi681Mar 29, 2025

Missing nodes?

UmeAiRT

Author

Mar 29, 2025

It's a compatibility issue with mixlab node : Sliders not working after update · Issue #28 · Smirnov75/ComfyUI-mxToolkit

To explain simply, the developer of mixlab wrote a method that corrupts other nodes

CasualGamingMar 29, 2025

@UmeAiRT Cool, thanks for the fast response. I will try the fix someone pointed out in that page later. Bless ya!

CasualGamingMar 29, 2025

@UmeAiRT yeah, that did it! Modifying the line 2186 with the one suggested solved the issue. You're the man. Thanks!

keybladeMar 29, 2025

CivitAI

The "vae decoder(tiled)" can cause color shift issues in long video generation. Replace it with the normal vae decoder can solve this problem.

vae分块解码节点会导致长视频出现颜色偏移, 换成普通的vae解码节点就好了

UmeAiRT

Author

Mar 29, 2025

Thanks for your feedback, I will try this solution.

UmeAiRT

Author

Mar 29, 2025

After testing, this doesn't fix the problem, quite the opposite. However, what seems to completely fix this problem is using a non-quantized model. But obviously, this is slower and requires more VRAM.

keybladeMar 30, 2025

@UmeAiRT Sorry, I may not have made myself clear. What I meant was that when the total number of frames generated exceeds the "temporal size" of "VAE decoder(tiled)", there will be a color shift(or contrast variation) near the frames of temporal points.

As for the color darkening from beginning to end, it seems that we can only avoid it by randomly generating multiple times.

↑ for v1.6. I haven't used v1.7 yet, but I think these issues still exist.

Recently, I have seen some new technologies such as Teacache Retention mode, Cfg Zero Star, Zero Init, etc. Can I expect these optimizations to be added in the new version?

UmeAiRT

Author

Mar 30, 2025· 1 reaction

@keyblade I am open to any improvement. I will look into what these features do and how to integrate them into future updates.

RealmodelsMar 31, 2025

Try use 832*480 480*832 512*512 1280*720 720*1280 Resolutions

BreezyHeezyMar 29, 2025

CivitAI

Do the GGUF versions of wan have the same output quality as the 14b versions?

GoldenBeanOSMar 30, 2025

GGUF versions are like in between the regular model versions

some are "quantisized" from the highhest quality version and are better quality than the lower quality versions then there are those "quantisized" from those lower versions, the best quality is still the 14b version

but depending on your hardware you might want to pick and choose whichever version is best for quality for resource usage/performance your hardware can handle

"quanitsized" GGUF versions are mildly inferior to the versions they based on losing some quality but the resource requirement reduction for only a little bit of quality loss is worth it

harp357100Mar 29, 2025

CivitAI

I wish I could figure out why, but my outputs from this workflow look faded and weird compared to the basic workflow. Fluids look particularly bad, everything looks smeared with Vaseline compared to a basic workflow. I have tried disabling sage attention by right clicking and selecting bypass in the workflow and doing likewise for the apply teacache box, but I don't think it works because the output is still much faster than a basic workflow and the quality issue remains.

deestovelMar 29, 2025· 2 reactions

CivitAI

Have been using this workflow. Nice setup of everything. Has anyone figured out a cause or a fix for the weird color / contrast spike at about 2/3 of the way into every clip?

john281Mar 29, 2025· 1 reaction

I have the same issue, would love to know how to avoid this ?

EechiZeroMar 30, 2025

Here too, in my case it always starts on frame 56 and lasts a few frames before returning to normal. Maybe it's something related to VAE, but I have no idea how to configure it.

synalon973Mar 31, 2025

Make shorter videos and merge them has been my fix.

EechiZeroMar 31, 2025· 3 reactions

@synalon973 @john281
It's not the same, at least for those of us looking for consistency in movement. But it looks like I already solved the problem, and I can even make beautiful 7-second videos.
You just need to replace the Decode stage nodes with new VAE DECODE nodes

1-Look for the nodes 'Decode stage 1' and 'Decode stage 2'.

2-Create two new vae decode nodes in >Latent> VAE DECODE.

3-Now connect the old Decode stage nodes to these new ones. All inputs and outputs must go to their respective nodes.

4-Delete the old decode stage nodes.

It worked for me.

endnoteApr 1, 2025

@EechiZero Wow this actually works and has improved my generations - no more color flickering. If someone can bring this attention to the workflow owner that would be great for all users!

UmeAiRT

Author

Apr 1, 2025

@endnote @EechiZero This is strange because I have already tested this solution and it made my videos very unstable. Maybe an update. Are you talking about the VAE node decoded by default?

UmeAiRT

Author

Apr 1, 2025

my result with default VAE decode : https://snipboard.io/HxUpBJ.jpg ^^'

deestovelApr 2, 2025

@EechiZero @UmeAiRT @UmeAiRT vae decode switch worked for me. make sure the nodes are all connected the same as before. isn't the tiled decode just a memory saver? I am on a 4090.

82821253Mar 29, 2025

CivitAI

Hi, I'm really enjoying this workflow, it's so clean and cool!

Also, you have written a guide on workflow, so I can read it and grow my basic knowledge, thank you very much.

I have a question. You said that the shift level determines the movement speed of the result. If I lower the level, it gets slower, and if I raise it, it gets faster, right?

The slider can only go up to a maximum shift level of 10, is it because going higher is not recommended?

UmeAiRT

Author

Mar 30, 2025

Usually people want to reduce the speed, but if you want to go higher just double click on the number and edit directly with any values

Kanonno88Mar 30, 2025

CivitAI

Thx! I have a question, where do you control the lenght? noob me can't find it I'd like 201 length for a full loop

synalon973Mar 31, 2025

Duration slider is the length.

Kanonno88Mar 31, 2025

@synalon973 I don't find it I only see a "frames" slider with 120x frames max

synalon973Mar 31, 2025

@Kanonno88 Are you using the simple workflow?

synalon973Mar 31, 2025

If you are using the simple workflow it is the yellow duration node. Directly below the green Positive prompts node.

If you are using the complete workflow it is the yellow frames node, 2 nodes down from the red Negative prompts node.

I don't recommend going much beyond 64-80 frames in one go as trying to large a change to quickly gives me some bad results, I get better results making 2 64 frames videos and merging them.

Kanonno88Apr 1, 2025

@synalon973 I switched to the simple one, thx ^-^

ktsminhtan771Mar 30, 2025

CivitAI

why can't i adjust the camera angle, everything is fixed angle, even though i tried every prompt?

UmeAiRT

Author

Mar 30, 2025· 1 reaction

I'm not an expert on dynamic movement prompts. You'd have to look for articles from community members who have that expertise.

synalon973Mar 31, 2025· 1 reaction

If its image to image it can take a lot of frames to gradually change the camera angle from the original images postition.

GoldenBeanOSMar 30, 2025

CivitAI

I am curious to know why this workflow has an empty clip encoder, what does it do, what purpose does it serve? i like to pick things apart see how it all works to get a better understanding on how it all functions

UmeAiRT

Author

Mar 30, 2025· 2 reactions

It will disappear in the new versions, it was linked to a bug in the display of the "positive" node when it is not directly connected to a node of this type.

GoldenBeanOSMar 30, 2025

i see thanks

Workflows

Wan Video

by UmeAiRT

Download (Beta) View on CivitAI