    Wan i2v with 720p smoothing - v1.3

    Generate a base video with Wan 2.1 i2v at 480p, then upscale it and smooth the result with the 1.3b t2v model.

    Running an upscaling model directly on a video tends to produce poor results: the frames end up looking disjointed from one another. A few passes of the 1.3b t2v model at low denoise do a great job of taking an upscaled video and smoothing it back into something that looks natural. The 14b t2v model produces even better results, but requires large amounts of vram and time. The 1.3b t2v model does a surprisingly good job and is pretty quick.

    This workflow primarily uses GGUF quantized models to reduce vram usage where possible. The current version runs comfortably on 12GB of vram when using the Q3 i2v model and the Q4 T5 text encoder.

    (If you are using this on less than 12GB please let me know!)

    Models Needed

    Wan 2.1 i2v 14b 480p

    Goes into models/unet

    (Use the Q6_K if you have 24GB of vram, otherwise Q4_K_M or Q3_K_M)

    Wan 2.1 t2v 1.3b

    Goes into models/diffusion_models

    umt5-xxl-encoder

    Goes into models/text_encoders

    wan 2.1 vae

    Goes into models/vae

    clip_vision_h

    Goes into models/clip_vision

    Any upscaler model. I recommend RealESRGAN_x2plus

    Goes into models/upscale_models
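
    The folder layout above can be checked with a short script before loading the workflow. This is only a sketch: the folder names come from the list above, but the file-name keywords and the `find_missing_models` helper are illustrative assumptions, since the names of the downloaded files vary by source and quantization.

```python
from pathlib import Path

# Folder names are taken from the list above. The keywords are assumptions
# about what the downloaded file names contain; adjust them to your files.
EXPECTED_MODELS = {
    "unet": "i2v",
    "diffusion_models": "t2v",
    "text_encoders": "umt5",
    "vae": "vae",
    "clip_vision": "clip_vision",
    "upscale_models": "",  # empty keyword: any upscaler file is accepted
}


def find_missing_models(models_root):
    """Return the folders under models_root that hold no matching file."""
    root = Path(models_root)
    missing = []
    for folder, keyword in EXPECTED_MODELS.items():
        directory = root / folder
        files = []
        if directory.is_dir():
            files = [p for p in directory.iterdir() if p.is_file()]
        # A folder counts as satisfied if any file name contains the keyword.
        if not any(keyword.lower() in p.name.lower() for p in files):
            missing.append(folder)
    return missing
```

    Calling `find_missing_models("ComfyUI/models")` (the path is an assumption about where ComfyUI is installed) returns the list of folders that still need a model file.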

    Triton/Torch Compile

    This workflow includes a version that uses Torch Compile and one that does not. Torch Compile speeds up generation by a substantial amount, but it requires Triton to be installed, which is not included by default on Windows. If you want to use the Torch Compile version on Windows, I suggest following the install instructions from this GitHub repository: https://github.com/woct0rdho/triton-windows

    In my own testing, Torch Compile speeds up generations for me by around 30%. I've seen reports that it doubles generation speed on some hardware configurations.
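
    To see whether the Torch Compile version is likely to work, you can check that both packages are importable from the Python environment ComfyUI runs in. This is a minimal sketch: `modules_available` is a hypothetical helper, and a successful import check does not guarantee that compilation will work on your GPU.

```python
import importlib.util


def modules_available(names):
    """Return True if every named top-level module can be found."""
    return all(importlib.util.find_spec(name) is not None for name in names)


# Torch Compile needs both PyTorch and Triton to be installed.
if modules_available(["torch", "triton"]):
    print("torch and triton found: the Torch Compile version may be usable")
else:
    print("torch or triton missing: use the version without Torch Compile")
```

    Run it with the same interpreter that launches ComfyUI; a system-wide Python may report different results.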

    Settings

    Experiment with denoise in the vid2vid section. 0.1 seems like a decent baseline. Higher values should result in slightly smoother videos, but lose more detail from the original. Lower values should keep details more consistent with the original.

    Frame Length can be lowered to run on cards with less vram, or to create a video faster. If going over 81 frames, enable the RifleXRoPE node.
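
    As a rough guide to picking a Frame Length, the sketch below converts a target duration into a frame count and flags when the RoPE node mentioned above should be enabled. The 81-frame threshold comes from the text; the 16 fps default and the 4k+1 frame-count rule are assumptions about Wan 2.1 rather than values read from this workflow, and `plan_frames` is a hypothetical helper.

```python
ROPE_NODE_THRESHOLD = 81  # from the text above: enable the node beyond this


def plan_frames(seconds, fps=16):
    """Snap a target duration to a 4k+1 frame count and flag long clips.

    fps=16 and the 4k+1 constraint are assumptions, not workflow settings.
    Returns (frame_count, needs_rope_node).
    """
    raw = max(1, round(seconds * fps))
    # Snap to the nearest count of the form 4k + 1.
    frames = 4 * round((raw - 1) / 4) + 1
    return frames, frames > ROPE_NODE_THRESHOLD
```

    Under these assumptions a 5 second clip stays at the 81-frame limit, while anything longer needs the node enabled.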

    The workflow is a little dense, but it makes it easy to tweak settings quickly.

    Description

    Add working unload model implementation

    Merge Triton and non-Triton versions using a simple mutable group

    Workflows
    Wan Video 14B i2v 480p

    Details

    Downloads: 10,833
    Platform: CivitAI
    Platform Status: Available
    Created: 6/3/2025
    Updated: 10/1/2025
    Deleted: -

    Files

    wanI2vWith720p_v13.zip