CivArchive
    【WAN2.1】TXT to VIDEO - v1.3
    NSFW
    Preview 63521935
    Preview 63521944

    Description:

    This workflow allows you to generate video from text.

    You will find a step-by-step guide to using this workflow here: link

    My other workflows for WAN: link

    Resources you need:

    📂Files :

    For base version
    T2V Model: fp16, fp8
    In models/diffusion_models

    For GGUF version
    T2V Quant Model: Q8, Q5, Q3
    In models/diffusion_models

    Common files :
    CLIP: umt5_xxl_fp8_e4m3fn_scaled.safetensors
    in models/clip

    VAE: wan_2.1_vae.safetensors
    in models/vae

    Speed LoRA: lightx2v_T2V_14B_cfg_step_distill_v2_lora_rank64_bf16.safetensors
    in models/loras

    ANY upscale model:

    in models/upscale_models

    📦Custom Nodes :

    Description

    Bugfix: Cleared the cache after generation to limit bugs during upscaling

    Added: Frame interpolation, saving the last frame, model shifting

    FAQ

    Comments (7)

    yajukunMar 15, 2025· 2 reactions
    CivitAI

    Great workflow, works well right out of the box. Thank you for posting this.

    synalon973Mar 15, 2025
    CivitAI

    A great workflow as always, it's my go to for Text to Video with Wan and works really well. It is very well laid out and very adjustable to cover most peoples needs and systems.

    The added guide you have produced also raises it to another level in my opinion at least.
    Sometimes I get slightly distorted faces with full body videos, so maybe a facedetailer or face swap node will help me there I'm just not sure how to add it.
    Also is it possible to add a wildcard node at the start somewhere to help with random inspiration?

    gilma13Mar 15, 2025
    CivitAI

    Hey, amazing work as always, i do have an issue though, if anybody has a solution i would appreciate sharing!
    using the txt to video workflows, tried multiple versions, i just get random noise on a black background, using everything exactly as the image in the workflow and going step by step according to the guide except the model since i have 4070 with 12gb of ram so i took the smaller model
    here an example of the outputs i get:
    https://imgur.com/a/55y3JDY
    edit: img to video does work without issue

    UmeAiRT
    Author
    Mar 15, 2025· 3 reactions

    It's because you are using a Image to video model in a text to video workflow.

    You can download T2V model here : city96/Wan2.1-T2V-14B-gguf at main

    gilma13Mar 15, 2025· 3 reactions

    @UmeAiRT I feel stupid now XD
    Thank you!

    DiffusionGuruMar 20, 2025

    @UmeAiRT I was about to ask the same thing. It looks like the Quant Model in the description of this page is the I2V, rather than the T2V model. Thanks for answering, though!

    UmeAiRT
    Author
    Mar 21, 2025

    @DiffusionGuru I modified the link 2 days ago and it was a bad copy and paste, everything is corrected now.

    Workflows
    Wan Video

    Details

    Downloads
    754
    Platform
    CivitAI
    Platform Status
    Available
    Created
    3/14/2025
    Updated
    5/25/2026
    Deleted
    -

    Files

    WAN21TXTToVIDEO_v13.zip

    Mirrors

    CivitAI (1 mirrors)