CivArchive
    Wan2.1-VACE-14B(pro) - 3-s

    👑 Wan2.1-VACE-14B (LoRA Accelerated): 10x Speed with CausVid LoRA for 3-Step Video Creation

    🎬 Skyrocket Your Video Creation: Achieve ~10x Speed with Wan2.1 & the CausVid LoRA! 🎬

    📌 Overview

    The Wan2.1-VACE-14B video diffusion model, when supercharged by the CausVid LoRA, is designed for high-quality, highly efficient video generation. It particularly excels at 480p and 720p resolutions through a streamlined 3-step ComfyUI workflow. This guide will walk you through the setup process to unlock this accelerated video generation capability, including options for full precision and quantized models like the fast Q3KL GGUF.

    🔑 Key Components

    1. Diffusion Model (14B):

      • Full Precision: wan2.1_vace_14B_fp16.safetensors (Recommended for compatibility with LoRA examples)

      • Quantized (Civitai): wan2.1_vace_14B_Q4KM.safetensors

      • Quantized (GGUF - Civitai): wan2.1_vace_14B_Q3kl.gguf (Used in the 5-min example, requires GGUF loader)

        These are not the same as the GGUF files on Hugging Face (something is missing from those files). I tested the Hugging Face version, and it did not work for vid2vid tasks, so I built my own quantizations specifically for vid2vid. They are optimized and structured differently to ensure compatibility and better results. If you need another quantization type, leave a comment after testing the Hugging Face one.

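    2. Performance LoRA (Essential for Speed): Wan21_CausVid_14B_T2V_lora_rank32.safetensors

    3. VAE: wan_2.1_vae.safetensors

    4. Text Encoder: Choose one: umt5_xxl_fp16.safetensors or the fp8 version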

    📁 File Organization

    Place the downloaded files in the following structure within your ComfyUI directory:

    ComfyUI/
    ├── models/
    │   ├── diffusion_models/
    │   │   └── wan2.1_vace_14B_fp16.safetensors  # Or Q4KM.safetensors, or Q3kl.gguf
    │   ├── text_encoders/
    │   │   └── umt5_xxl_fp16.safetensors         # Or the fp8 version
    │   ├── loras/
    │   │   └── Wan21_CausVid_14B_T2V_lora_rank32.safetensors
    │   └── vae/
    │       └── wan_2.1_vae.safetensors
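
    Before launching ComfyUI, it can help to confirm the files landed where the layout above expects them. The following is a minimal sketch, not part of the original workflow: the helper name find_missing and the "ComfyUI" root path are assumptions for illustration, and the file list assumes the fp16 variants.

```python
from pathlib import Path

# Relative paths taken from the layout above; swap in the quantized
# diffusion model or the fp8 text encoder if that is what you downloaded.
EXPECTED_FILES = [
    "models/diffusion_models/wan2.1_vace_14B_fp16.safetensors",
    "models/text_encoders/umt5_xxl_fp16.safetensors",
    "models/loras/Wan21_CausVid_14B_T2V_lora_rank32.safetensors",
    "models/vae/wan_2.1_vae.safetensors",
]


def find_missing(comfy_root, expected=EXPECTED_FILES):
    """Return the expected relative paths that do not exist under comfy_root."""
    root = Path(comfy_root)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    # "ComfyUI" is an assumed install location; adjust it to your setup.
    missing = find_missing("ComfyUI")
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected model files are in place.")
```

    Run it from the directory that contains your ComfyUI folder; any path it prints still needs to be downloaded or moved.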
    

    🎨 Model Showcase: Rapid 720p Cinematic Shots

    This setup, featuring Wan2.1-VACE-14B and the CausVid LoRA, excels at producing 720p (and 480p) video clips with remarkable speed, even faster with quantized GGUF models. It's ideal for quick iterations, creative experimentation, and efficient content creation, all streamlined by a 3-step workflow.

    💡 Usage Tips

    • Model & LoRA Configuration: For maximum speed and quality, ensure you are using the appropriate 14B model (e.g., wan2.1_vace_14B_fp16.safetensors or wan2.1_vace_14B_Q3kl.gguf) paired with the Wan21_CausVid_14B_T2V_lora_rank32.safetensors LoRA. The LoRA should be applied with a strength typically around 1.0.

    • Text Encoder: The umt5_xxl_fp16.safetensors text encoder is recommended for best compatibility with existing examples and Kijai's original demonstrations. The fp8 version can save VRAM.

    • Resolution: This setup is optimized for 480p and 720p video generation.

    • Performance Gains:

      • Without LoRA (fp16): An 81-frame 720p video might take ~40 minutes on an RTX 4090.

      • With CausVid LoRA (fp16): The same video can be generated in ~4 minutes on an RTX 4090.

      • With CausVid LoRA & Q3KL GGUF: Comparable speed, around 5 minutes or less for similar output on capable hardware with a GGUF loader.

    • Workflow Simplicity: The primary advantage, beyond speed, is the reduction to a 3-step generation process once models are loaded. This typically involves: 1. Prompting (Text Input), 2. KSampler (or equivalent node with LoRA and chosen model), 3. Video Combine (Output).
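
    The "~10x" figure follows directly from the approximate fp16 timings quoted under Performance Gains. As a small worked example (the function names are illustrative, and the timings are the rough RTX 4090 numbers from this guide, not new measurements):

```python
def speedup(baseline_minutes, accelerated_minutes):
    """Return how many times faster the accelerated run is than the baseline."""
    if baseline_minutes <= 0 or accelerated_minutes <= 0:
        raise ValueError("timings must be positive")
    return baseline_minutes / accelerated_minutes


def seconds_per_frame(total_minutes, frames):
    """Average wall-clock seconds spent per generated frame."""
    return total_minutes * 60 / frames


FRAMES = 81  # clip length used in the quoted timings

print(speedup(40, 4))                            # 10.0
print(round(seconds_per_frame(40, FRAMES), 1))   # 29.6 (without the LoRA)
print(round(seconds_per_frame(4, FRAMES), 1))    # 3.0  (with the CausVid LoRA)
```

    Plugging in your own timings for the same clip length gives a like-for-like comparison on your hardware.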

    🌟 Credits & Acknowledgements

    • Original Wan 2.1 models repackaged for ComfyUI by Comfy-Org: Wan 2.1 ComfyUI Repackaged on Hugging Face.

    • The performance-boosting CausVid LoRA (Wan21_CausVid_14B_T2V_lora_rank32.safetensors) was extracted and shared by Kijai. Original announcement and details: Kijai's Reddit Post.

    • Quantized GGUF and Safetensors versions are available on Civitai, enabling broader accessibility and speed.

    • Gratitude to the developers of the underlying CausVid technique (presumably available under an MIT License or similar open terms).

    πŸ‘¨β€πŸ’» Developer Information

    This guide was created by Abdallah Al-Swaiti:

    1. Hugging Face

    2. GitHub

    3. LinkedIn

    4. ComfyUI-OllamaGemini

    For additional tools and updates, check out my other repositories.

    ✨ Create Dreamy Videos with WAN 2.1 VACE and Pastel Dream! ✨


    Comments (11)

    AbdallahAlswa80
    Author
    May 26, 2025 · 1 reaction
    CivitAI

    If you hit a CUDAGraph error, start ComfyUI with this command:
    python main.py --preview-method auto --force-fp16 --dont-upcast-attention --use-sage-attention --disable-cuda-malloc
    (I prefer inductor, with this command: python main.py --preview-method auto --force-fp16 --dont-upcast-attention --use-sage-attention)

    Jellai
    May 27, 2025 · 1 reaction
    CivitAI

    So people know, the new AccVideo Wan model (And presumably the Lora too) I think makes higher quality outputs than CausVid, speeding things up quite a bit. Swap out the CausVid Lora for the AccVideo Lora, make sure the CFG is 1, and set the steps to between 10 and 20. 10 is recommended, but I personally got better results adding more. The thing is, the steps take less time each, so 20 steps does not take as much time as 20 steps with either standard Wan, or CausVid.

    Kijai has made the Lora and model available here:

    https://huggingface.co/Kijai/WanVideo_comfy/tree/main

    AbdallahAlswa80
    Author
    May 27, 2025

    Did you try the workflow?

    rainerWfFb
    Jun 3, 2025 · 1 reaction
    CivitAI

    Sounds great... But I get this error: shape '[81, 120, 8, 96, 8]' is invalid for input of size 59781888

    AbdallahAlswa80
    Author
    Jun 4, 2025

    Try another video, or resize it (set the values on the upscale node, e.g. 768 for the longer side).

    seductivelyai695
    Jun 6, 2025 · 1 reaction
    CivitAI

    Oh no, what did you do to the Bee Gees?

    AbdallahAlswa80
    Author
    Jun 6, 2025

    If they had this effect, they would use it! What do you think? (https://www.youtube.com/watch?v=4V90AmXnguw)

    seductivelyai695
    Jun 7, 2025

    @AbdallahAlswa80 oh yea, of course I was only joking... but good video

    seductivelyai695
    Jun 7, 2025
    CivitAI

    API Error: 429 You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. [violations {

    quota_metric:

    Oh no!!! Google doesn't like me

    AbdallahAlswa80
    Author
    Jun 8, 2025

    Try another Gemini Flash model; it's free.

    delta45424155
    Jun 13, 2025
    CivitAI

    For a 50080 video card. What resolution would you recommend using?

    Workflows
    Wan Video 14B i2v 720p

    Details

    Downloads: 1,655
    Platform: CivitAI
    Platform Status: Available
    Created: 5/26/2025
    Updated: 5/12/2026
    Deleted: -

    Files

    wan21VACE14BPro_3S.zip

    Mirrors

    CivitAI (1 mirrors)