WAN 2.2 S2V 14B GGUF - CivArchive (CivitAI Archive)

Wan-S2V is an AI video generation model that can transform static images and audio into high-quality videos.

WIP: working on description adding all needed infos/tools! Use with some caution 🤪

Note: S2V has a very high chance of producing some 1st "flashy" over-saturated frames. That seems a limitation of all Wan 2.2 S2V models right now.

Requirements:

lite lora for 4/8-step operation (optional)
Main Model Wan2.2-S2V-14B ComfyUI/models/unet GGUF
Audio Encoder wav2vec2_large_english ComfyUI/models/audio_encoders
Encoder Umt5-xxl ComfyUI/models/text_encoders
Wan2.1_VAE.safetensors ComfyUI/models/vae

Usage hints:

Audio file should be about same length as the video file in seconds

👂🎶 👉 Hint: Click the sample for full-screen and play from the post with SOUND ON!

Sources:

Clip: https://huggingface.co/city96/umt5-xxl-encoder-gguf/

Model: https://huggingface.co/QuantStack/Wan2.2-S2V-14B-GGUF/

Lite LoRA: https://huggingface.co/calcuis/wan2-gguf/

YOU are responsible for outputs as always! If you make ToS violating content and I get aware I WILL report this.

Wan-S2V is an AI video generation model that can transform static images and audio into high-quality videos.

Description

FAQ

Details

Files

wan22S2V14BGGUF_clipQ8.gguf

Mirrors

Available On (1 platform)

Wan-S2V is an AI video generation model that can transform static images and audio into high-quality videos.

Description

FAQ

What is WAN 2.2 S2V 14B GGUF?

Why was this model removed from CivitAI?

How do I use WAN 2.2 S2V 14B GGUF?

What should I watch out for with Wan Video models?

What other Wan Video-based models are worth knowing?

Can I use this model commercially?

What files are available and where can I download them?

Details

Files

wan22S2V14BGGUF_clipQ8.gguf

Mirrors

Available On (1 platform)