Wan 2.1 Text to Image - CivArchive (CivitAI Archive)

Wan 2.1 Text to Image - Umt5-xxl-encoder

NSFW

Wan 2.1 Text-to-Image!

Who knew Wan 2.1 was an absolute beast at generating stunning, single-frame text-to-image outputs? Well… you do now.

Originally trained for rendering video, Wan 2.1 wasn’t meant to be a full-blown T2I model - but it turns out, this thing absolutely slaps when it comes to creating high-detail, expressive, and stylish compositions from simple prompts. Anime scenes, suggestive portraits, or moody cinematic stills, Wan 2.1 brings an uncensored edge and a surprising amount of depth, lighting finesse, and expressiveness to every gen.

This model card contains everything you need to get started using Wan 2.1 as an image generator with ~12 to 16 GB VRAM, utilizing GGUF models. You will need some custom ComfyUI nodes, so make sure ComfyUI is up to date, and pull those in with the Comfy Manager!

Description

Q5_K_M GGUF

FAQ

Comments (18)

Adel_AIJul 9, 2025· 6 reactions

CivitAI

Quite pleasantly surprised by the renderings of Wan 2.1, which is supposed to do T2V, I find it performs significantly better than some T2I models, these images even seem more realistic than those from Flux1.Dev.

It runs well on an 8 GB VRAM/32 GB RAM configuration, even faster than Flux.

I obtained good results with DPM++ 2M Karras 20 steps.

Thank you for sharing this original approach.

amazingbeautyJul 10, 2025

how it faster than flux if we said we run both at 20 steps for same resolution ?

Adel_AIJul 10, 2025

@amazingbeauty faster than flux with the same number of steps, and higher resolution

DaddyWolfgangJul 14, 2025

Yeah, many things are better than flux. Illustrious and NOobAI are worlds better. No idea why people ride Flux so hard.

amazingbeautyJul 15, 2025

@DaddyWolfgang i can use it with the txt2vid wan 14b ? just pointing to a single frame ? which model best 720p or 480p ?

VCominosJul 10, 2025· 1 reaction

CivitAI

Yep I knew, That's why I am trying to improve Wan to be the ultimate AI gen, T2I, T2V, I2V and V2V

zerocool22Jul 12, 2025

CivitAI

I have this error:

https://i.imgur.com/9JH7Wa1.png

theallyJul 12, 2025

No clue why those aren't working, but you don't really need the Clean VRAM Used and Clear Cache All nodes - just connect the image output from Fast Film Grain directly to the Save Image.

CyberoJul 17, 2025· 1 reaction

CivitAI

This is probably the best model for creating high‑resolution images with great detail. For now, I’ve put Flux on hold.

meryruizk332Jul 19, 2025· 1 reaction

CivitAI

Will LORAs work with it? Like the Wan LORAs?

theallyJul 20, 2025· 1 reaction

They do!

meryruizk332Jul 28, 2025

theally Thank you

cosmicsugarJul 28, 2025

are there character loras or just video loras?

danicht945Jul 20, 2025· 1 reaction

CivitAI

I get this error:

'ModelSamplingAdvanced' object has no attribute 'log_sigmas'

park0167444Jul 20, 2025· 2 reactions

CivitAI

I have this error:

ModelPatchTorchSettings

Failed to set fp16 accumulation, this requires pytorch 2.7.0 nightly currently

cosmicsugarJul 28, 2025· 1 reaction

I just bypassed it and it worked

SamohtAug 22, 2025

o meu deu o mesmo erro como voce resolveu ?

cooperdkSep 21, 2025

Just update pytorch to at least 2.7.0, obviously.

Workflows

Other

by CivitaiOfficial

Download (Beta) View on CivitAI

tool

text2img

wan 2.1

Details

Downloads

706

Platform

CivitAI

Platform Status

Available

Created

7/8/2025

Updated

6/11/2026

Deleted

Files

wan21TextToImage_umt5XxlEncoder.zip

Size:

3.86 GB

SHA256:

83ac363f33d6a469b9bb4d8487d34e30a64ba833541e0d260586ebe802bdc521

Mirrors

CivitAI (1 mirrors)

wan21TextToImage_umt5XxlEncoder.zip

Wan 2.1 Text-to-Image!

Description

FAQ

What is Wan 2.1 Text to Image?

What files are available and where can I download them?

Comments (18)

Details

Files

wan21TextToImage_umt5XxlEncoder.zip

Mirrors