NSFW-Wan-UMT5-XXL-GGUF - CivArchive (CivitAI Archive)

NSFW-Wan-UMT5-XXL-GGUF - Q5 and Q4_K_M

NSFW

Preview 119521918

Preview 119403045

[English] This is a quantized GGUF version of the NSFW-Wan-UMT5-XXL text encoder. It is designed to be used with the Wan 2.1 video generation model in ComfyUI.

File Guide :

Pruned Model fp16 (3.4GB) = Q4_K_M (Recommended for 12GB VRAM)
Pruned Model bf16 (3.8GB) = Q5_K_M (Higher Quality)

Why use this?

Uncensored: Fixes the issue where the official Wan 2.1 model refuses to generate NSFW or specific prompts (it plays dumb).
Lightweight: GGUF format allows you to run this huge T5 encoder with much less VRAM (Q4/Q5 versions).

How to use:

Place the .gguf file in your ComfyUI/models/text_encoders/ folder.
Load it using the CLIPLoader (GGUF) node.
Set the type to wan.

Credits & Disclaimer:

Original Weights: NSFW-API/NSFW-Wan-UMT5-XXL on HuggingFace.
Conversion: Quantized by MomusAki.
If the original author (NSFW-API) has any objections to this quantization release, please contact me and I will take it down immediately.

[中文] 这是 NSFW-Wan-UMT5-XXL 文本编码器的 GGUF 量化版本。专为 ComfyUI 中的 Wan 2.1 视频生成模型打造。

文件说明:

Pruned Model fp16 (3.4GB) = Q4_K_M (Recommended for 12GB VRAM)
Pruned Model bf16 (3.8GB) = Q5_K_M (Higher Quality)

主要作用：

解除限制： 完美解决了 Wan 2.1 官方模型对 NSFW 或特定敏感提示词“装傻”、“听不懂”的问题。
节省显存： 相比原始的 safetensors 格式，GGUF 版本（Q4/Q5）大幅降低了显存占用，让 12G/16G 显卡也能流畅运行。

使用方法：

将 .gguf 文件放入 ComfyUI/models/text_encoders/ 文件夹。
在 ComfyUI 中使用 CLIPLoader (GGUF) 节点加载。
Type 选项选择 wan。

致谢与声明：

原始权重： 源自 HuggingFace 上的 NSFW-API/NSFW-Wan-UMT5-XXL。
量化制作： 由 MomusAki 转换。
本模型仅为方便社区使用的量化版本。如果原作者 (NSFW-API) 认为此发布不妥，请联系我，我会立即下架。

Workflow：https://civarchive.com/models/2417743?modelVersionId=2718266

Description

FAQ

Comments (19)

MarioKartMTAJan 31, 2026

CivitAI

Great! thank you so much!!

XPFanaticFeb 1, 2026

CivitAI

Q8?

Author

Feb 2, 2026

To be honest, the Q8 version is almost the same size as the original BF16, so I feel it might be better to just use the original if you have the resources.

But I'm open to it! If I see more than 5 requests for Q8 in this thread, I'll bake one and upload it for you guys.

fuckingtagsMar 16, 2026

+1

fuckingtagsMar 16, 2026

the fact is that for some hardware gguf is better managed than saftensors, so it's not allways a question of size...

wAmhi7Apr 18, 2026

@MomusAki plz bake one.

yamasaki78Apr 21, 2026

+1 Q8 please

brotherelFeb 1, 2026

CivitAI

you mentioned DualCLIPLoader, does this have to run along side with the umt5-xxxl encoder? or can it be loaded by itself.

Author

Feb 2, 2026

This GGUF file IS the UMT5-XXL encoder itself. You don't need to run it alongside the original one. It replaces the original huge file.

You can simply use the CLIPLoader (GGUF) node to load it by itself. If you use DualCLIPLoader, just select this file for the T5 slot and leave the other slot empty (or load CLIP-L if your specific workflow needs it for I2V vision).

wisieeen293Feb 7, 2026

@MomusAki In this case description provided is inconsistent with reality. It literally says to use DualCLIPLoader node.

Author

Feb 8, 2026· 1 reaction

@wisieeen293 It was my oversight; I have already revised the description.

RenessanceFeb 3, 2026

CivitAI

I tested the solution, and the output video quality improved...

darkdiabolo23454Feb 18, 2026

CivitAI

can this work with normal checkpoint which are not guff models?

Author

Feb 20, 2026

You need to use a node that supports GGUF, for example, by replacing the CLIPLoader node with CLIPLoader(gguf).

darkdiabolo23454Feb 24, 2026

@MomusAki i did but the erros is till happening check my other comment

darkdiabolo23454Feb 18, 2026

CivitAI

can u add worflow as well as beacause in my case its showing
Using pytorch attention in VAE

VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16

gguf qtypes: Q4_K (144), F32 (49), F16 (24), Q6_K (25)

Attempting to rebuild sentencepiece tokenizer from metadata..

!!! Exception during processing !!! only 0-dimensional arrays can be converted to Python scalars

Traceback (most recent call last):

File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 530, in execute

output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 334, in get_output_data

return_values = await asyncmap_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 308, in asyncmap_node_over_list

await process_inputs(input_dict, i)

File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 296, in process_inputs

result = f(**inputs)

^^^^^^^^^^^

File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\pig.py", line 632, in load_clip

return (self.load_patcher([clip_path], get_clip_type(type), self.load_data([clip_path])), get_device('default'))

^^^^^^^^^^^^^^^^^^^^^^^^^^^

File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\pig.py", line 613, in load_data

sd = load_gguf_clip(p)

^^^^^^^^^^^^^^^^^

File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\pig.py", line 483, in load_gguf_clip

sd['spiece_model'] = tokenizer_builder(path)

^^^^^^^^^^^^^^^^^^^^^^^

File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\gguf_connector\tkn.py", line 51, in tokenizer_builder

spm.trainer_spec.eos_id = get_field(reader,

^^^^^^^^^^^^^^^^^

File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\gguf_connector\tkn.py", line 12, in get_field

return field_type(field.parts[field.data[-1]])

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

TypeError: only 0-dimensional arrays can be converted to Python scalars

Prompt executed in 13.66 seconds

Author

Feb 24, 2026

Workflow：https://civitai.com/models/2417743?modelVersionId=2718266

darkdiabolo23454Feb 28, 2026

@MomusAki its works with the simple gguf and the diffusion models has to be gguf as well as not the safetenor model

MilitAIFeb 24, 2026

CivitAI

they being 2 in 1 sort of breaks wan2gp

Checkpoint

Wan Video 2.2 I2V-A14B

Download (Beta) View on CivitAI

Details

Downloads

4,682

Platform

CivitAI

Platform Status

Available

Created

1/31/2026

Updated

6/18/2026

Deleted

-

Files

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

Size:

3.86 GB

SHA256:

1e1ed5c5ddbdcd9d6a1e067d29925ce205008a1e069370735783c5e701004054

Mirrors

HuggingFace (2 mirrors)

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

CivitAI (1 mirrors)

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

Size:

3.40 GB

SHA256:

efb12422633da369366db5203fa6c0af60440d451514be9f597052c8591c48d5

Mirrors

HuggingFace (1 mirrors)

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

CivitAI (1 mirrors)

nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf