CivArchive
    NSFW-Wan-UMT5-XXL-GGUF - Q5 and Q4_K_M
    NSFW
    Preview 119521918
    Preview 119403045

    [English] This is a quantized GGUF version of the NSFW-Wan-UMT5-XXL text encoder. It is designed to be used with the Wan 2.1 video generation model in ComfyUI.

    File Guide :

    • Pruned Model fp16 (3.4GB) = Q4_K_M (Recommended for 12GB VRAM)

    • Pruned Model bf16 (3.8GB) = Q5_K_M (Higher Quality)

    Why use this?

    • Uncensored: Fixes the issue where the official Wan 2.1 model refuses to generate NSFW or specific prompts (it plays dumb).

    • Lightweight: GGUF format allows you to run this huge T5 encoder with much less VRAM (Q4/Q5 versions).

    How to use:

    1. Place the .gguf file in your ComfyUI/models/text_encoders/ folder.

    2. Load it using the CLIPLoader (GGUF) node.

    3. Set the type to wan.

    Credits & Disclaimer:

    • Original Weights: NSFW-API/NSFW-Wan-UMT5-XXL on HuggingFace.

    • Conversion: Quantized by MomusAki.

    • If the original author (NSFW-API) has any objections to this quantization release, please contact me and I will take it down immediately.


    [中文] 这是 NSFW-Wan-UMT5-XXL 文本编码器的 GGUF 量化版本。 专为 ComfyUI 中的 Wan 2.1 视频生成模型打造。

    文件说明:

    • Pruned Model fp16 (3.4GB) = Q4_K_M (Recommended for 12GB VRAM)

    • Pruned Model bf16 (3.8GB) = Q5_K_M (Higher Quality)

    主要作用:

    • 解除限制: 完美解决了 Wan 2.1 官方模型对 NSFW 或特定敏感提示词“装傻”、“听不懂”的问题。

    • 节省显存: 相比原始的 safetensors 格式,GGUF 版本(Q4/Q5)大幅降低了显存占用,让 12G/16G 显卡也能流畅运行。

    使用方法:

    1. .gguf 文件放入 ComfyUI/models/text_encoders/ 文件夹。

    2. 在 ComfyUI 中使用 CLIPLoader (GGUF) 节点加载。

    3. Type 选项选择 wan

    致谢与声明:

    • 原始权重: 源自 HuggingFace 上的 NSFW-API/NSFW-Wan-UMT5-XXL

    • 量化制作: 由 MomusAki 转换。

    • 本模型仅为方便社区使用的量化版本。如果原作者 (NSFW-API) 认为此发布不妥,请联系我,我会立即下架。

    Description

    FAQ

    Comments (19)

    MarioKartMTAJan 31, 2026
    CivitAI

    Great! thank you so much!!

    XPFanaticFeb 1, 2026
    CivitAI

    Q8?

    MomusAki
    Author
    Feb 2, 2026

    To be honest, the Q8 version is almost the same size as the original BF16, so I feel it might be better to just use the original if you have the resources.

    ​But I'm open to it! If I see more than 5 requests for Q8 in this thread, I'll bake one and upload it for you guys.

    fuckingtagsMar 16, 2026

    +1

    fuckingtagsMar 16, 2026

    the fact is that for some hardware gguf is better managed than saftensors, so it's not allways a question of size...

    wAmhi7Apr 18, 2026

    @MomusAki plz bake one.

    yamasaki78Apr 21, 2026

    +1 Q8 please

    brotherelFeb 1, 2026
    CivitAI

    you mentioned DualCLIPLoader, does this have to run along side with the umt5-xxxl encoder? or can it be loaded by itself.

    MomusAki
    Author
    Feb 2, 2026

    This GGUF file IS the UMT5-XXL encoder itself. You don't need to run it alongside the original one. It replaces the original huge file.

    ​You can simply use the CLIPLoader (GGUF) node to load it by itself. If you use DualCLIPLoader, just select this file for the T5 slot and leave the other slot empty (or load CLIP-L if your specific workflow needs it for I2V vision).

    wisieeen293Feb 7, 2026

    @MomusAki In this case description provided is inconsistent with reality. It literally says to use DualCLIPLoader node.

    MomusAki
    Author
    Feb 8, 2026· 1 reaction

    @wisieeen293 It was my oversight; I have already revised the description.

    RenessanceFeb 3, 2026
    CivitAI
    I tested the solution, and the output video quality improved...


    darkdiabolo23454Feb 18, 2026
    CivitAI

    can this work with normal checkpoint which are not guff models?

    MomusAki
    Author
    Feb 20, 2026

    You need to use a node that supports GGUF, for example, by replacing the CLIPLoader node with CLIPLoader(gguf).

    darkdiabolo23454Feb 24, 2026

    @MomusAki i did but the erros is till happening check my other comment

    darkdiabolo23454Feb 18, 2026
    CivitAI

    can u add worflow as well as beacause in my case its showing
    Using pytorch attention in VAE

    VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16

    gguf qtypes: Q4_K (144), F32 (49), F16 (24), Q6_K (25)

    Attempting to rebuild sentencepiece tokenizer from metadata..

    !!! Exception during processing !!! only 0-dimensional arrays can be converted to Python scalars

    Traceback (most recent call last):

    File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 530, in execute

    output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)

    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

    File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 334, in get_output_data

    return_values = await asyncmap_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)

    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

    File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 308, in asyncmap_node_over_list

    await process_inputs(input_dict, i)

    File "C:\Users\sache\AppData\Local\Programs\ComfyUI\resources\ComfyUI\execution.py", line 296, in process_inputs

    result = f(**inputs)

    ^^^^^^^^^^^

    File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\pig.py", line 632, in load_clip

    return (self.load_patcher([clip_path], get_clip_type(type), self.load_data([clip_path])), get_device('default'))

    ^^^^^^^^^^^^^^^^^^^^^^^^^^^

    File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\pig.py", line 613, in load_data

    sd = load_gguf_clip(p)

    ^^^^^^^^^^^^^^^^^

    File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\pig.py", line 483, in load_gguf_clip

    sd['spiece_model'] = tokenizer_builder(path)

    ^^^^^^^^^^^^^^^^^^^^^^^

    File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\gguf_connector\tkn.py", line 51, in tokenizer_builder

    spm.trainer_spec.eos_id = get_field(reader,

    ^^^^^^^^^^^^^^^^^

    File "C:\Users\sache\Documents\ComfyUI\custom_nodes\gguf\gguf_connector\tkn.py", line 12, in get_field

    return field_type(field.parts[field.data[-1]])

    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

    TypeError: only 0-dimensional arrays can be converted to Python scalars

    Prompt executed in 13.66 seconds

    darkdiabolo23454Feb 28, 2026

    @MomusAki its works with the simple gguf and the diffusion models has to be gguf as well as not the safetenor model

    MilitAIFeb 24, 2026
    CivitAI

    they being 2 in 1 sort of breaks wan2gp

    Checkpoint
    Wan Video 2.2 I2V-A14B

    Details

    Downloads
    4,682
    Platform
    CivitAI
    Platform Status
    Available
    Created
    1/31/2026
    Updated
    6/18/2026
    Deleted
    -

    Files

    nsfwWanUMT5XXLGGUF_q5AndQ4KM.gguf

    Mirrors