ERNIE Image Quants - CivArchive (CivitAI Archive)

Ernie in different quants:

Mixed FP8 - Mostly fp8_e4m3 some are not quantized, fast.
Mixed NVFP4 - NVFP4 except final layers to give a higher quality finish, faster than FP8
NVFP4 - Mostly NVFP4 - Fastest

Note: You will only see speedups from NVFP4 on Blackwell series NVIDIA cards.

https://ernie.baidu.com/blog/posts/ernie-image/

text_encoders

ministral-3-3b.safetensors
ernie-image-prompt-enhancer.safetensors

vae

flux2-vae.safetensors

Model Storage Location

📂 ComfyUI/
├── 📂 models/
│   ├── 📂 diffusion_models/
│   │   └── ernie-image-turbo-nvfp4.safetensors
│   ├── 📂 text_encoders/
│   │   ├── ministral-3-3b.safetensors
│   │   └── ernie-image-prompt-enhancer.safetensors
│   └── 📂 vae/
│       └── flux2-vae.safetensors

Model Storage Location

Description

FAQ

Details

Files

ernieImageQuants_turboMixedNVFP4.safetensors

Mirrors

Model Storage Location

Description

FAQ

What is ERNIE Image Quants?

How do I use ERNIE Image Quants?

What files are available and where can I download them?

Details

Files

ernieImageQuants_turboMixedNVFP4.safetensors

Mirrors