🚀 Z-Image AIO Collection
⚡ Base & Turbo • All-in-One • Bilingual Text • Qwen3-4B
⚠️ IMPORTANT: Requires ComfyUI v0.11.0+
📥 Download ComfyUI
✨ What is Z-Image AIO?
Z-Image AIO is an All-in-One repackage of Alibaba Tongyi Lab's 6B-parameter image generation models.
Everything integrated:
✅ VAE already built in
✅ Qwen3-4B Text Encoder integrated
✅ Just download and generate!
🎯 Available Versions
🔥 Z-Image-Turbo-AIO (8 Steps • CFG 1.0)
Ultra-fast generation for production & daily use
⚫ NVFP4-AIO (7.8 GB) 🚀
🎯 ONLY for NVIDIA Blackwell GPUs (RTX 50xx)!
⚡ Optimized for maximum speed
💾 Smallest file size
🚀 FP4 precision, blazing fast
Perfect for: RTX 5070, 5080, 5090 owners who want maximum speed
🟡 FP8-AIO (10 GB) ⭐ RECOMMENDED
✅ Best balance of size & quality
✅ Works on 8GB VRAM
✅ Fast downloads
✅ Ideal for most users
Perfect for: Daily use, testing, RTX 3060/4060/4070
🔵 FP16-AIO (20 GB)
💾 Same file size as BF16
🔄 ComfyUI auto-casts to BF16 for compute
⚠️ Does NOT enable FP16 compute mode
📦 Alternative download option
Note: Z-Image does not support FP16 compute - activation values exceed FP16's maximum range (~65,504), causing NaN/black images. Weights are cast to BF16 during inference regardless of file format.
Perfect for: An alternative to the BF16 download (identical inference behavior)
🟠 BF16-AIO (20 GB) ⭐ RECOMMENDED FOR FULL PRECISION
✅ BFloat16 full precision
✅ Absolute best quality
✅ Professional projects
✅ Also works on 8GB VRAM
Perfect for: Professional work, maximum quality
🎨 Z-Image-Base-AIO (28-50 Steps • CFG 3-5)
Full creative control for pros & LoRA training
🟡 FP8-AIO (10 GB)
✅ Efficient for daily use
✅ Full CFG control
✅ Negative prompts supported
✅ 8GB VRAM compatible
Perfect for: Daily work with full control
🔵 FP16-AIO (20 GB)
💾 Same file size as BF16
🔄 ComfyUI auto-casts to BF16 for compute
⚠️ Does NOT enable FP16 compute mode
📦 Alternative download option
Note: See the technical explanation in the FAQ below.
Perfect for: An alternative to the BF16 download (identical inference behavior)
🟠 BF16-AIO (20 GB) ⭐ RECOMMENDED FOR FULL PRECISION
✅ Maximum quality
✅ Ideal for LoRA training
✅ Professional projects
✅ Highest precision
Perfect for: LoRA training, professional work
🔄 Turbo vs Base: When to Use Which?
⚡ Use TURBO when:
⚡ Speed is the priority → 8 steps = 3-10 seconds
📸 Production workflows → Consistent high quality
💾 Quick iterations → Rapid prototyping
🎯 Simple prompts → Less complex scenes
🎨 Use BASE when:
🎨 Creative exploration → Higher diversity
🔧 LoRA/ControlNet dev → Undistilled foundation
📝 Complex prompting → Full CFG control
🚫 Negative prompts needed → Remove unwanted elements
⚙️ Recommended Settings
⚡ Turbo Settings (incl. NVFP4)
📊 Steps: 8
🎚️ CFG: 1.0 (don't change!)
🎲 Sampler: res_multistep OR euler_ancestral
📅 Scheduler: simple OR beta
📐 Resolution: 1920×1088 (recommended)
🚫 Negative Prompt: ❌ Not used!
🎨 Base Settings
📊 Steps: 28-50
🎚️ CFG: 3.0-5.0 (start with 4.0)
🎲 Sampler: euler ✅ OR dpmpp_2m
📅 Scheduler: normal ✅ OR karras
📐 Resolution: 512×512 to 2048×2048
🚫 Negative Prompt: ✅ Fully supported!
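If you drive ComfyUI through its API (JSON) workflow format instead of the graph UI, these settings map onto the stock KSampler node roughly as follows. This is a sketch only: the field names are KSampler's real inputs, but the rest of the Z-Image graph (model loader, latent node, etc.) is omitted here and may differ in the official workflow.

```python
# Turbo: 8 steps, CFG locked at 1.0, no negative prompt
turbo_sampler = {
    "class_type": "KSampler",
    "inputs": {
        "steps": 8,
        "cfg": 1.0,                       # don't change for Turbo
        "sampler_name": "res_multistep",  # or "euler_ancestral"
        "scheduler": "simple",            # or "beta"
        "denoise": 1.0,
        "seed": 42,
        # "model", "positive", "negative", "latent_image" link to other nodes
    },
}

# Base: more steps, real CFG, negative prompts supported
base_sampler = {
    "class_type": "KSampler",
    "inputs": {
        "steps": 28,              # 28-50
        "cfg": 4.0,               # 3.0-5.0, start at 4.0
        "sampler_name": "euler",  # or "dpmpp_2m"
        "scheduler": "normal",    # or "karras"
        "denoise": 1.0,
        "seed": 42,
    },
}
```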
📋 Quick Overview
Turbo Versions
⚫ NVFP4 → 7.8 GB → RTX 50xx only → Max Speed 🚀
🟡 FP8 → 10 GB → 8GB VRAM → Recommended ⭐
🔵 FP16 → 20 GB → BF16 compute → See FAQ ⚠️
🟠 BF16 → 20 GB → 8GB VRAM → Max Quality ⭐
Base Versions
🟡 FP8 → 10 GB → 8GB VRAM → Efficient
🔵 FP16 → 20 GB → BF16 compute → See FAQ ⚠️
🟠 BF16 → 20 GB → 8GB VRAM → LoRA Training ⭐
💡 Prompting Guide
✅ Good Example:
Professional food photography of artisan breakfast plate.
Golden poached eggs on sourdough toast, crispy bacon, fresh
avocado slices. Morning sunlight creating warm glow. Shallow
depth of field, magazine-quality presentation.
❌ Bad Example:
breakfast, eggs, bacon, toast, food, morning, plate
📌 Tips
DO:
✅ Use natural language
✅ Be detailed (100-300 words)
✅ Describe lighting & mood
✅ Specify the camera angle
✅ English OR Chinese (or both!)
DON'T:
❌ Tag-style prompts (tag1, tag2, tag3)
❌ Very short prompts (under 50 words)
❌ Negative prompts with Turbo
🌏 Bilingual Text Rendering
English:
Neon sign reading "OPEN 24/7" in bright blue letters
above entrance. Modern sans-serif font, glowing effect.
Chinese:
Traditional tea house entrance with sign reading
"古韵茶坊" in elegant gold Chinese calligraphy.
Both:
Modern cafe with bilingual sign. "Morning Brew" in
white script above, "晨曦咖啡" in Chinese below.
📥 Installation
Step 1: Download
Choose your version based on:
GPU: RTX 50xx → NVFP4 possible
VRAM: 8GB → FP8 recommended
Purpose: LoRA training → Base BF16
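If you're unsure which bucket your GPU falls into, a quick PyTorch check helps. The compute-capability cutoffs below are assumptions derived from the table above (Blackwell for NVFP4, Ada or newer for native FP8), not something this page verifies:

```python
# Requires a CUDA build of PyTorch and an NVIDIA GPU
import torch

major, minor = torch.cuda.get_device_capability(0)
print(f"{torch.cuda.get_device_name(0)}: sm_{major}{minor}, "
      f"native BF16: {torch.cuda.is_bf16_supported()}")

if major >= 12:                 # assumption: consumer Blackwell (RTX 50xx)
    print("Candidate: NVFP4-AIO (or FP8-AIO)")
elif (major, minor) >= (8, 9):  # Ada (RTX 40xx) and newer
    print("Candidate: FP8-AIO")
else:
    print("Candidate: FP8-AIO or BF16-AIO - compute runs in BF16 either way (see FAQ)")
```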
Step 2: Place File
ComfyUI/models/checkpoints/
└── Z-Image-Turbo-FP8-AIO.safetensors
Step 3: Load & Generate
Open ComfyUI (v0.11.0+!)
Use "Load Checkpoint" node
Select your AIO version
Generate!
No separate VAE or Text Encoder needed!
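For scripted generation, ComfyUI also accepts workflows over its local HTTP API. The sketch below queues one Turbo render with the recommended settings. It uses stock ComfyUI node types (CheckpointLoaderSimple, KSampler, VAEDecode); the official Z-Image workflow may wire things differently (for example a model-specific latent node), so treat it as a starting point:

```python
import json
import urllib.request

graph = {
    "1": {"class_type": "CheckpointLoaderSimple",   # AIO file: model + text encoder + VAE
          "inputs": {"ckpt_name": "Z-Image-Turbo-FP8-AIO.safetensors"}},
    "2": {"class_type": "CLIPTextEncode",
          "inputs": {"clip": ["1", 1],
                     "text": "Professional food photography of an artisan breakfast plate, "
                             "morning sunlight, shallow depth of field."}},
    "3": {"class_type": "CLIPTextEncode",           # Turbo ignores negatives at CFG 1.0
          "inputs": {"clip": ["1", 1], "text": ""}},
    "4": {"class_type": "EmptyLatentImage",         # assumption: stock latent node
          "inputs": {"width": 1920, "height": 1088, "batch_size": 1}},
    "5": {"class_type": "KSampler",
          "inputs": {"model": ["1", 0], "positive": ["2", 0], "negative": ["3", 0],
                     "latent_image": ["4", 0], "seed": 42, "steps": 8, "cfg": 1.0,
                     "sampler_name": "res_multistep", "scheduler": "simple",
                     "denoise": 1.0}},
    "6": {"class_type": "VAEDecode",
          "inputs": {"samples": ["5", 0], "vae": ["1", 2]}},
    "7": {"class_type": "SaveImage",
          "inputs": {"images": ["6", 0], "filename_prefix": "z-image-turbo"}},
}

req = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",                 # ComfyUI's default local API
    data=json.dumps({"prompt": graph}).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
print(urllib.request.urlopen(req).read().decode())  # returns the queued prompt id
```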
🙏 Credits
Original Model
👨‍💻 Developer: Tongyi Lab (Alibaba Group)
🏗️ Architecture: Single-Stream DiT (6B parameters)
📜 License: Apache 2.0
Links
🔗 Z-Image Base: https://huggingface.co/Tongyi-MAI/Z-Image
🔗 Z-Image Turbo: https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
🧠 Text Encoder: https://huggingface.co/Qwen/Qwen3-4B
📅 Version History
v2.2 - FP16 Clarification
📝 Updated FP16 descriptions for technical accuracy
⚠️ Clarified: FP16 weights ≠ FP16 compute
🔄 FP16 files are cast to BF16 during inference
v2.1 - NVFP4 Release 🚀
✅ Z-Image-Turbo-NVFP4-AIO (7.8 GB)
⚡ Optimized for NVIDIA Blackwell (RTX 50xx)
🚀 Maximum-speed generation
v2.0 - Base AIO Release
✅ Z-Image-Base-BF16-AIO
✅ Z-Image-Base-FP16-AIO
✅ Z-Image-Base-FP8-AIO
🔄 ComfyUI v0.11.0+ support
🧠 Qwen3-4B Text Encoder
v1.1 - FP16 Added
✅ Z-Image-Turbo-FP16-AIO
🔧 Wider GPU compatibility
v1.0 - Initial Release
✅ Z-Image-Turbo-FP8-AIO
✅ Z-Image-Turbo-BF16-AIO
✅ Integrated VAE + Text Encoder
❓ FAQ
Q: Which version should I choose?
RTX 50xx + speed → NVFP4 🚀
Most users → Turbo FP8 ✅
Full precision → BF16 ✅
LoRA training → Base BF16
Q: Turbo or Base?
Fast & simple → Turbo ⚡
Full control → Base 🎨
Q: Will NVFP4 work on my RTX 4090?
❌ No! NVFP4 is only for RTX 50xx (Blackwell architecture).
Use FP8 instead for RTX 40xx and older.
Q: Do I need a separate VAE/Text Encoder?
❌ No! Everything is already integrated.
Just Load Checkpoint and go!
Q: Does it work on 8GB VRAM?
✅ Yes! All versions work on 8GB VRAM.
(NVFP4 requires RTX 50xx regardless of VRAM)
⚠️ Q: What about FP16 for older GPUs (RTX 2000/3000)?
Important technical clarification:
Z-Image does NOT support the FP16 compute type. Here's why:
📊 Technical reason:
- FP16 max value: ~65,504
- BF16 max value: ~3.39e+38 (effectively the same range as FP32)
- Z-Image's activation values exceed FP16's range
- Result: overflow → NaN → black images
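You can reproduce the overflow yourself in a few lines of PyTorch (a standalone sketch, not Z-Image code):

```python
import torch

x = torch.tensor(70000.0)       # just above FP16's ~65,504 ceiling
print(x.to(torch.float16))      # inf -> the value overflows in FP16
print(x.to(torch.bfloat16))     # ~70144. -> coarser precision, but no overflow
print(x.to(torch.float16) - x.to(torch.float16))  # inf - inf = nan
```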
What actually happens:
ComfyUI automatically casts the weights to BF16 for computation.
You can see this in the logs: "model weight dtype X, manual cast: torch.bfloat16"
"Weight dtype" (file format) ≠ "compute dtype" (actual calculation)
For RTX 20xx users (no native BF16):
BF16 is emulated via FP32 - slower, but it works.
There is no way to run Z-Image in true FP16 compute.
FP8 with CPU offload may be a better option for limited VRAM.
TL;DR: FP16 and BF16 files behave identically during inference. Choose based on download preference, not GPU compatibility.
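To confirm the "weight dtype" half of that distinction yourself, you can read the safetensors header, which is plain JSON, without loading the model (the path below is an example):

```python
import json, struct

path = "ComfyUI/models/checkpoints/Z-Image-Turbo-FP8-AIO.safetensors"
with open(path, "rb") as f:
    header_len = struct.unpack("<Q", f.read(8))[0]  # safetensors: u64 header size
    header = json.loads(f.read(header_len))

# Collect the dtype of every tensor in the file (skip the metadata entry)
dtypes = {v["dtype"] for k, v in header.items() if k != "__metadata__"}
print(dtypes)  # e.g. {"F8_E4M3", "BF16"} depending on which file you grabbed
```

Whatever this prints, the compute dtype at inference time is still BF16, as described above.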
🚀 Get Started Now!
Download → Load Checkpoint → Generate!
Recommended versions:
🟡 FP8 for most users (best size/quality balance)
🟠 BF16 for maximum quality
⚫ NVFP4 for RTX 50xx speed
All versions work on 8GB VRAM.
Happy generating! 🎨
Description
! Update ! FP16 version released
Why an FP16 version?
FP16 is natively supported by most older GPUs, especially NVIDIA GPUs below the 4000 series, which makes the FP16 file a familiar choice for users running older hardware. (Note, per the FAQ above: the FP16 and BF16 files behave identically at inference time, since ComfyUI casts both to BF16 for compute.)
NVIDIA GTX 1000 / RTX 2000 / RTX 3000 / RTX 4000
The other two versions, BF16 and FP8, are recommended if you are using an NVIDIA 4000 series GPU or newer, as those architectures are optimized for these formats and can take better advantage of them.
In short:
FP16 → best download compatibility for older GPUs (compute still runs in BF16)
BF16 / FP8 → best choice for NVIDIA 4000 series and newer
This update simply gives everyone the option to use the version that fits their hardware best.