Mixed 8-bit microscaling quantization of Z-Image Turbo (6B S3-DiT), generated with convert_to_quant. Running it in ComfyUI requires a Blackwell GPU, comfy-kitchen, CUDA 13.x, and PyTorch 2.10 or later.
On supported hardware, inference is faster than with BF16 or FP8, and the quality loss relative to BF16 is barely noticeable.
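
The requirements above can be checked before loading the model. The following is a minimal sketch, not part of convert_to_quant or ComfyUI: the function names are hypothetical, and it assumes that Blackwell GPUs report a CUDA compute capability with major version 10 or higher. It does not check whether comfy-kitchen is installed.

```python
import re

MIN_TORCH = (2, 10)       # "PyTorch 2.10 or later", per the requirements above
REQUIRED_CUDA_MAJOR = 13  # "CUDA 13.x", per the requirements above
# Assumption: Blackwell-generation GPUs report compute capability major >= 10
# (for example 10.0 on B200 and 12.0 on RTX 50-series cards).
MIN_BLACKWELL_CC_MAJOR = 10


def parse_version(text):
    """Return (major, minor) from strings such as '2.10.0+cu130' or '13.0'."""
    match = re.match(r"(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    return int(match.group(1)), int(match.group(2))


def check_requirements(torch_version, cuda_version, compute_capability):
    """Return a list of problems; an empty list means every check passed.

    Versions are compared as integer tuples, so '2.9' sorts below '2.10'.
    """
    problems = []
    if parse_version(torch_version) < MIN_TORCH:
        problems.append(f"PyTorch {torch_version} is older than 2.10")
    if cuda_version is None:
        problems.append("PyTorch was built without CUDA support")
    elif parse_version(cuda_version)[0] != REQUIRED_CUDA_MAJOR:
        problems.append(f"CUDA {cuda_version} is not a 13.x release")
    if compute_capability is None:
        problems.append("no CUDA GPU detected")
    elif compute_capability[0] < MIN_BLACKWELL_CC_MAJOR:
        major, minor = compute_capability
        problems.append(f"compute capability {major}.{minor} is pre-Blackwell")
    return problems


def check_local_environment():
    """Run the checks against the installed PyTorch build and GPU 0."""
    import torch  # imported lazily so the helpers above work without PyTorch

    capability = None
    if torch.cuda.is_available():
        capability = torch.cuda.get_device_capability(0)
    return check_requirements(torch.__version__, torch.version.cuda, capability)
```

Calling `check_local_environment()` inside the Python environment that ComfyUI uses returns an empty list when the PyTorch, CUDA, and GPU checks all pass, and a list of readable messages otherwise.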




