CivArchive
    Preview 126477661

    ๐ŸŽŒ Z-Anime | Full Anime Fine-Tune on Z-Image Base

    Full Fine-Tune โ€ข Rich Aesthetics โ€ข Strong Diversity โ€ข Full Negative Prompt Support

    BF16 & FP8 โ€ข Natural Language Prompts โ€ข 8GB VRAM


    โœจ What is Z-Anime?

    Z-Anime is a full fine-tune of Alibaba's Z-Image (Base) architecture โ€” not a LoRA merge, but a completely retrained model optimized for anime aesthetics from the ground up.

    Built on the S3-DiT (Single-Stream Diffusion Transformer) with 6 billion parameters, Z-Anime inherits everything that makes Z-Image Base special: rich diversity, strong controllability, full negative prompt support and a high ceiling for fine-tuning โ€” now fully tuned for anime.

    This page contains all three variants:

    ๐ŸŽŒ Z-Anime Base โ€” Full quality, full control, full creativity โšก Z-Anime Distill-8-Step โ€” Great results in 8 steps ๐Ÿš€ Z-Anime Distill-4-Step โ€” Maximum speed, 4 steps

    Each variant is available in BF16 (~12GB) and FP8 (~6GB).


    ๐ŸŽฏ Key Features

    โœ… Full fine-tune on Z-Image Base โ€” not a LoRA merge

    โœ… Rich anime aesthetics with strong style diversity

    โœ… Natural language prompts โ€” detailed descriptions, not tag lists

    โœ… High diversity across characters, poses, compositions and layouts

    โœ… LoRA training ready โ€” perfect base for further fine-tuning

    โœ… Partially NSFW capable

    โœ… 8GB VRAM compatible

    โœ… All variants supported by the official Z-Anime ComfyUI Workflow


    ๐Ÿ—บ๏ธ Z-Anime Roadmap

    โœ… Released

    ๐ŸŽŒ Z-Anime Standard

    Full fine-tune on Z-Image Base โ€” BF16 & FP8

    Available now on CivitAI

    โšก Z-Anime-Distill-8-Step

    BF16 & FP8 โ€” fast anime generation in 8 steps, CFG 1.0

    ๐Ÿš€ Z-Anime-Distill-4-Step

    BF16 & FP8 โ€” ultra-fast anime generation in 4 steps, CFG 1.0

    ๐Ÿ“ฆ GGUF Variants

    Now available for low VRAM and AMD GPUs

    Since CivitAI currently has no dedicated GGUF category, here is what the files represent:

    * Z-Anime-Base-Q8_0 = Pruned Model FP8 (**6.73 GB**)

    * Z-Anime-Base-Q4_K_S = Pruned Model NF4 (**4.2 GB**)

    ๐Ÿ”œ Coming Soon

    ๐Ÿ”ง Z-Anime ComfyUI Workflow

    Official workflow โ€” supports all variants

    ๐ŸŽฒ Diffusers folder upload on Hugging Face

    ๐Ÿ”ฎ Planned

    ๐Ÿ“ฆ AIO Versions

    All versions with VAE + Text Encoder integrated in a single file

    More updates coming โ€” follow to stay notified! ๐ŸŽŒ


    ๐Ÿ“ฆ Versions Overview

    ๐ŸŸข BF16 (~12GB)

    Maximum precision. BFloat16 format, no quality compromise. Best for professional or commercial work and LoRA training. Still runs on 8GB VRAM.

    ๐ŸŸก FP8 (~6GB)

    Recommended for most users. Half the file size, much faster downloads. Excellent quality, barely distinguishable from BF16. Perfect for everyday use and testing.


    ๐ŸŽŒ Z-Anime Base

    The foundation of the Z-Anime family. A full fine-tune with the highest quality ceiling, the widest creative range and full negative prompt support.

    Recommended Settings:

    Steps:      28โ€“50
    CFG:        3.0โ€“5.0  (up to 9.0 possible)
    Sampler:    euler_ancestral
    Scheduler:  beta
    Negative:   strongly recommended โ€” very responsive!
    

    CFG Guide: 3.0โ€“5.0 is the sweet spot for balanced quality and creativity. 5.0โ€“7.0 gives tighter prompt adherence. 7.0โ€“9.0 is for maximum control โ€” watch for over-saturation. Above 9.0 is not recommended.

    Negative prompts have full effect on Z-Anime Base. The official workflow ships with an optimized negative prompt ready to use.


    โšก Z-Anime Distill-8-Step

    The sweet spot of the family. Distilled from Z-Anime Base, delivering strong anime results in just 8 steps. Much faster than Base while keeping most of the quality intact.

    Recommended Settings:

    Steps:      8
    CFG:        1.0  (max ~1.5)
    Sampler:    euler_ancestral
    Scheduler:  beta
    Negative:   limited effect
    

    CFG Guide: Runs best at CFG 1.0 by design. Small nudges up to 1.3โ€“1.5 are possible for slightly tighter prompt adherence. Do not go above 1.5 โ€” artifacts may appear.

    Negative prompts have limited effect at this distillation level. Use ConditioningZeroOut (included in the workflow) instead of writing a full negative prompt.


    ๐Ÿš€ Z-Anime Distill-4-Step

    The fastest Z-Anime variant. Built for maximum throughput โ€” rapid prototyping, batch generation and situations where speed matters most.

    Recommended Settings:

    Steps:      4
    CFG:        1.0  (max ~1.5)
    Sampler:    euler_ancestral
    Scheduler:  beta
    Negative:   limited effect
    

    CFG Guide: At 4 steps the model has very little correction room. Stay at CFG 1.0 for the most stable results. Nudging up to 1.3โ€“1.5 is possible but increases instability. Do not go above 1.5.

    Tips for 4-Step: Be specific and front-load the most important details early in your prompt. The optional upscaler (hires fix or SeedVR2) in the workflow is especially useful here to recover fine detail.


    ๐Ÿ“ Resolution Guide

    โญ Portrait: 832ร—1216 โ€” Character art Landscape: 1216ร—832 โ€” Scenes, backgrounds Square: 1024ร—1024 โ€” General purpose Tall: 768ร—1344 โ€” Full body, phone wallpaper Cinematic: 1920ร—1088 โ€” Wide scenes, wallpapers High Quality: 1024ร—1536 โ€” Detailed portraits

    Supported range: 512ร—512 to 2048ร—2048, any aspect ratio. All resolutions run on 8GB VRAM.


    ๐Ÿ’ก Prompting Guide

    Natural language โ€” not tag lists!

    โœ… Good:

    A young anime girl with long silver hair and golden eyes, wearing a 
    traditional shrine maiden outfit with white haori and red hakama. 
    She stands in a sunlit bamboo forest, cherry blossoms falling softly 
    around her. Warm afternoon light filtering through the trees, 
    detailed fabric shading, expressive face, calm serene expression. 
    High quality anime illustration with fine line work.
    

    โŒ Avoid:

    anime girl, silver hair, shrine maiden, bamboo, cherry blossom, warm light
    

    Character portraits:

    Detailed anime portrait of [character], soft rim lighting, 
    expressive eyes with detailed reflections, fine hair strands, 
    clean linework, professional anime illustration quality.
    

    Action scenes:

    Dynamic anime [scene], dramatic angle, motion energy, speed lines, 
    particle effects, cinematic composition, detailed shading, 
    high quality anime art.
    

    Backgrounds & landscapes:

    Anime [location] at [time of day], [lighting], [atmosphere], 
    Studio Ghibli inspired detail level, beautiful background art, 
    wallpaper quality.
    

    ๐Ÿ”ง Installation

    Step 1 โ€” Download your version (BF16 or FP8) for the variant you want.

    Step 2 โ€” Place the files:

    ComfyUI/models/diffusion_models/
    โ””โ”€โ”€ z-anime-base-bf16.safetensors        (Base BF16)
    โ””โ”€โ”€ z-anime-base-fp8.safetensors         (Base FP8)
    โ””โ”€โ”€ z-anime-distill-8step-bf16.safetensors
    โ””โ”€โ”€ z-anime-distill-8step-fp8.safetensors
    โ””โ”€โ”€ z-anime-distill-4step-bf16.safetensors
    โ””โ”€โ”€ z-anime-distill-4step-fp8.safetensors
    
    **GGUF variants**
    ComfyUI/models/unet/
    โ””โ”€โ”€ z-anime-q8_0.gguf
    โ””โ”€โ”€ z-anime-q4_k_s.gguf
    
    ComfyUI/models/clip/
    โ””โ”€โ”€ qwen_3_4b.safetensors
    
    ComfyUI/models/vae/
    โ””โ”€โ”€ ae.safetensors
    

    Step 3 โ€” Load in ComfyUI:

    Use the Load Diffusion Model node for the model file, a CLIPLoader node for the text encoder and a VAELoader node for the VAE.

    Or use the official Z-Anime ComfyUI Workflow โ€” it handles all three variants and both precisions with a built-in model switch.

    For the GGUF versions:

    - Load the **GGUF model from the models/unet/ folder**

    - Use the same **CLIP** and **VAE** files as above


    ๐Ÿ“ฆ Custom Nodes (for the official workflow)


    ๐Ÿ“ˆ Version History

    v1.0 โ€” Initial Release

    - Z-Anime Base: Full fine-tune on Z-Image Base, available in BF16 & FP8

    - Z-Anime Distill-8-Step: Available in BF16 & FP8

    - Z-Anime Distill-4-Step: Available in BF16 & FP8

    - GGUF Variants added:

    - Z-Anime-Q8_0 = pruned FP8 model

    - Z-Anime-Q4_K_S = pruned NF4 model

    - Optimized for euler_ancestral + beta across all variants

    - Official ComfyUI Workflow included


    ๐Ÿ™ Credits

    Base Architecture: Tongyi Lab (Alibaba) โ€” Z-Image Fine-Tune: SeeSee21 License: Apache 2.0 Architecture: S3-DiT (Single-Stream Diffusion Transformer, 6B parameters) Base Model: Tongyi-MAI/Z-Image GitHub: Tongyi-MAI/Z-Image


    Z-Anime โ€” Anime at its finest, powered by Z-Image Base. ๐ŸŽŒ

    Description

    ๐Ÿงช VAE

    Z-Anime also expects the standard VAE for Z-Image Base.

    If the wrong VAE is used, the latent may decode incorrectly, which can result in:

    • black output images

    • broken or washed-out results

    • incorrect colors or contrast

    • images that were generated correctly internally, but decoded incorrectly at the final step