CivArchive

    🎨 QWEN-Anime | Beta3-AIO

    Advanced Anime Generation with Image Editing

    ⚠️ Note: All versions are combined on this model card for convenience.


    📢 VERSION UPDATES

    🎨 VERSION 3 - LATEST (Beta3-AIO)

    🚀 MAJOR UPDATE - Image Editing Revolution!

    NEW FEATURES:

    • ✨ Image Editing functionality - Edit 1-3 images simultaneously

    • 🔄 Dual workflow - Text-to-Image AND Image-to-Image

    • 📦 Upgraded base model - Qwen Image Edit 2511 (from 2509)

    • ⚡ Faster generation - 4 steps minimum (down from 8)

    • 🔓 Custom uncut model - Qwen 2.5 VL 7B FP8 for maximum creative freedom

    • 🔞 NSFW capabilities - Partial nudity and clothing removal possible

    • 📦 FP8 only - Other formats available on request

    IMPROVEMENTS:

    • Combine multiple characters from different images

    • Transform and merge scenes

    • Style transfer between images

    • Enhanced detail preservation

    • More consistent results

    TIPS FOR BEST RESULTS:

    • Results depend on seed, prompt, and input images

    • For NSFW content: Load NSFW image as second image for better guidance

    • Experiment with different combinations

    WORKFLOW EVOLUTION:

    • V1/V2: Text-to-Image only

    • V3: Text-to-Image + Image-to-Image ⭐


    🔄 VERSION 2 (Beta2-AIO)

    FEATURES:

    • All-in-One format (no separate VAE/Text Encoder needed)

    • Two variants: Full (20+ steps) and Pruned (6-8 steps)

    • FP8 precision (26.99 GB)

    • Integrated VAE + Text Encoder

    • Single file, plug-and-play

    IMPROVEMENTS:

    • Easier setup vs Beta1

    • Same quality, simpler workflow

    • Lightning LoRA compatible


    📦 VERSION 1 (Beta1 - Legacy)

    FEATURES:

    • Original release

    • FP16 only (38.05 GB)

    • Requires separate VAE + Text Encoder

    • Base training

    ═══════════════════════════════════════════════════════════════════════
    Feature          │ BETA3-AIO ⭐ │ BETA2-AIO   │ BETA2    │ BETA1
    ═══════════════════════════════════════════════════════════════════════
    Image Editing    │ ✅ 1-3 imgs  │ ❌ No       │ ❌ No    │ ❌ No
    Text-to-Image    │ ✅ Yes       │ ✅ Yes      │ ✅ Yes   │ ✅ Yes
    Base Model       │ 2511         │ 2509        │ 2509     │ Base
    Min Steps        │ 4            │ 6-8         │ 6-8      │ 20+
    Setup            │ Single file  │ Single file │ 3 files  │ 3 files
    VAE/Encoder      │ Integrated   │ Integrated  │ Separate │ Separate
    NSFW             │ ✅ Limited   │ ⚠️ Limited  │ ⚠️       │ ⚠️
    File Size        │ 27 GB        │ 27 GB       │ 19-38 GB │ 38 GB
    Format           │ FP8          │ FP8         │ Multi    │ FP16
    Speed (8 steps)  │ ⚡⚡⚡       │ ⚡⚡⚡      │ ⚡⚡     │ ⚡
    ═══════════════════════════════════════════════════════════════════════

    🎯 Available Versions & Formats

    💎 Beta3-AIO (Recommended) ⭐

    Format:

    • 🟑 FP8 (26.99 GB) - 4+ steps, CFG 3.5

    • Other formats: Available on request

    What's included:

    • ✅ Image Editing (1-3 images)

    • ✅ Text-to-Image

    • ✅ Integrated VAE + Text Encoder

    • ✅ Uncut model for creative freedom

    Settings:

    • Steps: 4-20 (recommend 20 for quality)

    • CFG: 3.5

    • Sampler: Euler

    • Scheduler: Beta
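
    In a ComfyUI API-format workflow, the settings above would end up as inputs of a KSampler node. The following is a minimal sketch under that assumption, not an excerpt from the provided workflow: the node ids ("4", "5", "6", "7"), the seed, and the helper name `ksampler_node` are placeholders chosen for illustration.

```python
def ksampler_node(steps=20, cfg=3.5, seed=0):
    """Build an API-format KSampler node using the Beta3-AIO settings above."""
    if not 4 <= steps <= 20:
        raise ValueError("this card lists 4-20 steps for Beta3-AIO")
    return {
        "class_type": "KSampler",
        "inputs": {
            "seed": seed,
            "steps": steps,
            "cfg": cfg,
            "sampler_name": "euler",  # Sampler: Euler
            "scheduler": "beta",      # Scheduler: Beta
            "denoise": 1.0,
            # Links have the form [source_node_id, output_index].
            # The ids below are hypothetical placeholders.
            "model": ["4", 0],
            "positive": ["6", 0],
            "negative": ["7", 0],
            "latent_image": ["5", 0],
        },
    }


node = ksampler_node()
print(node["inputs"]["steps"], node["inputs"]["cfg"])
```

    The range check only mirrors the 4-20 step range stated in this list; it is not something ComfyUI enforces.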


    💎 Beta2-AIO

    Variants:

    • 🟒 Full Model FP8 (26.99 GB) - 20+ steps, CFG 2.5-4.0 (quality mode)

    • 🟑 Pruned Model FP8 (26.99 GB) - 6-8 steps, CFG 1.0 (speed mode)

    What's included:

    • ✅ Integrated VAE + Text Encoder

    • ✅ Single file, plug-and-play

    • ✅ Use regular "Load Checkpoint" node


    🧠 Beta2 (Safetensors & GGUF)

    Requires separate VAE + Text Encoder

    SafeTensor Versions:

    • 🟪 BF16 (38.05 GB)

    • 🟦 FP16 (38.05 GB)

    • 🟨 FP8 (19.03 GB)

    GGUF Versions: ⚠️ Requires ComfyUI-GGUF

    • 🔹 F16 (38.07 GB)

    • 🔹 Q8 (20.23 GB)

    • 🔹 Q6_K (15.63 GB)

    • 🔹 Q4_K_S (10.72 GB)
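
    The file sizes listed above give a rough idea of how many bits each format spends per weight. This is a back-of-the-envelope sketch, assuming the BF16 file stores exactly 16 bits per weight, that file metadata is negligible, and that 1 GB means 10^9 bytes; none of these assumptions is confirmed by the card.

```python
# File sizes in GB, copied from the Beta2 lists above.
SIZES_GB = {
    "BF16": 38.05,
    "FP8": 19.03,
    "GGUF Q8": 20.23,
    "GGUF Q6_K": 15.63,
    "GGUF Q4_K_S": 10.72,
}

# Assumption: BF16 = 2 bytes per weight, so GB / 2 gives billions of weights.
params_billion = SIZES_GB["BF16"] / 2


def bits_per_weight(size_gb):
    """Scale the 16-bit BF16 baseline by the file-size ratio."""
    return 16 * size_gb / SIZES_GB["BF16"]


for name, size in SIZES_GB.items():
    print(f"{name:12s} {size:6.2f} GB  ~{bits_per_weight(size):.1f} bits/weight")
```

    Under these assumptions the diffusion model has about 19 billion weights, and the Q8, Q6_K, and Q4_K_S files come to roughly 8.5, 6.6, and 4.5 bits per weight.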


    πŸ“¦ Beta1 (Legacy)

    • FP16 only (38.05 GB)

    • Requires separate VAE + Text Encoder


    🧪 TEST RESULTS

    🎨 Beta3-AIO Image Editing Test

    Tested on Nvidia RTX 4060 with Euler sampler

    Test: Multi-Image Composition

    Prompt:

    Place the two figures in a fantasy medieval tavern, laughing and clinking two beer glasses.

    Images Used:

    • Image 1: Character A

    • Image 2: Character B

    Result:

    • Successfully combined both characters

    • Tavern setting accurately generated

    • Natural interaction and poses

    • Consistent anime style maintained


    📊 Beta2-AIO Test Results

    Tested on Nvidia RTX 4060 with Euler A sampler

    🟒 Full Model fp8 (20+ Steps Version)

    Test 1: Elegant Shrine Maiden

    Resolution: 1024×1024 | Steps: 24 | CFG: 3.6 | Time: ~176.48s

    Prompt:

    anime, masterpiece, best quality, 1girl, shrine maiden, long black hair, 
    red hakama, white kimono top, holding paper talisman, sacred shrine background, 
    cherry blossoms falling, soft sunlight, detailed face, serene expression, 
    traditional japanese architecture, torii gate in background, 
    cinematic lighting, depth of field
    

    Test 2: Cyberpunk Street Scene

    Resolution: 1536×1024 | Steps: 28 | CFG: 4.0 | Time: ~229.52s

    Prompt:

    anime, 2k quality, ultra-detailed, 1girl, cyberpunk hacker, 
    neon-lit tokyo street, rain reflections, holographic advertisements, 
    purple and cyan color scheme, tech wear jacket, mechanical arm augmentation, 
    confident pose, sharp focus, cinematic composition, bokeh background, 
    night city atmosphere, detailed eyes
    

    Test 3: Fantasy Dragon Knight

    Resolution: 832×1216 | Steps: 32 | CFG: 3.8 | Time: ~227.94s

    Prompt:

    anime, masterpiece, high detail, 1girl, dragon knight, 
    silver armor with blue accents, flowing cape, dragon companion beside her, 
    epic fantasy landscape, castle ruins background, dramatic sky, 
    wind effect on hair and cape, detailed armor patterns, 
    heroic pose, cinematic lighting, depth of field
    

    🟑 Pruned Model fp8 (6-8 Steps Version)

    Test 4: Cozy Cafe Moment

    Resolution: 1024×1024 | Steps: 8 | CFG: 1.0 | Time: ~32.47s

    Prompt:

    anime, best quality, 1girl, casual outfit, sitting in cafe, 
    holding coffee cup, warm lighting, bokeh background, 
    soft smile, detailed eyes, cozy atmosphere, 
    window light, autumn colors, relaxed pose
    

    Test 5: Magical Girl Transformation

    Resolution: 512×768 | Steps: 7 | CFG: 1.0 | Time: ~19.28s

    Prompt:

    anime, masterpiece, 1girl, magical girl, transformation pose, 
    sparkles and light effects, flowing hair, colorful costume, 
    magic circle background, dynamic composition, 
    vibrant colors, detailed ribbons, glowing effects
    

    Test 6: Beach Sunset Portrait

    Resolution: 1024×1536 | Steps: 6 | CFG: 1.0 | Time: ~32.07s

    Prompt:

    anime, best quality, 1girl, summer dress, beach sunset, 
    golden hour lighting, ocean waves, soft wind effect on hair, 
    warm colors, peaceful expression, detailed face, 
    cinematic sunset, depth of field, romantic atmosphere
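
    To compare the six Beta2-AIO runs on equal footing, the reported times can be divided by the step counts. A small sketch using only the numbers above; the reported times presumably include overhead such as model loading and VAE decoding, so the per-step figures are upper bounds.

```python
# (label, steps, total seconds) copied from the Beta2-AIO test results above.
RESULTS = [
    ("Test 1 full 1024x1024", 24, 176.48),
    ("Test 2 full 1536x1024", 28, 229.52),
    ("Test 3 full 832x1216", 32, 227.94),
    ("Test 4 pruned 1024x1024", 8, 32.47),
    ("Test 5 pruned 512x768", 7, 19.28),
    ("Test 6 pruned 1024x1536", 6, 32.07),
]


def seconds_per_step(steps, seconds):
    """Average wall-clock seconds per sampling step."""
    return seconds / steps


for label, steps, seconds in RESULTS:
    print(f"{label:26s} {seconds_per_step(steps, seconds):5.2f} s/step")
```

    At 1024×1024 this gives about 7.35 s/step for the Full model and about 4.06 s/step for the Pruned model, so the Pruned variant's speed advantage comes from both fewer steps and cheaper steps.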
    

    βš™οΈ SETTINGS & USAGE

    🎨 Beta3-AIO Settings

    Text-to-Image Mode:

    • Steps: 4-8

    • CFG: 1

    • Sampler: Euler

    • Scheduler: Simple

    • Resolution: 512×512 to 4K

    Image Editing Mode:

    • Steps: 4-8

    • CFG: 1

    • Sampler: Euler

    • Images: 1-3 (Image 1 required)

    • Tip: Higher steps for complex edits

    NSFW Content:

    • Load NSFW reference as Image 2 or 3

    • Be specific in prompt

    • Results vary - experiment with seeds


    🟒 Beta2-AIO Full Model (Quality Mode)

    • Steps: 20-32

    • CFG: 2.5-4.0 (sweet spot: 3.6)

    • Sampler: Euler A, Euler Normal, Beta, Simple

    • Use for: High quality, detailed work, final renders


    🟑 Beta2-AIO Pruned Model (Speed Mode)

    • Steps: 6-8 (optimal: 8)

    • CFG: 1.0 (max 2.0, but stay at 1.0)

    • Sampler: Euler A recommended

    • Use for: Fast iterations, testing, quick generations


    📊 Universal Settings (All Versions)

    • Resolution: 512×512 to 2048×1152

    • VRAM: 8GB+ recommended

    • Lightning LoRAs: Compatible (4-step or 8-step)
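
    The per-variant settings in this section can be kept as a small lookup table, so a script can reject step counts outside the listed ranges. This is an illustrative sketch: the preset keys and the `Preset` type are made-up names, and the values are transcribed from the lists in this section (with the Full model using its stated sweet-spot CFG of 3.6).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Preset:
    min_steps: int
    max_steps: int
    cfg: float
    sampler: str


# Keys are illustrative names; values come from the settings lists above.
PRESETS = {
    "beta3-aio": Preset(4, 8, 1.0, "Euler"),
    "beta2-aio-full": Preset(20, 32, 3.6, "Euler A"),
    "beta2-aio-pruned": Preset(6, 8, 1.0, "Euler A"),
}


def check_steps(variant, steps):
    """Return the preset for a variant, or raise if steps is out of range."""
    preset = PRESETS[variant]
    if not preset.min_steps <= steps <= preset.max_steps:
        raise ValueError(
            f"{variant}: expected {preset.min_steps}-{preset.max_steps} steps, got {steps}"
        )
    return preset


print(check_steps("beta2-aio-pruned", 8))
```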


    💡 Which Version Should I Choose?

    Choose Beta3-AIO if: ⭐

    • ✅ Want image editing capabilities

    • ✅ Need to combine multiple images

    • ✅ Want latest features and improvements

    • ✅ Need NSFW capabilities

    • ✅ Want fastest base model (4+ steps)

    Choose Beta2-AIO (Pruned) if:

    • ✅ Want fastest text-to-image (6-8 steps)

    • ✅ Need quick iterations/testing

    • ✅ Prefer simplicity (single file)

    • ✅ 8GB+ VRAM available

    Choose Beta2-AIO (Full) if:

    • ✅ Want maximum quality

    • ✅ Need more control (CFG 2.5-4.0)

    • ✅ Creating final/detailed work

    • ✅ Prefer traditional workflow

    Choose Beta2 FP8 if:

    • ✅ Want flexibility (separate VAE/encoder)

    • ✅ Using custom VAE/encoders

    • ✅ Need maximum compatibility

    Choose Beta2 GGUF if:

    • ✅ Limited VRAM (6-8GB)

    • ✅ Want smallest files (Q4 = 10GB)

    • ✅ CPU inference needed

    Choose Beta1 if:

    • ✅ Compatibility with old workflows

    • ✅ Testing/comparison purposes
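
    The checklists above can be condensed into a small decision helper. This is only a sketch: the function name, its parameters, and the order in which the criteria are checked are assumptions made for illustration, and the card does not rank the criteria against each other.

```python
def choose_version(need_editing=False, vram_gb=8, custom_vae=False,
                   want_max_quality=False):
    """Map a few of the criteria above to a version name (priority is assumed)."""
    if need_editing:
        return "Beta3-AIO"           # only version with image editing
    if vram_gb < 8:
        return "Beta2 GGUF"          # listed for limited VRAM
    if custom_vae:
        return "Beta2 FP8"           # separate VAE / text encoder
    if want_max_quality:
        return "Beta2-AIO (Full)"    # 20+ steps, CFG 2.5-4.0
    return "Beta2-AIO (Pruned)"      # 6-8 steps, CFG 1.0


print(choose_version(need_editing=True))
print(choose_version(vram_gb=6))
```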


    🔧 INSTALLATION GUIDE

    📦 Beta3-AIO (Easiest!)

    1. Download Beta3-AIO FP8

    2. Place in ComfyUI/models/checkpoints/

    3. Load with standard "Load Checkpoint" node

    4. For Image Editing: Use provided workflow

    5. Generate!

    No extra files needed!


    📦 Beta2-AIO

    1. Download your preferred version (Full or Pruned)

    2. Place in ComfyUI/models/checkpoints/

    3. Load with standard "Load Checkpoint" node

    4. Generate!

    No extra files needed!


    🧠 Beta2 (Safetensors)

    1. Download checkpoint → diffusion_models/

    2. Download Text Encoder → text_encoders/QWEN/

    3. Download VAE → vae/QWEN/

    4. Use "Load Diffusion Model" node


    💾 Beta2 (GGUF)

    1. Install ComfyUI-GGUF: https://github.com/city96/ComfyUI-GGUF

    2. Download GGUF → unet/

    3. Download Text Encoder + VAE (same as Safetensors)

    4. Use "GGUF Loader" node
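
    The target folders from the installation steps above can be collected in one place. A minimal sketch, assuming all of the listed sub-folders live under ComfyUI/models/ (the steps only spell out the full path for checkpoints); the file names and the `kind` keys are hypothetical.

```python
from pathlib import Path

# Sub-folders from the installation steps above.
# Assumption: each one sits under <ComfyUI root>/models/.
DESTINATIONS = {
    "aio_checkpoint": "checkpoints",
    "diffusion_model": "diffusion_models",
    "text_encoder": "text_encoders/QWEN",
    "vae": "vae/QWEN",
    "gguf": "unet",
}


def install_path(comfy_root, kind, filename):
    """Return where a downloaded file of the given kind should be placed."""
    return Path(comfy_root) / "models" / DESTINATIONS[kind] / filename


# Hypothetical file names, for illustration only.
print(install_path("ComfyUI", "aio_checkpoint", "beta3-aio-fp8.safetensors"))
print(install_path("ComfyUI", "gguf", "beta2-q4_k_s.gguf"))
```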


    πŸ“ PROMPTING TIPS

    ✍️ General Tips

    Quality Tags (All Versions):

    anime, masterpiece, best quality, ultra-detailed, 
    2k resolution, sharp focus, cinematic lighting
    

    Style Modifiers:

    MOE STYLE, official art, anime coloring, 
    detailed eyes, depth of field, bokeh
    

    Negative Prompt:

    low quality, blurry, bad anatomy, bad hands, 
    text, watermark, mutation, distorted
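
    The tag lists above can be combined with scene-specific tags programmatically. A minimal sketch, assuming tags are simply joined with commas and that repeated tags should be dropped; `build_prompt` is an illustrative helper, not part of any tool mentioned here.

```python
QUALITY_TAGS = ["anime", "masterpiece", "best quality", "ultra-detailed",
                "2k resolution", "sharp focus", "cinematic lighting"]
NEGATIVE_TAGS = ["low quality", "blurry", "bad anatomy", "bad hands",
                 "text", "watermark", "mutation", "distorted"]


def build_prompt(subject_tags, style_tags=()):
    """Join quality, subject, and style tags, dropping duplicates in order."""
    seen, ordered = set(), []
    for tag in [*QUALITY_TAGS, *subject_tags, *style_tags]:
        key = tag.strip().lower()
        if key and key not in seen:
            seen.add(key)
            ordered.append(tag.strip())
    return ", ".join(ordered)


positive = build_prompt(["1girl", "shrine maiden", "sharp focus"],
                        ["detailed eyes", "bokeh"])
negative = ", ".join(NEGATIVE_TAGS)
print(positive)
print(negative)
```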
    

    🎨 Beta3-AIO Specific Tips

    Text-to-Image Prompts:

    anime girl with long blue hair, wearing school uniform, 
    cherry blossoms in background, soft lighting, detailed 
    eyes, anime style, high quality
    

    Image Editing Prompts (Single Image):

    change hair color to pink, add cat ears, school uniform, 
    keep the same pose and composition
    

    Image Editing Prompts (Multiple Images):

    combine the character from image 1 with the background 
    from image 2, match the lighting and style, anime aesthetic
    

    Important for Editing:

    • Be specific about changes

    • Describe both images and desired result

    • Mention style consistency if needed

    • Natural language works best
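
    Since image editing takes one required image and up to two optional ones, a script that prepares edit jobs can validate the inputs first. A small sketch with hypothetical names (`edit_request`, the `image1` to `image3` slot keys); it does not call ComfyUI and only bundles the prompt with the filled image slots.

```python
def edit_request(prompt, image1, image2=None, image3=None):
    """Bundle an edit prompt with 1-3 image paths; Image 1 is required."""
    if not image1:
        raise ValueError("Image 1 is required for image editing")
    if not prompt.strip():
        raise ValueError("an edit prompt is required")
    slots = (image1, image2, image3)
    images = {f"image{i}": path for i, path in enumerate(slots, start=1) if path}
    return {"prompt": prompt.strip(), "images": images}


request = edit_request(
    "combine the character from image 1 with the background from image 2, "
    "match the lighting and style, anime aesthetic",
    "character.png",
    "background.png",
)
print(sorted(request["images"]))
```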


    🔧 Beta3-AIO Specifications

    • Base Model: Qwen Image Edit 2511

    • Text Encoder: Qwen 2.5 VL 7B FP8 (uncut)

    • Precision: FP8

    • Format: AIO (All-in-One)

    • File Size: ~27 GB

    • VRAM: 8GB minimum

    • Steps: 4-20 (4 min, 20 recommended)

    • CFG: 3.5

    • Sampler: Euler

    • Scheduler: Beta

    Capabilities:

    • Text-to-Image generation

    • Image-to-Image editing (1-3 images)

    • Character combination

    • Scene composition

    • Style transfer

    • Partial NSFW support


    🔞 CONTENT NOTICE

    ⚠️ NSFW Capabilities

    Beta3-AIO:

    • ✅ Partial nudity - Supported

    • ✅ Clothing removal - Possible (results vary)

    • ✅ Artistic nudity - Breasts/underboob

    • ❌ Full explicit content - Not supported

    • 🔞 Age restriction - 18+ only, use responsibly

    Tips for NSFW:

    • Load NSFW reference image as Image 2 or 3

    • Results depend on seed, prompt, and input images

    • Experiment with different combinations

    Beta2-AIO & Earlier:

    • ⚠️ Limited NSFW - Artistic nudity (breasts/underboob) supported

    • ❌ Full explicit content - Not supported

    • 🔞 Age restriction - 18+ only


    ❓ FAQ

    General Questions

    Q: Which version should I download?
    A: Beta3-AIO for latest features + image editing. Beta2-AIO Pruned for fastest text-to-image.

    Q: Do I need separate VAE/encoder for AIO versions?
    A: No! AIO has everything integrated.

    Q: Can I use Lightning LoRAs?
    A: Yes! All versions support Lightning LoRAs (4-step or 8-step).


    Beta3-AIO Specific

    Q: How many images can I edit at once?
    A: 1-3 images (Image 1 required, Images 2-3 optional).

    Q: Can I still do text-to-image with Beta3?
    A: Yes! Beta3 supports both Text-to-Image AND Image-to-Image.

    Q: How do I get better NSFW results?
    A: Load an NSFW reference image as Image 2 or 3 for guidance.

    Q: What's the minimum steps for Beta3?
    A: 4 steps minimum, but 20 steps recommended for quality.


    Beta2-AIO Specific

    Q: What's the difference between Full and Pruned AIO?
    A: Full = quality mode (CFG 2.5-4.0, 20+ steps). Pruned = speed mode (CFG 1.0, 6-8 steps).

    Q: Why does Pruned need CFG 1.0?
    A: It's optimized for low-step high-speed generation. CFG 1.0 works best.

    Q: Can I use CFG 3.0 with Pruned?
    A: Not recommended. Max is 2.0, but results are best at 1.0.


    Compatibility

    Q: Is quality different between AIO and Beta2 FP8?
    A: No, same training - AIO just bundles files together.

    Q: Which has better quality: Full AIO or Beta2 BF16?
    A: Beta2 BF16 has slightly better precision, but difference is minimal.

    Q: Can I mix versions (e.g., Beta3 with Beta2 VAE)?
    A: Not recommended. Each version is optimized as a complete package.


    πŸ™ CREDITS

    Training: Custom dataset, Dual Tesla P40 GPUs Base Model: Qwen Image Edit 2511 (Beta3), 2509 (Beta2) Text Encoder: Qwen 2.5 VL 7B FP8 (uncut for Beta3) Architecture: Qwen-Image-Edit framework Community: Thanks to all Beta1, Beta2, and Beta3 testers!

    Special Thanks:

    • Qwen team for the base models

    • ComfyUI community for feedback

    • All users who provided testing data


    🚀 QUICK START

    Getting Started (Beta3-AIO)

    1. Download Beta3-AIO FP8 from files section

    2. Place in ComfyUI/models/checkpoints/

    3. Download the provided workflow

    4. Load workflow in ComfyUI

    5. Choose mode:

      • Text-to-Image: Write prompt, generate

      • Image Editing: Upload 1-3 images, write edit prompt, generate

    6. Generate amazing anime art!


    Version Information

    • Current Version: Beta3-AIO ⭐

    • Previous Versions: Beta2-AIO, Beta2, Beta1

    • Release Date: December 2025

    • License: Apache 2.0

    • Format: Safetensors (AIO)


    Created with ❤️ for the anime AI community

    Choose Beta3-AIO for the complete experience!

    Comments (27)

    1q2w3e4rQAZ
    Nov 14, 2025 · 1 reaction
    CivitAI

    38 GB? Is this the actual size?

    SeeSeeLP
    Author
    Nov 14, 2025

    Hey, thanks for your question.

    Yes, since it's the BF16 version, it's neither quantized nor scaled. 😊

    SeeSeeLP
    Author
    Nov 14, 2025 · 4 reactions

    Smaller versions will follow, but not for this beta version. 👍

    luodehaier222
    Nov 15, 2025 · 1 reaction

    too big bro

    SeeSeeLP
    Author
    Nov 15, 2025 · 1 reaction

    I hear that quite often 😏 That's actually the normal size for Qwen image models in BF16, but I'll quantize later versions as well. Thanks for your comment.

    Proteinique
    Nov 15, 2025 · 1 reaction

    Yoo, is Qwen the new upgrade? Is it better than IL? I'm always hyped when a new model generation comes out.

    SeeSeeLP
    Author
    Nov 15, 2025 · 1 reaction

    Hey Proteinique! 🎨

    Great question! QWEN and Illustrious are actually different beasts - they're not really competing, but complementary!

    Here's the breakdown:

    QWEN-Anime-Beta1 (this model):

    ⚡ INSANE speed - 4-8 steps with Lightning LoRA (19-84s for most resolutions!)

    🎯 Different architecture (QWEN/FLUX-based, NOT SDXL)

    📐 Multi-resolution beast (512x512 up to 2K+ confirmed on 8GB VRAM)

    🚀 Flexible step range (4-50 steps all produce usable results)

    ⚠️ Beta stage - still gathering feedback and refining

    Illustrious XL:

    🏛️ Mature, stable SDXL-based model

    📚 Huge LoRA ecosystem and community support

    🎨 Danbooru-trained with deep anime knowledge

    🔧 Typically needs 20-40 steps for best results

    ✅ Production-ready

    So... which is "better"?

    Depends on your use case!

    🙌

    TL;DR: Not necessarily "better," just different - and DAMN fast! 🚀

    Thanks for the hype! Feel free to test and let me know what you think! 💪

    Proteinique
    Nov 15, 2025 · 3 reactions

    @SeeSeeLP Well, I'm a bit impatient for the new booru model to release with better prompt understanding. It still struggles with complex scenery, and when there are too many tokens, the quality drops.
    I'll occasionally try it once it's more fine-tuned. I'm not looking for speed but for quality and strong scene building, so I'll be following this model closely.

    VisionaryAI_Studio
    Nov 16, 2025 · 1 reaction

    Excellent, I'm already looking forward to the next version! 👍🏻

    Haylow117
    Nov 17, 2025 · 1 reaction

    How do you get it to run on 8 GB of VRAM? I have 32 GB and am having trouble. Is there a workflow you use that you can share?

    SeeSeeLP
    Author
    Nov 17, 2025 · 1 reaction

    Everything is set up with ComfyUI.
    Regarding my hardware, I'm using an Nvidia RTX 4060 Ti with 8GB VRAM and 64GB of DDR5 RAM.
    I was planning to upload a workflow anyway. I'll put one together for you today.

    SeeSeeLP
    Author
    Nov 17, 2025 · 1 reaction

    @Haylow117 Here's the link to the workflow for you 🤓 : https://civitai.com/models/2135240?modelVersionId=2415389

    artyclaw
    Nov 22, 2025 · 1 reaction

    I'm using it with 16 GB VRAM and 64 GB RAM. You need quite a lot of RAM as well.

    Haylow117
    Nov 27, 2025

    @SeeSeeLP Thanks

    Haylow117
    Nov 27, 2025

    @artyclaw Got 128 GB. It seems some workflows do smart loading and unloading to prevent VRAM overload. Sometimes my computer crashes because VRAM isn't cleared.

    Haylow117
    Nov 27, 2025

    @SeeSeeLP Thank you!

    artyclaw
    Nov 27, 2025 · 1 reaction

    @Haylow117 Also: I have DisTorch installed, which basically patches all Comfy loaders so they can offload stuff to normal RAM. I guess that's what's going on here.

    Haylow117
    Dec 1, 2025

    @artyclaw Ahh, that makes sense. The workflow helped a ton, and that explains how you avoid your computer crashing or freezing. I'll need to do that.

    XPFanatic
    Nov 19, 2025 · 2 reactions

    Any chance of getting GGUF_Q8 or FP8_Scaled?
    Offloading sacrifices too much performance,
    while those two options are very close to FP16 in quality.

    SeeSeeLP
    Author
    Nov 19, 2025 · 1 reaction

    Yes, of course. But I hadn't planned that for the Beta 1 version, unless someone wants to quantize it. It's definitely planned for later versions and will be implemented. Thanks for your feedback.

    NRDX
    Nov 21, 2025 · 1 reaction

    What tool did you use to train this full finetune?

    SeeSeeLP
    Author
    Nov 21, 2025 · 1 reaction

    @NRDX Thanks for the question! 👍

    I trained the checkpoints using OneTrainer and a customized configuration. 🏋️

    NRDX
    Nov 21, 2025 · 1 reaction

    That's great, I didn't know OneTrainer did full fine-tunes. How much VRAM did it require?

    SeeSeeLP
    Author
    Nov 21, 2025 · 1 reaction

    @NRDX Hey, it actually wasn't that little.
    Two 24 GB VRAM Nvidia Tesla P40s running in BF16,
    and around 200 GB of system RAM with EMA enabled.

    SeeSeeLP
    Author
    Nov 21, 2025 · 1 reaction

    @NRDX Without EMA, 48GB of VRAM should probably be sufficient.

    dreadfulpirate
    Nov 21, 2025 · 1 reaction

    Holy shit!! 38GB!! You have a dream system.

    SeeSeeLP
    Author
    Nov 21, 2025 · 1 reaction

    @dreadfulpirate Yes, you really need a dream system for training. 😁

    But not for creating images; 8GB of VRAM is sufficient, depending on your RAM.

    However, V2 will also be available as an FP8 version.

    Checkpoint
    Qwen

    Details

    Downloads: 185
    Platform: CivitAI
    Platform Status: Available
    Created: 11/12/2025
    Updated: 6/11/2026
    Deleted: -

    Files

    qwenAnime_qwenAnimeBeta1.safetensors
