Overview
This model is based on the NOOBAI XL-VPred 1.0 architecture, with some structural modifications.
For versions 1.0 to 3.0: The model was trained on the Danbooru2024 dataset along with Yande Full and e621, using NOOBAI XL-VPred 1.0 and Illustrious XL 1.0 as teacher models.
In version 2.0: I used the earlier data plus more than 50k additional images of real-life characters collected from various sources on the internet.
In version 3.0: I refactored the dataset and added more labels using ChatGPT o3-mini, followed by a manual recheck.
In version 4.0: The model was trained on the danbooru2024, danbooru_newest-all, e621, e621_newest, gelbooru_full, and yande_full datasets, as well as a custom dataset that I collected, labeled in natural language with GPT-4.5, and then verified manually.
For version 1.0: This model focuses on balancing multiple art styles (through the use of trigger prompts) and good anatomy when generating images.
For version 2.0: This version focuses heavily on improving anatomy and enables the creation of more realistic characters (through the use of trigger prompts). Note that this version may reduce the quality of image generation across multiple art styles.
For version 3.0: This version can generate images in multiple styles (similar to version 1.0) while also creating more realistic characters (more lifelike compared to version 2.0) with improved anatomy. However, to achieve the desired image, you need to input a precise descriptive prompt, as it significantly impacts the output.
For version 4.0: To handle the larger amount of training data, I reconstructed the model with some modifications and trained all parts of it, including CLIP, VAE, and UNet. These changes let the model reproduce image styles more accurately and improve character anatomy. I also fixed the issues that occurred in versions 2.0 and 3.0.
Important Note
I personally reconstructed this model, so I’d greatly appreciate any feedback. Your insights won’t just motivate me but will also help me better understand its strengths and weaknesses, allowing me to refine it in the future.
This is a V-prediction model (unlike epsilon-prediction), which requires specific parameter configurations. Please refer to the user guide here.
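To illustrate how V-prediction differs from epsilon-prediction, here is a minimal numerical sketch of the standard v-parameterization. It is not code from this model: the function names and the scalar values are illustrative assumptions, and in practice your UI or library does this conversion for you once V-prediction is enabled. The sketch assumes a variance-preserving noise schedule where alpha^2 + sigma^2 = 1.

```python
import math


def v_target(x0: float, eps: float, alpha: float, sigma: float) -> float:
    """Value a v-prediction network is trained to output: v = alpha*eps - sigma*x0."""
    return alpha * eps - sigma * x0


def recover_from_v(x_t: float, v: float, alpha: float, sigma: float):
    """Recover the clean sample and the noise from x_t and v.

    Only valid when alpha**2 + sigma**2 == 1 (variance-preserving schedule).
    """
    x0 = alpha * x_t - sigma * v
    eps = sigma * x_t + alpha * v
    return x0, eps


# One scalar "pixel" at an arbitrary noise level (illustrative values).
angle = 0.3 * math.pi
alpha, sigma = math.cos(angle), math.sin(angle)
x0, eps = 0.8, -1.5

x_t = alpha * x0 + sigma * eps           # noisy sample seen by the network
v = v_target(x0, eps, alpha, sigma)      # what a v-prediction model predicts
x0_rec, eps_rec = recover_from_v(x_t, v, alpha, sigma)
```

An epsilon-prediction sampler would treat the network output as `eps` directly, which is why running a V-prediction checkpoint with epsilon settings gives broken images.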
Currently, the model is not available for use via Civitai Generation. You can visit the following website to use it:
Settings for Generating Realistic Characters
For versions 2.0 and 3.0: Add these prompts to generate realistic characters.
Positive prompt: realistic, cosplay, real life, photorealistic
Negative prompt: illustration, blur, film grain, noise, sketch, comic, cartoon, toon, oil painting (medium), flat color, outline, 3D, 2.5D, 2D, unrealistic, game engine style, anime coloring, smooth skin
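If you build prompts in a script, the realism tags above can be appended automatically. The following is a minimal sketch, assuming comma-separated prompts; `add_tags` and the example base prompt are hypothetical helpers for illustration, not part of any UI or library.

```python
# Realism tags copied from the settings above (for versions 2.0 and 3.0).
REALISM_POSITIVE = ["realistic", "cosplay", "real life", "photorealistic"]
REALISM_NEGATIVE = [
    "illustration", "blur", "film grain", "noise", "sketch", "comic",
    "cartoon", "toon", "oil painting (medium)", "flat color", "outline",
    "3D", "2.5D", "2D", "unrealistic", "game engine style",
    "anime coloring", "smooth skin",
]


def add_tags(prompt: str, tags: list) -> str:
    """Append tags to a comma-separated prompt, skipping ones already present."""
    merged = [t.strip() for t in prompt.split(",") if t.strip()]
    seen = {t.lower() for t in merged}
    for tag in tags:
        if tag.lower() not in seen:
            merged.append(tag)
            seen.add(tag.lower())
    return ", ".join(merged)


# Hypothetical base prompt; "realistic" is already present and is not duplicated.
positive = add_tags("1girl, portrait, realistic", REALISM_POSITIVE)
negative = add_tags("", REALISM_NEGATIVE)
```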
Recommended Settings
Positive prompt: masterpiece,best quality,amazing quality
Negative prompt: bad quality,worst quality,worst detail,sketch,censor, simple background,transparent background
CFG: 4-6
Clip skip: 2
Steps: 20-30
Sampler: Euler a
Alternative settings, contributed by @Ligmanese:
Sampler: Euler Ancestral CFG++
Schedule Type: Simple
Sampling Steps: 25-30
CFG Scale: 1.2-1.5
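The two sets of settings above use very different CFG ranges, so it is easy to mix them up. As a rough sketch, they can be kept as named presets with a simple range check. The `Preset` class, the preset keys, and `check_settings` are hypothetical names for illustration; fields the page does not specify for a preset (for example, clip skip for the contributed settings) are left as `None` rather than guessed.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Preset:
    sampler: str
    steps: Tuple[int, int]            # inclusive (min, max)
    cfg: Tuple[float, float]          # inclusive (min, max)
    schedule: Optional[str] = None    # None = not specified on the page
    clip_skip: Optional[int] = None   # None = not specified on the page


PRESETS = {
    "author": Preset(sampler="Euler a", steps=(20, 30), cfg=(4.0, 6.0), clip_skip=2),
    "ligmanese": Preset(sampler="Euler Ancestral CFG++", steps=(25, 30),
                        cfg=(1.2, 1.5), schedule="Simple"),
}


def check_settings(preset: Preset, steps: int, cfg: float) -> List[str]:
    """Return a warning for each value outside the preset's suggested range."""
    warnings = []
    lo, hi = preset.steps
    if not lo <= steps <= hi:
        warnings.append(f"steps={steps} is outside the suggested {lo}-{hi} range")
    lo_cfg, hi_cfg = preset.cfg
    if not lo_cfg <= cfg <= hi_cfg:
        warnings.append(f"cfg={cfg} is outside the suggested {lo_cfg}-{hi_cfg} range")
    return warnings


# CFG 5 suits the "Euler a" preset but is far too high for the CFG++ preset.
ok = check_settings(PRESETS["author"], steps=25, cfg=5.0)
too_high = check_settings(PRESETS["ligmanese"], steps=25, cfg=5.0)
```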
Note:
I don’t use any post-processing or LoRA to enhance the example images. They are generated solely using these settings and prompts with my base model.
For comparison and independent evaluation, I used prompts from various sources and authors to generate these example images.
Acknowledgments
Thanks to narugo1992 and Nyanko for sharing such valuable data and Laxhar Lab for providing an amazing model!
Thanks @Sennke for creating the noobReal model! This model has given me more ideas for improving the ChromaYume version 2.0.
If you'd like to support my work, you can do so through Ko-fi!
Comments (5)
v1 is very good since it's like base noob vpred 1.0 but more stable, and all artist tags look like they should. If you could just do more training on that version so artist tags are even better, that'd be incredible.
Hi, model v3.0 is the successor to v1.0. Please try it and let me know what you think of it.
From my personal perspective, I find that this version 3.0 model has improved quite well compared to the previous two versions. However, the issue here is that this model has increased the difficulty in prompting to achieve the desired image. The idea of mixing 2D, 3D, 2.5D realistic, and artist styles is fantastic. I also understand that training a model with multiple styles is extremely challenging and also very costly. I hope that more people will know about this model instead of constantly mixing multiple models together, as the results from these mixes don't change much, mainly just a few minor details. Thank you for creating this model.
Some early thoughts on v3. Backgrounds are definitely better than v1. It can do 2D/anime styling better than v2. I need more time for further testing.
By the way, you have v1 selected as the default version on the model page. Some people may not realize that v3 was released.
One big problem with v3 is high contrast / burned colors. Almost all sample images display this issue. Even reliable euler_a which usually produces clean and smooth colors suffers from this issue to the point that sometimes it starts to generate black and white lineart at CFG >= 5. I never had such problems with v1. And DPM++ 2M SDE sampler is completely unusable.
I managed to get decent colors with plain euler and beta scheduler up to CFG 6 but it often generates strange hands. The most consistent results are with euler + sgm_uniform up to CFG 4 but colors still come a bit burned for my taste.
I hope this issue can be resolved in a future version, because otherwise v3 is a great and very promising model.
















