Overview
This model is built on the architecture of NOOBAI XL-VPred 1.0, with some structural modifications.
For versions 1.0 to 3.0: The model was trained on the Danbooru2024 dataset along with Yande Full and e621, using NOOBAI XL-VPred 1.0 and Illustrious XL 1.0 as teacher models during training.
In version 2.0: I used the earlier data plus more than 50k additional images of real-life characters from various sources on the internet.
In version 3.0, I refactored the dataset and added more dataset labels using ChatGPT o3-mini, followed by a manual recheck.
In version 4.0: The model was trained on the danbooru2024, danbooru_newest-all, e621, e621_newest, gelbooru_full, and yande_full datasets, as well as a custom dataset that I collected, labeled in natural language with GPT-4.5, and then verified manually.
For version 1.0: This model focuses on balancing multiple art styles (through the use of trigger prompts) and good anatomy when generating images.
For version 2.0: This version focuses heavily on improving anatomy and enables the creation of more realistic characters (through the use of trigger prompts). Note that this version may reduce the quality of image generation across multiple art styles.
For version 3.0: This version can generate images in multiple styles (similar to version 1.0) while also creating more realistic characters (more lifelike compared to version 2.0) with improved anatomy. However, to achieve the desired image, you need to input a precise descriptive prompt, as it significantly impacts the output.
In version 4.0, to adapt to the large amount of data used for training, I reconstructed the model with some modifications. I also had to train all parts of the model, including CLIP, VAE, and UNet. These changes allow the model to reproduce image styles more accurately and improve character anatomy. In addition, I fixed the issues that occurred in versions 2.0 and 3.0.
Important Note
I personally reconstructed this model, so I’d greatly appreciate any feedback. Your insights won’t just motivate me but will also help me better understand its strengths and weaknesses, allowing me to refine it in the future.
This is a V-prediction model (unlike epsilon-prediction), which requires specific parameter configurations. Please refer to the user guide here.
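For readers unfamiliar with the difference, the sketch below illustrates what V-prediction means. It uses the standard definition from the diffusion literature, not anything taken from this model's code. It assumes a variance-preserving noise schedule (alpha squared plus sigma squared equals 1), the function names are hypothetical, and single numbers stand in for image tensors.

```python
import math

def add_noise(x0, eps, alpha, sigma):
    """Forward process: mix the clean signal x0 with noise eps."""
    return alpha * x0 + sigma * eps

def v_target(x0, eps, alpha, sigma):
    """The quantity a V-prediction model is trained to output."""
    return alpha * eps - sigma * x0

def v_to_x0_and_eps(x_t, v, alpha, sigma):
    """Recover the clean signal and the noise from a v estimate.

    Only valid when alpha**2 + sigma**2 == 1 (assumed here).
    """
    x0 = alpha * x_t - sigma * v
    eps = sigma * x_t + alpha * v
    return x0, eps

# One scalar "pixel" at an arbitrary mid-schedule point (illustrative values).
alpha = math.cos(0.7)
sigma = math.sin(0.7)
x0_true, eps_true = 0.25, -1.3

x_t = add_noise(x0_true, eps_true, alpha, sigma)
v = v_target(x0_true, eps_true, alpha, sigma)
x0_rec, eps_rec = v_to_x0_and_eps(x_t, v, alpha, sigma)
```

A sampler set up for epsilon-prediction would treat the model output as the noise itself and skip this conversion, which is why a V-prediction checkpoint loaded with the wrong setting tends to produce broken images.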
Currently, the model is not available for use via Civitai Generation. You can visit the following website to use it:
Settings for Generating Realistic Characters
For versions 2.0 and 3.0: Add these prompts to generate realistic characters:
Positive prompt: realistic, cosplay, real life, photorealistic
Negative prompt: illustration, blur, film grain, noise, sketch, comic, cartoon, toon, oil painting (medium), flat color, outline, 3D, 2.5D, 2D, unrealistic, game engine style, anime coloring, smooth skin
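If you assemble prompts in a script, the trigger tags above can be appended automatically. This is a minimal sketch: the helper names are hypothetical, the subject tags in the example are placeholders, and placing the trigger tags after the subject tags is an assumption, since this page does not say where they should go.

```python
# Trigger tags copied from the lists above.
REALISM_POSITIVE = ["realistic", "cosplay", "real life", "photorealistic"]
REALISM_NEGATIVE = [
    "illustration", "blur", "film grain", "noise", "sketch", "comic",
    "cartoon", "toon", "oil painting (medium)", "flat color", "outline",
    "3D", "2.5D", "2D", "unrealistic", "game engine style",
    "anime coloring", "smooth skin",
]

def join_tags(*groups):
    """Join tag groups into one comma-separated prompt, dropping duplicates."""
    seen = set()
    ordered = []
    for group in groups:
        for tag in group:
            key = tag.strip().lower()
            if key and key not in seen:
                seen.add(key)
                ordered.append(tag.strip())
    return ", ".join(ordered)

def realistic_prompts(subject_tags, negative_tags=()):
    """Return (positive, negative) prompts with the realism tags appended."""
    positive = join_tags(subject_tags, REALISM_POSITIVE)
    negative = join_tags(negative_tags, REALISM_NEGATIVE)
    return positive, negative

# Placeholder subject tags; "realistic" and "sketch" are deduplicated.
positive, negative = realistic_prompts(["1girl", "solo", "realistic"], ["sketch"])
```

Deduplicating is only a convenience here; it keeps a tag from being counted twice when your own prompt already contains one of the trigger words.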
Recommended Settings
Positive prompt: masterpiece,best quality,amazing quality
Negative prompt: bad quality,worst quality,worst detail,sketch,censor, simple background,transparent background
CFG: 4-6
Clip skip: 2
Steps: 20-30
Sampler: Euler a
Contributed by @Ligmanese
Sampler: Euler Ancestral CFG++
Schedule Type: Simple
Sampling Steps: 25-30
CFG Scale: 1.2-1.5
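The two groups of settings above use very different CFG ranges, so values from one should probably not be mixed with the sampler from the other. The sketch below records both as presets and flags out-of-range values. The class, the preset keys, and the function name are hypothetical, and it assumes the second group (Euler Ancestral CFG++) is the one contributed by @Ligmanese.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass(frozen=True)
class Preset:
    sampler: str
    steps: Tuple[int, int]            # inclusive (min, max)
    cfg: Tuple[float, float]          # inclusive (min, max)
    schedule: Optional[str] = None    # None: not stated on this page
    clip_skip: Optional[int] = None   # None: not stated on this page

PRESETS = {
    "author": Preset("Euler a", (20, 30), (4.0, 6.0), clip_skip=2),
    "ligmanese": Preset("Euler Ancestral CFG++", (25, 30), (1.2, 1.5),
                        schedule="Simple"),
}

def check_settings(preset_name, steps, cfg):
    """Return a list of warnings for values outside the preset's ranges."""
    preset = PRESETS[preset_name]
    warnings = []
    if not preset.steps[0] <= steps <= preset.steps[1]:
        warnings.append(
            f"steps={steps} outside {preset.steps[0]}-{preset.steps[1]}")
    if not preset.cfg[0] <= cfg <= preset.cfg[1]:
        warnings.append(
            f"cfg={cfg} outside {preset.cfg[0]}-{preset.cfg[1]} "
            f"for {preset.sampler}")
    return warnings

ok = check_settings("author", steps=25, cfg=5.0)       # within range
mixed = check_settings("ligmanese", steps=25, cfg=5.0)  # CFG too high for CFG++
```

The ranges are only the recommendations listed on this page; values outside them are not necessarily wrong, so the function returns warnings instead of raising errors.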
Note:
I don’t use any post-processing or LoRA to enhance the example images. They are generated solely using these settings and prompts with my base model.
For comparison and independent evaluation, I used prompts from various sources and authors to generate these example images.
Acknowledgments
Thanks to narugo1992 and Nyanko for sharing such valuable data and Laxhar Lab for providing an amazing model!
Thanks @Sennke for creating the noobReal model! This model has given me more ideas for improving the ChromaYume version 2.0.
If you'd like to support my work, you can do so through Ko-fi!
Comments (9)
Do you have any plans for continuing training the chroma without any real-life data? Because it looks like realistic one should be a separate model tbh.
Hi. I think I will update it. I am looking for a solution for the next version because, currently, version 1.0 is too powerful in the artist's style. It will take me longer.
As you have warned, v2 is weak at artist styles.
Even without the "realistic" tag, images now have a nice but generic 2.5D look with very little resemblance to actual artist pictures. Not my cup of tea, I prefer v1.
Yup, this version is a demo for people who want to generate realistic images. I am currently working on the next version so that it has both realistic and artist styles. This may be a hard challenge. However, I will do it :D
My friend, we just want an even better NoobAI, there are still a lot of weak artist tags, and some anatomy problems, we don't need a generic “realistic” style, don't put garbage in the training. Just improve what's already good and keep the NoobAI prompting.
Thanks.
Note: Illustrious has more recent versions.
Hi. Nice to meet you. This model is still under investigation. :v The second version is different from the first one; therefore, the next version is still being investigated to generate artistic and realistic images. :v (The next version will be more like a combination of these versions.)
I remember in early Noob versions people complained when they added photo images to the training data for the first time, because 2D quality suffered. But in the end everything trained out just fine.
@somedoby That's because a lot of people don't understand models and just parrot what the majority says (the majority being many recent AI users). As you stated, if trained PROPERLY everything will work out fine. It's baffling to me that people wouldn't want a model that can do both; just don't prompt for realism and you won't get it.
Hello, as some people have pointed out, version 2.0 seems to have significantly reduced diversity in generating images with various artist styles. I was already aware of this issue beforehand. The current version 2.0 is a demo model focused on generating realistic images while also incorporating anime with multiple artist styles.
The next version is still in the training and testing phase. As for this version (which I’ve been training for many days now), based on my tests, it has retained the ability to generate artist styles similar to version 1.0 (in my tests, it actually produces better images). Not only that, but it can also generate images of real-life characters instead of the semi-realistic images seen in version 2.0.
I have optimized this version and applied many techniques to create a model capable of generating a wide range of content. I’m still unsure whether to release it as early access or publish it directly (the training cost for this version is significantly higher than the previous one, and I’ve had to stay up many nights trying to improve it :v).

















