RouWei - v0.8.0 epsilon

NSFW

In depth retraining of Illustrious to achieve best prompt adherence, knowledge and state of the art performance.

Big dreams come true

The version number is just an index of current final release, not a fraction of the planned training.

HF repo

Large scale finetune using gpu cluster with a dataset of ~13M pictures (~4M with natural text captions)

Fresh and wast knowledge about characters, concepts, styles, cultural and related things
The best prompt adherence among SDXL anime models at the moment of release
Solved main problems with tags bleeding and biases, common for Illustrious, NoobAi and other checkpoints
Excellent aesthetics and knowledge across a wide range of styles (over 50,000 artists (examples), including hundreds of unique cherry-picked datasets from private galleries, including those received from the artists themselves)
High flexibility and variety without stability tradeoff
No more annoying watermarks for popular styles thanks to clean dataset
Vibrant colors and smooth gradients without trace of burning, full range even with epsilon
Pure training from Illustrious v0.1 without involving third-party checkpoints, Loras, tweakers, etc.

There are also some issues and changes compared to the previous version, please RTFM.

Dataset cut-off - end of April 2025.

Features and prompting:

Important change:

When you are prompting artist styles, especially mixing several, their tags MUST BE in a separate CLIP chunk. Just add BREAK after it (for A1111 and derivatives), use conditioning concat node (for Comfy) or at least put them in the very end. Otherwise, significant degradation of results is likely.

Basic:

The checkpoint works both with short-simple and long-complex prompts. However, if there are contradictory or weird things - unlike with others they won't be ignored affecting the output. No guide-rails, no safeguards, no lobotomy.

Just prompt what you want to see and don't prompt what shouldn't be on the picture. If you want to have a view from above - don't put ceiling into positive, if you want to have crop view with head out of frame - don't make detailed description of character facial features, and so on. Pretty simple but sometimes people are missing it.

Version 0.8 comes with advanced understanding of natural text prompts. It doesn't mean that you are obligated to use it, tags only - completely fine, especially because understanding of tags combinations is also improved.

Do not expect it to perform like Flux or other models based on T5 or LLM text encoders. The whole size ot SDXL checkpoint is less then only that text encoder, in addition illustrious-v0.1 which is used as the base completely forgot a lot of general things from vanilla sdxl-base.

However, even in current state it works much better, allows to do new things usually impossible without external guidance, as well making manual editing, inpainting, etc more convenient.

To achieve best performance you should keep track of CLIP chunks. In SDXL the prompt is separated into a chunks of 75 (77 including BOS and EOS) tokens, that are processing by CLIP separately, and only then are concatinating and comes as conditions to unet.

If you want to specify some features for character/object and separate them from other prompt parts - make sure they are in the same chunk and optionally separate it with BREAK. It will not solve problem of traits mixing completely, but can reduce it improving overall understanding, since text encoders on RouWei are able to process the whole sequence, not individual concepts better then others.

Dataset contains only booru-style tags and natural text expressions. Despite having a share of furries, real life photos, western media, etc. all captions have been converted to classic booru style to avoid a number of problems from mixing of different systems. So e621 tags won't be understanded properly.

Sampling parameters:

~1 megapixel for txt2img, any AR with resolution multiple of 32 (1024x1024, 1056x, 1152x, 1216x832,...). Euler_a, 20..28steps.
CFG: for epsilon version 4..9 (7 is best), for vpred version, 3..5
Sigmas multiply may improve results a bit, CFG++ samplers work fine. LCM/PCM/DMD/... and exotic samplers untested.
Some schedulers doesn't work well.
Highresfix - x1.5 latent + denoise 0.6 or any gan + denoise 0.3..0.55.
For vpred version lower CFG 3..5 is needed!

For vpred version lower CFG 3..5 is needed!

Quality classification:

Only 4 quality tags:

masterpiece, best quality

for positive and

low quality, worst quality

for negative.

Nothing else. Actually you can even omit positive and reduce negative to low quality only, since they can affect basic style and composition.

Meta tags like lowres have been removed and don't work, better not to use them. Low resolution images have been either removed or upscaled and cleaned with DAT depending on their importance.

Negative prompt:

worst quality, low quality, watermark

That's all, no need of "rusty trombone", "farting on prey" and others. Do not put tags like greyscale, monochrome in negative unless you understand what are you doing. Extra tags for brightness/colors/contrast section below can be used

Artist styles:

Grids with examples, list/wildcard (also can be found in "training data").

Used with "by " it's mandatory. It will not work properly without it.

"by " is a meta-token for styles to avoid mixing/misinterpret with tags/characters of similar or close name. This allows to have a better results for styles and at the same time avoid random style fluctuation that you may observe in other checkpoints.

Multiple give very interesting results, can be controlled with prompt weights and spells.

YOU MUST ADD `BREAK` after artists/style tags (for A1111) or concat conditioning (for Comfy) or put them in the very end of your prompt.

For example:

by kantoku, by wlop, best quality, masterpiece BREAK 1girl, ...

General styles:

2.5d, anime screencap, bold line, sketch, cgi, digital painting, flat colors, smooth shading, minimalistic, ink style, oil style, pastel style

Booru tags styles:

1950s (style), 1960s (style), 1970s (style), 1980s (style), 1990s (style), 2000s (style), animification, art nouveau, pinup (style), toon (style), western comics (style), nihonga, shikishi, minimalism, fine art parody

and everything from this group.

Can be used in combinations (with artists too), with weights, both in positive and negative prompts.

Characters:

Use full name booru tag and proper formatting, like karin_(blue_archive) -> karin \(blue archive\), use skin tags for better reproducing, like karin \(bunny\) \(blue archive\). Autocomplete extension might be very useful.

Most characters are recognized just by their booru tag, but it will be more accurate if you describe their basic traits. Here you can easily redress your waifu/husbendo just by the prompt without suffering from the typical leaks of basic features.

Natural text:

Use it in combination with booru tags, works great. Use only natural text after typing styles and quality tags. Use just booru tags and forget about it, it's all up to you. To get best performance keep track if CLIP 75 tokens chunks.

About 4M of images in dataset had hybrid natural-text captions, made by Claude, GPT, Gemini, ToriiGate, then refactored, cleaned and combined with tags in different variations for augmentation.

Unlike typical captions, these contains character names which is very useful. Better to keep it clean, short and convenient description works best. Better not use long and sloppy BS like

A mysteriously enchanting feminine entity of indeterminate yet youthful essence, whose celestial visage radiates with the ethereal luminescence of a thousand dying stars, blessed with locks cascading like the golden rivers of ancient mythology, perhaps styled in a manner reminiscent of contemporary fashion trends though not necessarily adhering to any specific aesthetic paradigm. Her eyes, pools of unfathomable depth and hue, sparkle with the wisdom of millennia yet maintain an innocent quality that defies temporal constraints...

For captioning you can use ToriiGate in short mode.

And don't expect it to be as good as flux and others, it tries very hard and after several rolls usually you can get what you want, but it is not that stable and detailed.

Oh yeah

tail censor, holding own tail, hugging own tail, holding another's tail, tail grab, tail raised, tail down, ears down, hand on own ear, tail around own leg, tail around penis, tailjob, tail through clothes, tail under clothes, lifted by tail, tail biting, tail penetration (including a specific indication of vaginal/anal), tail masturbation, holding with tail, panties on tail, bra on tail, tail focus, presenting own tail...

(booru meaning, not e621) and many others with natural text. The majority works perfectly, some requires a lot of rolling.

Brightness/colors/contrast:

You can use extra meta tags to control it:

low brightness, high brightness, low saturation, high saturation, low gamma, high gamma, sharp colors, soft colors, hdr, sdr

Example

They work both in epsilon and vpred version and works really good.

Epsilon version relies on them too much. Without low brightness or low gamma or limited range (in negative) it might be difficult to achieve true 0,0,0 black, the same often true for white.

Both epsilon and vpred versions have like true zsnr, full range of colors and brightness without common flaws observed. But they behaves differently, just try it.

Vpred version

Main thing you need to know - lower your CFG from 7 down to 5 (or less). Otherwise, the use is similar with advantages.

It seems that starting from v0.7 vpred works flawlessly now. It shouldn't suffer from ignorance of tags close to the 75tokens chunk borders like nai. It is more difficult to get burned images - even on cfg7 usually it just over-saturated but with smooth gradients, which can be useful for some styles. Yes it can make anything from (0,0,0) to (255,255,255). You will find brightness meta tags described above quite useful for easier/lazy prompting, natural text expressions also work. To get the most dark image - put high brightness into negative and/or use low brightness, low gamma tags. If you don't like very bright skin on dark background and want to reduce contrast (or on the contrary, enhance the effect) - use hdr/sdr in negative/positive.

It was reported that in rare cases on some prompts there is a drop in contrast. Looks like other vpred models have same behaviour with such prompts, adding a "separator" closer to the border of the 75-token chunk fixes this. However, with 0.7 I haven't encountered this myself.

To launch vpred version you will need dev build of A1111, Comfy (with special loader node), Forge or Reforge. Just use same parameters (Euler a, cfg 3..5, 20..28 steps) like epsilon. No need to use Cfg rescale, but you can try it, cfg++ works great.

Base model:

The model here has a small unet polishint after main training to improve small details, bump up resolution and others. Hovewer, you may be also interested into a RouWei-Base, which sometimes can perform better at complex prompts despite having minor mistakes in small details. It also comes in FP32, for example if you want to use fp32 text encoder nodes in Comfy, merge it or finetune.

It can be found in Huggingface repo

Known issues:

Off course there are:

Artists and style tags must be seperated into a different chunk from main prompt or come very last
There may be some positional or combinational bias in rare cases, but it's not yet clear.
There are some complaints about few of the general styles.
Epsilon version relies too much on brightness meta tags, sometimes you will need to use them to get desired brightness shift
Some newly added styles/characters might be not as good and disctinct as they deserve to
To be discovered

Requests for artists/characters in future models are open. If you find artist/character/concept that perform weak, inaccurate or has strong watermark - please report, will add them explicitly. Follow for a new versions.

JOIN THE DISCORD SERVER

License:

Same as illustrious. Fell free to use in your merges, finetunes, ets. but please leave a link or mention, it is mandatory

How it's made

I'll consider to make a report or something like it later. For sure.

In short, 98% of work is related to dataset preparations. Instead of blindly relying on loss-weighting based on tag frequency from nai paper, a custom guided loss-weighting implementation along with asynchronous collator for balancing have been used. Ztsnr (or close to it) with Epsilon prediction was achieved using noise scheduler augmentation.

Spent compute - over 8k hours of H100 (apart from research and fail attempts)

Thanks:

First of all I'd like to acknowledge everyone who supports open source, develops in improves code. Thanks to the authors of illustrious for releasing model, thank to NoobAI team for being pioneers in open finetuning of such a scale, sharing experience, raising and solving issues that previously went unnoticed.

Personal:

Artists wish to remain anonymous for sharing private works; Few anonymous persons - donations, code, captions, etc., Soviet Cat - GPU sponsoring; Sv1. - llm access, captioning, code; K. - training code; Bakariso - datasets, testing, advices, insides; NeuroSenko - donations, testing, code; LOL2024 - a lot of unique datasets; T.,[] - datasets, testing, advises; rred, dga, Fi., ello - donations; TekeshiX - datasets. And other fellow brothers that helped. Love you so much ❤️.

And off course everyone who made feedback and requests, it's really valuable.

If I forgot to mention anyone, please notify.

Donations

If you want to support - share my models, leave feedback, make a cute picture with kemonomimi-girl. And of course, support original artists.

AI is my hobby, I'm spending money on it and not begging for donations. However, it has turned into a large-scale and expensive undertaking. Consider to support to accelerate new training and researches.

(Just keep in mind that I can waste it on alcohol or cosplay girls)

BTC: bc1qwv83ggq8rvv07uk6dv4njs0j3yygj3aax4wg6c

ETH/USDT(e): 0x04C8a749F49aE8a56CB84cF0C99CD9E92eDB17db

XMR: 47F7JAyKP8tMBtzwxpoZsUVB8wzg2VrbtDKBice9FAS1FikbHEXXPof4PAb42CQ5ch8p8Hs4RvJuzPHDtaVSdQzD6ZbA5TZ

if you can offer gpu-time (a100+) - PM.

Description

Major update

FAQ

Comments (58)

AquaShadesMay 25, 2025· 3 reactions

CivitAI

❤❤❤

alternative_UniverseMay 25, 2025

CivitAI

What's on the major uptade?

Minthybasis

Author

May 25, 2025

A new version

alternative_UniverseMay 25, 2025

@Minthybasis right 😅, but will you provide more info about it? v8 is a big event 🙌🏻

Minthybasis

Author

May 25, 2025· 1 reaction

@alternative_Universe Definitely. Description updated, some news and discussions you can find in discord. If you're about detailed report - yes, but now I need to take a break and rest for a bit while vpred is baking.

rerolls26May 25, 2025· 6 reactions

CivitAI

Massive update, waiting for the v-pred

EnigmataMay 25, 2025

CivitAI

Is v-pred version finetune of noobai vpred?

Minthybasis

Author

May 25, 2025· 1 reaction

No, it is a special tune of rouwei-base.

somedobyMay 25, 2025· 5 reactions

CivitAI

Mr, your model is awesome! Well documented too. I especially like color/brightness tags, long awaited. And no crazy negatives and dozen custom nodes just to make model work. Is it a miracle?

Keep up great work!

bl4ckfuture107May 25, 2025· 6 reactions

CivitAI

And Minthy does it again. A huge improvement over 0.7, use BREAK like Minthy recommends, and everything will be fine, for those on Comfy, you might use the CLIP Text Encode (BREAK) for A1111-like BREAK function. Use euler_a with sgm_uniform for the best results.

AquaShadesMay 25, 2025· 1 reaction

and for those who don't want to use custom nodes, Comfy has the BREAK function baked in via conditioning concat nodes 👍

bl4ckfuture107May 26, 2025· 1 reaction

@AquaShades Exactly, thanks for the heads up!

QH96May 26, 2025· 4 reactions

CivitAI

Always nice to see new models.

NaGaRiMay 26, 2025

CivitAI

Is there any TE difference between the v-pred and epsilon versions?

Minthybasis

Author

May 26, 2025

Current last vpred version - 0.7, it has a different from v0.8 TE. The upcoming 0.8 vpred will have the same encoder as epsilon 0.8, unless something unless some special reason for change.

NaGaRiMay 26, 2025· 2 reactions

@Minthybasis thank you for your works

IJDEIHMay 26, 2025· 1 reaction

CivitAI

I love this model and appreciate everything you've done. I'm a big fan of all your work.

reakaakaskyMay 26, 2025· 1 reaction

CivitAI

"training_set": “a video url of Kon” 🤣

allamallaMay 26, 2025· 4 reactions

CivitAI

8.0 released a few hours ago and there were no fireworks outside my house!?

Ghost_NameMay 26, 2025· 2 reactions

CivitAI

Very thank you for New UPDATE, big love to you! <3 and your work

VecthralMay 26, 2025· 1 reaction

CivitAI

Version 0.8 has some very significant improvements, and the NL capabilities of the new version are impressive.

The main issues in version 0.7 were the default colors being too gray and the fidelity of the artist tags not being enough, I would say these issues have been resolved now.

Thank you for your hard working.

randomlygeneratedMay 26, 2025· 3 reactions

CivitAI

RouWei continues to be my favorite model for raw experimentation. Glad to see the new update, hope it brings some more attention to the line!

dfijgklerhjkldghtjykghljgMay 26, 2025· 2 reactions

CivitAI

I NEED ROUWEI BASE 0.8

Minthybasis

Author

May 26, 2025· 1 reaction

https://huggingface.co/Minthy/RouWei-0.8 Here

dfijgklerhjkldghtjykghljgMay 27, 2025· 5 reactions

CivitAI

the anime sub genre end user is in dire need of a good base model, and ROUWEI is that one. instead of paying "STARDUST" to onomaAI, the user should all come to Rouwei. Yes the person he/she might spend dollars on cosplay girls but the model is god damn good.

me personally jumped ship from illustrious base to noobai 1.1 for all my training, just recently, and now rouwei 0.8 dropped, it is the best higher high. I'm SO fortunate.

also personally I would suggest all remixer fine tuners to change base model from illustrious to rouwei as well, just like wai-NSFW, the illustrious base is the most maintained version but noobai/rouwei branch are superior.

dfijgklerhjkldghtjykghljgMay 27, 2025

unpopular opinion, what if Rouwei ditched illustrious entirely?

the concern of it is the same issue of pony and illustrious, apparently the base structure is too big of a change, they don't work well interchangeably, it is the same as ArtiWAifu diffusion, it is trained directly from sdxl 1.0, and the model is good but I can't use it unless I retrain all my stuff with it again.

another wickd opinion of mine is anime sub genre model makers might be better off to join force and make one singular base model, cause it is now clear the resource is not spent efficiently, from the few base model that I'm aware of, the maker all made significant change to its infrastructure, and considerable money spent renting GPUs, it is scatter across different individual model and it might be a waste. also it may be really tiring for fine tuners/remixers to keep up versioning to various base model, it is simply too much.

but people issue is the real issue, it is not university, teens been forced put together in a group project. that won't work.

Minthybasis

Author

May 27, 2025· 1 reaction

@dfijgklerhjkldghtjykghljg Well, you're right about it. Illustrious is a great starting point, that allowed to achieve nice results in v0.6 without spending too much of compute. But the longer this goes on, the clearer a number of problems become, especially about strong biases and lobotomizing most of general knowledge from sdxl-1.0. To add a new data about anime culture is easier then to reintroduce basic general things.

The model has been developed and trained gradually for more than half year, and step-by-step approach was the only option at start. If I had all the knowledge, experience and money spent for compute, I would undoubtedly start training with the sdxl-base. Especially since there already are some experience in this area.

As for joining forces - that's a complicated thing, like you said. The only successful project of this kind that united many creators is NoobAi and there were nuances there.

In general, building and organizing effective team is a difficult task in itself. Doing it among creative people, each of whom has own attitude, opinion, priorities, tastes may disagree with others, etc. is doubly difficult. And in open source, where there is no clear hierarchy, people are not held by money and responsibilities, but do mostly on pure enthusiasm - kind of marvel.

I don't mind of collaboration or joining with others, but to take all organization upon myself - no way.

allamallaMay 27, 2025· 1 reaction

The absolute dream model would be trained on SDXL or perhaps AlbedoaBase for the greatest variety of base concepts/artist styles, plus the danbooru and pixiv/artstation datasets with little or minimal character captioning but detailed captioning on objects, concepts, art technique, composition, and visual features. Maximum artistic expressiveness without lobotomizing the model to know the outfit associated with a character that has 50 images.

And233May 27, 2025

CivitAI

Have you upscaled the picture in dataset? It seems that the overall generated effect is not very sharp. Some details and strokes are quite blurry.

Minthybasis

Author

May 27, 2025

There are upscaled images to achieve coverage of rare characters and concepts. But a DAT upscaler, which doesn't give blurry effects, have been used and the overall share of such pictures is less then 1.5%, so it can't affect.

Sharpness or blurriness significantly depends from used styles and generation mode. Some artists have more soft stokes, others do a very sharp images. 4 channel sdxl vae also makes it more difficult to make small details without unwanted biases, and some things are even impossible. But it can be solved with simple upscale/highresfix.

Peoples are using various detail tweaker loras to get improvements, but side effects may occur.

Also you can try base version without final polishing, some people like it more.

And233May 27, 2025

@Minthybasis yep, I know some artists have more soft stokes, but what I mean is that for those artists with sharper art styles, these artists are "thicker" and, maybe more "western"(?) than other illustrious finetune, like IllumiYume XL or NoobAI. And I'm hard to reduce this effects, with lora or negative or hires fix.

Actually, in my opinion, this problem also occurred in last year's 4th tail, which even become a representative feature to your model (just in my sight).

It's difficult for me to describe it more specifically because it's a bit subjective. I'm sorry if it caused any inconvenience to you.

By the way, I have already try the base version, but this "special feature" still exists.

Minthybasis

Author

May 27, 2025· 2 reactions

@And233 That's a quite complicated topic. Different models may have different default style, for example, NoobAi tends to produce a sharper images, which looks very good for some styles but not so pretty for others that requires smooth gradients. It can be fixed with training (lora is fine), but sometimes, especially for vpred, you have to use tricks with noise scheduler to achieve best result.

These base biases may come from quality tags based on applied classification, from the dataset itself, or training approach. It may also be fair to say that the default style represents some average for the dataset. For example if you go to e621, then sort pictures by score and download a large chunk - average style and popular patterns may appear to look quite familiar.

In RouWei on the other hand, the dataset is based mostly on fresh data and has bias to 2.5d, despite attempts to fix it. Omitting quality tags, leaving only low quality in negative may help to improve things a bit.

Probably the easiest way to fix it - just make a fast aesthetic tune with small dataset, then merge few tweakers to add sharpness (or just multiply selected weights) and claim result as a "brand new trained checkpoint" like the absolute majority of models published. The default style will be strong and attractive, but there will be a price for this with lots of side-effects, especially in style mimicking, versatility for pictures other than 1girl standing, even visible artifacts and so on.

No, you are absolutely right here, discussing issues and shortcomings is the only way to development. I'll think about it in future, may be introducing something to get more control over it, or preparing aesthetic versions. Also I hope there will be derrivatives where people do it.

Btw, if you could upload some specific examples with full metadata anywhere for investigation, I would be very grateful.

And233May 28, 2025

@Minthybasis oh, according to your instructions, I might be able to explain this feeling, maybe it just comes from the bias to 2.5d, which make the output less flat for some artists need flat.

schneesturmx91988May 27, 2025

CivitAI

i love testing models for sure using forge on local but this model gives me alltime full destroyed images as result and forge has all types in Sd flux xl

Kaligo731May 27, 2025

Are you using 0.7 V-Pred? V-Pred requires custom settings.

Minthybasis

Author

May 27, 2025

Which settings, samplers, schedulers, etc. do you use?

schneesturmx91988May 28, 2025

@Minthybasis use every time euler a and i tryed the newest version

NoMethodError553672May 28, 2025· 1 reaction

V-Pred with Forge works out of the box. (SDE 3M + AYS or SGM), any resolution, Detail Daemon, FreeU, Dynamic Thresholding, Self Attention, ADetailer

Minthybasis

Author

May 28, 2025

@schneesturmx91988 can you upload any broken pic with metadata to catbox or somewhere else?

schneesturmx91988May 29, 2025

@Minthybasis i would but i deleted it + u can see nothing its more then pixel chaos u can nothing see out of white points or color points so i think euler a is not working on forge ui

Minthybasis

Author

May 29, 2025

@schneesturmx91988 Both vpred and epsilon versions are working with vanilla a1111, latest webui-forge, reforge, forge-classic, comfy and others with euler sampler. May be you've got some issues related to sdxl emphasis from older versions, some incompatible scheduler, or something else. Of course, the noise in the picture won't tell the source of the problem, but what's needed here is the generation metadata. Without any intel one can only guess.

schneesturmx91988May 31, 2025

@Minthybasis i know i have a so far since a upgrade to a 2tb ssd a fresh installed version from github + updated euler a sampler is mostly at automatic no vae set . maybe i give it a try later again . so far i have with other test models no problems out of this and one of the older realistic models that i tryed to test . sry for mostly late responses iam sometimes busy to train own models on local + fixing issues in gens. but i tryed around . so i will look later around again at it

schneesturmx91988May 31, 2025· 1 reaction

@Minthybasis works now i dont know what it was but now i have a clean image

mutageneMay 29, 2025

CivitAI

I trained a LoRA with v0.7 vpred in OneTrainer using the same dataset and settings as I did with NoobAI vpred, and I'm extremely impressed with the accuracy and how well it handles my one-word "outfit tags", but for whatever reason, it seems to have baked overly high saturation into it. Do I need to use different training settings than for NAI?

reakaakaskyMay 30, 2025

I was also wondering why there are over saturation issues when applied illustrious lora onto v0.8 eps.

Because of the "noise offset" in the base model, i guess?

Minthybasis

Author

May 31, 2025

@reakaakasky Hm, this might be related to noise-scheduler, for example if you're using debiased. How do results look and was there something related in training settings?

In all epsilon versions a pyramid noise approach have been applied to achieve range and reduce brightness-color biases. If on top of it will be applied a lora trained with noise-offset - outcome is unknown, but likely nothing good. Loras trained without it should work fine, or at least not cause serious issues. For vpred models no noise-offset should be used.

mutageneMay 31, 2025

@Minthybasis https://i.imgur.com/yjCa7Gj.png
These are the noise-related settings that I used, anything wrong with them? I am very new to offline training (I only just gained a system capable of it a few weeks ago), and these are the settings that were given in a guide for training with NoobAI vpred.

reakaakaskyMay 31, 2025

@Minthybasis maybe the pyramid noise.

The LoRA I used doesn't have noise offset. But the dataset has a very wide brightness and color range all most in every images. So in theory, the predicted noise will strong enough to reach the extreme colors.

It works fine on illustrious and NoobAI. But on rouwei it has over saturation issue and i have to lower the CFG from 6 to 3.

my guessing is pyramid noise in base model + LoRA with such dataset = predicted noise overshot.

cds899May 30, 2025· 2 reactions

CivitAI

Very exciting! But I am curious, why Illustrious v0.1 over 1.0 or 2.0?

Minthybasis

Author

May 31, 2025· 5 reactions

This is simple, v0.7 is based on top of v0.6, which was made from Illustrious v0.1. When it was trained Rouwei v0.6 seems to be better then Illustrious v1.0. V0.8 is trained on top of v0.7, before start I've made numerous comparison, and it turned out that Ill. v2 is significantly inferior in understanding prompts, knowledge, the number of biases, despite the statements.

So, since the first branching there was no point to start over again, chasing illusory improvements from base.

cds899May 31, 2025

@Minthybasis Got it, thanks a lot!

gannibalJun 1, 2025· 2 reactions

CivitAI

First of all, great work!

On a side note, do you have any idea why lora extracts of Rouwei from illustrious-0.1 won't work with NoobXL models? Are they that different? Simple checkpoint merges seem to work fine, but not a lora extract on top of NoobXL model. It results in pure noise.

Minthybasis

Author

Jun 1, 2025

Thank you. Yes, after a long training it is a completely different checkpoint compared with base illustrious. There are some compatibility but it depends from case.

Adding difference on top might work if one of merging model contains just small adjustments and tweaks and very close to origin. With a different models this will lead to disaster.

In this case since noob and rouwei are not completely different and share same base - various types of averaging should work.

IJDEIHJun 2, 2025· 3 reactions

CivitAI

After a few days using EPS 0.8, I found myself preferring Base 0.8.

EPS 0.8 seems very sensitive to prompts. Even minor tag changes can drastically change the resulting image.

For example, when I change a tag from 'smile' to 'angry', the whole theme of the image suddenly changes, even with the same artist tag. And in this problem, it was more stable in the base version.

You may like to train your Lora on the base version as well, since it provides a more neutral result.

But still, it's a very good model with new image data, and I can't wait to see the v-pred version.

xaded18618103Jun 9, 2025

Try using prompts that change, like [smile:angry:0.5]. First half of steps will be smile, second half of steps will be angry.

OneRingJun 5, 2025· 6 reactions

CivitAI

I would like to know the approximate release date of vpred 0.8, if possible :3

Minthybasis

Author

Jun 5, 2025· 6 reactions

Likely this Sunday

omnisaaJun 8, 2025· 1 reaction

CivitAI

Very good model, as a merge base and after some mixing and clip replacement, I kept about 95% of the target main model style on my private model, and got the new knowledge base updated to April 25 in 0.8, which is almost a lossless upgrade

Checkpoint

Illustrious

by Minthybasis

Download (Beta) View on CivitAI

base model

anime

Details

Downloads

2,958

Platform

CivitAI

Platform Status

Available

Created

5/25/2025

Updated

6/28/2026

Deleted

Files

rouwei_v080Epsilon.safetensors

Size:

6.46 GB

SHA256:

1aba15decd15da1810054ed4c58984610dfafc728bef2fb3f3ebe846b2e287a4

Mirrors

HuggingFace (8 mirrors)

rouwei_v080Epsilon.safetensors

rouwei_080_epsilon_fp16.safetensors

rouwei_v080Epsilon.safetensors

CivitAI (2 mirrors)

rouwei_v080Epsilon.safetensors

rouweu_v10.safetensors

ModelScope CN (1 mirrors)

rouwei_080_epsilon_fp16.safetensors

rouwei_v080Epsilon_trainingData.zip

Size:

666.42 KB

SHA256:

99042258aa3e20f4309c669e80fac53326dc246036ee449cc47eab0f6dbfe11b

Mirrors

CivitAI (2 mirrors)

rouwei_v080Epsilon_trainingData.zip

rouwei_v080Vpred_trainingData.zip

Available On (1 platform)

Same model published on other platforms. May have additional downloads or version variants.

SeaArt

RouWei - v0.8.0 epsilon

In depth retraining of Illustrious to achieve best prompt adherence, knowledge and state of the art performance.

Large scale finetune using gpu cluster with a dataset of ~13M pictures (~4M with natural text captions)

There are also some issues and changes compared to the previous version, please RTFM.

Features and prompting:

Important change:

Basic:

Sampling parameters:

For vpred version lower CFG 3..5 is needed!

Quality classification:

Negative prompt:

Artist styles:

Used with "by " it's mandatory. It will not work properly without it.

YOU MUST ADD BREAK after artists/style tags (for A1111) or concat conditioning (for Comfy) or put them in the very end of your prompt.

General styles:

Booru tags styles:

Characters:

Natural text:

Lots of Tail/Ears-related concepts:

Brightness/colors/contrast:

Vpred version

Base model:

Known issues:

Requests for artists/characters in future models are open. If you find artist/character/concept that perform weak, inaccurate or has strong watermark - please report, will add them explicitly. Follow for a new versions.

JOIN THE DISCORD SERVER

License:

How it's made

Thanks:

Personal:

Donations

Description

FAQ

What is RouWei?

How do I use RouWei?

What should I watch out for with Illustrious models?

What other Illustrious-based models are worth knowing?

Can I use this model commercially?

What files are available and where can I download them?

Comments (58)

Details

Files

rouwei_v080Epsilon.safetensors

Mirrors

rouwei_v080Epsilon_trainingData.zip

Mirrors

Available On (1 platform)

YOU MUST ADD `BREAK` after artists/style tags (for A1111) or concat conditioning (for Comfy) or put them in the very end of your prompt.