RDBT [Anima]
Mid scale finetuned + guidance distilled.
I use it as a starting point to stack more style LoRAs.
See this page for update log. Random experiment, random quality. New version != better version. Feel free to leave feedback.
See this page for the original LoRA (updated more frequently, probably).
Sharing merges using this model is not allowed. Known model thieves: NukeA.I (closed-weight merged model on tensorart).
This model is based on
ym: AnimaYume (hf link) (civitai link). Has latest dataset.
b,p: Anima pretrained (hf link)
Usage:
Settings:
CFG scale: 1~4. This model has been guidance distilled. You can disable CFG (CFG 1) and run the model 2x faster. Cover images were generated without CFG for demonstration.
Steps: 24. Guidance distillation != step distillation. If you need low steps (8~12), try adding a turbo LoRA at 0.2~0.5x strength.
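To illustrate the CFG note above, here is a minimal sketch of the standard classifier-free guidance combination. It shows why CFG 1 needs only one model pass per step instead of two. The function names and sample numbers are hypothetical and are not taken from this model's code or from any particular UI.

```python
# Illustrative sketch of classifier-free guidance (CFG).
# Not this model's sampler code; all names and values are hypothetical.

def cfg_combine(cond, uncond, scale):
    """Standard CFG: uncond + scale * (cond - uncond), element-wise."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def model_calls_per_step(scale):
    """At scale 1 the result equals the conditional prediction,
    so the unconditional pass can be skipped (1 call instead of 2)."""
    return 1 if scale == 1 else 2


cond = [0.5, -1.0, 2.0]    # hypothetical conditional prediction
uncond = [0.25, 0.0, 1.0]  # hypothetical unconditional prediction

# With scale 1 the unconditional term cancels out.
print(cfg_combine(cond, uncond, 1.0))
# With scale 4 the prediction is pushed away from the unconditional one.
print(cfg_combine(cond, uncond, 4.0))
print(model_calls_per_step(1.0), model_calls_per_step(4.0))
```

Because the distilled model has already learned the guided behavior, running at CFG 1 keeps only the single conditional pass, which is where the 2x speedup comes from.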
Prompt
Always specify a style, or use a style LoRA. Otherwise, you will get a random/mixed style. This model does not provide an overfitted default style. This is a feature, not a bug.
Quality tags:
If you're not confident, it's recommended to omit all quality tags, or keep just "masterpiece". Omitting those redundant tokens allows the LLM to pay more attention to the other words.
Quality tags were reinforced during distillation, so they have no noticeable effect. The same goes for negative tags. If you use CFG, there is no need to dump "score_1, blurry, worst quality, jpeg artifacts, extra arms,... x100 words" in your negative prompt. Those things have been distilled out.
Training settings:
~10k images finetuning -> guidance distillation
All captions are NL from Google Gemini.
Optimizer: adamw, constant lr 0.00002.
LoRA rank/alpha 24.
Guidance distillation target CFG 4.
Blocks 0-2 and AdaLN linear layers are skipped.
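The settings above can be sketched as a config plus one common formulation of a guidance distillation target. This is an assumption-heavy illustration: the author's actual training code and loss are not stated. The config keys, the reading of "rank/alpha 24" as rank 24 and alpha 24, the alpha/rank scaling convention, and the MSE loss are all assumptions.

```python
# Hypothetical sketch; the author's real training code is not published here.

# Config keys are made up for illustration; values mirror the listed settings.
TRAIN_CONFIG = {
    "optimizer": "adamw",
    "lr": 2e-5,                 # constant lr 0.00002
    "lr_schedule": "constant",
    "lora_rank": 24,            # assumption: "rank/alpha 24" means both are 24
    "lora_alpha": 24,
    "distill_target_cfg": 4.0,  # guidance distillation target CFG
}


def lora_scale(rank, alpha):
    """Common LoRA convention (assumed here): update is scaled by alpha / rank."""
    return alpha / rank


def distill_target(teacher_cond, teacher_uncond, cfg):
    """Teacher output with CFG applied; the student learns to match it
    in a single conditional pass (one common formulation)."""
    return [u + cfg * (c - u) for c, u in zip(teacher_cond, teacher_uncond)]


def distill_loss(student_pred, target):
    """Mean squared error between student prediction and target (assumed loss)."""
    return sum((s - t) ** 2 for s, t in zip(student_pred, target)) / len(target)


# Toy numbers, not real model outputs.
target = distill_target([0.5, -1.0], [0.25, 0.0], TRAIN_CONFIG["distill_target_cfg"])
print(lora_scale(TRAIN_CONFIG["lora_rank"], TRAIN_CONFIG["lora_alpha"]))
print(target)
print(distill_loss([0.0, 0.0], target))
```

With rank equal to alpha, the assumed scale factor is 1.0, so the LoRA update is applied at full strength.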
Comments (13)
This poster claims to have made NAG work with Anima using turbo and CFG 1, but with your model at least, it changes the output too much. Can you take a look at it? Maybe we need specific settings for it to work with your model. https://www.reddit.com/r/StableDiffusion/comments/1sto22j/i_implemented_nag_normalized_attention_guidance/?
I feel like the v27 model is losing its anime style
The further away from V0.24, the less anime it is and the more 3D and 2.5D pollution in data there is :(
You're the stability king for a reason. You have experience with all the preview models, so I trust yours the most. The official turbo LoRA is decent, but unlike yours it isn't as detailed and doesn't follow the prompt exactly. Also, this is the official NAG, which finally got Anima support. However, my experience with it has been a little mixed:
https://github.com/BigStationW/ComfyUI-NAG-Extended/tree/main
The official Turbo LoRA has the superior style preservation. RDBT has the superior compositions. And something went horribly wrong with the NAG implementation. Hope it gets fixed.
@deitychaser I noticed that with NAG. To be honest, I don't think it's really great on Anima. I think it's designed more for distilled models like Flux Klein 9b/4b and Z-Image Turbo, not really for a base model with a distilled LoRA.
@AnimaXx Maybe, not sure. In my experience, it works better than CFG as negative conditioning guidance on non-distilled Chroma, though. Simply because you can push the NAG guidance up to 6.0 or so, which you can't do with CFG without frying your image. So in my tests with NAG I could get more control.
Hi boss, is there any possibility that the LoRA version will be released again? Being able to control the strength of the LoRA/distilled model at any time is really useful.
For example, I also use the LoRA for distillation and it seems to be better, because the LoRA built into the model seems to me to have less power overall.
Probably. The reason I decided to release a ckpt is that I think mixing distilled models is not a good idea. A distilled LoRA is not a style LoRA. A distilled LoRA changes how the model works.
@reakaakasky there seem to be many people who think RDBT makes @artist less effective, and it seems that you don't cite the artists in your work, which is how your distill differs from the official version. What do you think about this issue?
@reakaakasky so putting a distilled lora in a non-distilled base model or vice-versa, either is a bad idea?
@m4rbleye mixing different distilled models, e.g. 0.5 rdbt + 0.5 turbo
@jimzlf if I had 100x the compute, I might consider taking care of all 20k+ @ tags, but I don't.







