RDBT [Anima]
This is a finetuned model with 10k high aesthetic images paired with natural language captions from LLM. Then distilled to further improve quality and stability. Due to personal needs, training data does not contain any shiny plastic glossy AI image.
It's not overfitted and doesn't have a default style. (Might sound strange for some people, see FAQ.)
I use it as a clean starting point to stack more style LoRAs. I can stack whatever I want and get exactly what I stacked.
See this page for update log.
For advanced users: The RDBT model is trained as LoRA natively. See this page for original LoRA, update more frequently.
This model is based on:
prefix with ym: AnimaYume (hf link) (civitai link).
prefix with b,p: Anima pretrained (hf link)
Sharing merges using this model is not allowed. If someone is selling this model as their own, I'm happy to list them here so everyone knows.
Known model thieves: NukeA.I (behind paywall on tensorart).
I wrote a story about it. Also contains a guide for trainers about "how to bake special trigger word into your model".
Usage:
Settings:
CFG scale: 1~3. This model has been guidance distilled. You can disable CFG (CFG 1) and run the model 2x faster. Cover images are without CFG for demonstration.
Steps: 16+
Prompt
Always specify style, or use a style LoRA. Otherwise, you will get random/mixed style.
This is a feature, not a bug. This model does not provide a overfitted default style.
Quality tags:
It's recommended to omit all the quality tags, or just keep the "masterpiece", if you're not confident. Omitting those redundant tokens allows LLM to pay more attention on other words.
Quality tags have been reinforced during distillation. Thus they don't have noticeable effects. Same as negative tags. If you use cfg, there is no need to dump "score_1, blurry, worst quality, jpeg artifacts, extra arms,... x100 words" into your negative prompt. Those things have been distilled out.
FAQ:
What's wrong with the "default style"?
If an image generation model has a default style, it means the model will unconditionally generate that style, even if you don't specify it in the prompt. This phenomenon has a technical name: overfitting.
If you like this default style, then great, it's always here and you even don't have to prompt. But if you want to generate something other than the default style, unfortunately, you'll find that you cannot overwrite it.
FYI: 95% ckpts you see out there, are just base ckpt + overfitted style LoRAs.
Training settings
All captions are NL from Google Gemini.
Optimizer: adamw, constant lr 0.00002, weight decay 0.1, batch size 16.
LoRA rank/alpha 24.
Timesteps shift 3.
Block 0-2 and adaln linear layers are skipped.
Description
FAQ
Comments (11)
Without picking a specific style, the generated characters lean into a semi-realistic AI look (tested 10+ times). It works perfectly for anime styles once I add the right style tags. Overall it’s pretty good, gonna go test out more style tags next.
Yes, there is no default style. Without style conditions distilled model will give you the "ultimate AI style" that averaged every style. Which looks really bad.
Bossman producing banger after banger
0.29 produces some great results when you're asking for something generalized and well understood but I feel like prompt adherence went out the window for fringe content. Anima as a whole is really good at mashing concepts together using natural language, but throw in some simple contradictions ("A solo illustration of a girl with a penis") and it struggles to understand.
Try and make sure that you specify the style or artist at the beginning of the prompt to lock in that style. Otherwise it can be a little bit wild with all Checkpoints especially the base one. pretty much all of them unless it's specifically stylised towards a specific style.
I know you are not really supposed to use the turbo lora with this checkpoint since it's already turbo/distilled to begin with... buuuut I did anyway and the results are honestly very interesting. A lot less detail but VERY strong 2D anime styles are possible with very stable composition. I kind of love it. Posted some pics.
I've tried it with the Cosmos Predict 2.5 DMD2 LoRA too and it works well.
Like the other person, I tried v0.29 RDBT Checkpoint, Cosmos Predict, and Anima Turbo LoRA all together with 4 steps. Pretty interesting stuff!
V0.29 exceeded all limits in flash-banging with 3D looks/shading
Further we are from V0.24 the worse 3D it gets :(
"Style is required! This model does not provide a default style. You should always prompt specific style. Or use a style LoRA. If you don't give the model style conditions, the model will give you the "ultimate AI style" that averaged every style because of dmd2. This is a "feature", not a bug. I don't like bake a strong style into the model, I prefer having choices."
You need to add some "@artist" tags.















