hyperfusion LoRA 550k images - CivArchive (CivitAI Archive)

hyperfusion LoRA 550k images - v5

NSFW

This LoRA was trained on 550k images of normal to hyper sized anime characters. It focus mainly on breasts/ass/belly/thighs, but now handles more general tag topics as well.

**If you're wondering where the v8 LoRA is, read the Changelog Article.**

Also a back up HuggingFace link for these models

Uploaded 1.4 million custom tags used in hyperfusion here for integrating into your own datasets

Recommendations:
negative (depends on base model, use what works for you): lowres, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, (worst quality, low quality), normal quality, jpeg artifacts, signature, watermark, username, blurry, monochrome, [3d], comic, (sketch), transparent background, artist name
cfg: 9-12
resolution: 768 to 1024, but prefers to be closer to 768
clip skip: 2

sampler: With v7+ try to avoid Karras samplers. Training with --zero_terninal_snr makes karras samplers problematic, but they still work somewhat.

Changelog Article Link

Tag Info (You definitely want to read the tag docs, see :Training Data)

Because hyperfusion is a conglomeration of multiple tagging schemes, I've included a tag guide in the training data download section. It will describe the way the tags work (similar to Danbooru tags), which tags the model knows best, and all my custom labeled tags.
For the most part you can use a majority of tags from Danbooru, Gelbooru, r-34, e621, related to breasts/ass/belly/thighs/nipples/body_shape.

The best method I have found for tag exploration is going to one of the booru sites above and copying the tags from any image you like, and use them as a base. Because there are just too many tags trained into this model to test them all.

Tips

If you are not getting the results you expect from a tag, find other similar tags and include those as well. I've found that this model tends to spread its knowledge of a tag around to other related tags. So including more will increase your chances of getting what you want.
Using the negative "3d" does a good job of making the image more anime like if it starts veering too much into a rendered model look.
Ass related tags have a strong preference for back shots, try a low strength ControlNet pose to correct this, or try one or more of these in the negatives "ass focus, from behind, looking back". The new "ass visible from front" tag can help too.
...more tips in tag docs

Extra

This model took me months of failures and plenty of lessons learned (hence v7)! I would eventually like to train a few more image classifiers to improve certain tags, but all future dreams for now.

As usual, I have no intention of monetizing any of my models. Enjoy the thickness!

Training Hurdles

-Tagging-

The key to tagging a large dataset is to automate it all. I started with the wd-tagger (or similar danbooru tagger) to append some common tags on top of the original tags. Eventually I added an e621 tagger too, but I generally only tag with a limited set of tags and not the entire tag list (some tags are not accurate enough). Then I trained a handful of image classifiers like breast size, breasts shape, innie/outie navel, directionality, motion lines, and about 20 others..., and let those tag for me. They not only improve on existing tags, but add completely new concepts to the dataset. ~~Finally I converted similar tags into one single tag as described in the tag docs~~ (I stopped doing this now. With 3m images it really doesn't matter as much).

Basically any time I find its hard to prompt for a specific thing, I throw together a new classifier, and so far the only ones that don't work well are ones that try to classify small details in the image, like signatures.

Starting in v9 I will be including ~5% captions along side the tags. These captions are generated with CogVLM.

I used this to train my image classifiers
https://github.com/huggingface/transformers/tree/main/examples/pytorch/image-classification

Ideally, I should train a multi-class-per-image classifier like the Danbooru tagger, but for now these single class-per-image classifiers work well enough.

-Poor Results-

For a long time I was plagued with sub par results. I suspected maybe the data was just too low quality, but in the end it just ended up being poorly tagged images. Sites like r-34 tend to have too many tags describing an image like "big breasts, huge breasts, hyper breasts" all on the same image. This is not great for a model where you want specific sizes. Using the classifiers I mentioned above I limited each image to a single size tag for each body part, and the results were night and day.

2023/08/13 Coming back to this after even more experience in labeling/training, I still agree with the above statement. As I tag more and more images, the model becomes more reliable with prompts. You can see this clearly with the new bottomheavy, topheavy, bellyheavy tags. It makes it easier to generate specific body types, and helps the model understand what you want from your prompt. I didn't have to add additional images to make these tags work. I just improved tagging.

-Tag Bleeding-

An example of tag bleeding is using the tag "gigantic breasts", but you end up with everything being gigantic, breasts, ass, thighs. It's been an annoying problem.

2024/03/15 After training some larger models, I find bleeding is less of an issue. It seems the answer is more data.

-Testing-

In order to determine if a new model is better than the last, it's important to have some standard prompts that you can compare with. x/y plot is great for this. Just keep in mind that the seeds between models will be totally different, and you likely need to compare dozens of images at a time and not 1 to 1.

It's also important to compare new models against the base model output to make sure what you are training is actually having an overall positive effect compared to the original model (obvious, but often overlooked).

2023/08/13 The hardest part about testing is trying to determine when you have overcooked your text encoder. At some point the models ability to interpret the prompt starts to degrade when training the text encoder at a high enough LR. I've seen it happen with smaller and larger models alike. Unfortunately I still don't have a good way to test this other than comparing prompts to older models. I've tried training without the text encoder many times, but the results are always sub par or too slow to train. Concepts that are foreign to the base model, are so much better understood with TE enabled in training.

-Software/Hardware-

The training was all done on a 3090 on Ubuntu. The software used is Kohya's trainer, since it currently has the most options to choose from.

Description

Doubled the dataset size to 200k

Lots more image classifiers used to tag images for better size/shape control

Training Notes:

200k+ images from Danbooru, r-34, e621 etc...
LR 8e-5
TE_LR 2e-5
batch 4
GA 32
dim 32 (I reduced this since it didn't make a huge difference and saves space)
conv_dim 32
alpha 16
scheduler: cosine_with_restarts, 33 restarts
base model NAI
Av3 VAE
flip aug
clip skip 2
225 token length
bucketing at 768
tag drop chance 0.15
tag shuffling
47,000 unique tags, but about 500 frequently occurring ones (see attached training data for tag list)
about 4 days training time (if you ignore the dozens of models I trained for days and scraped)

FAQ

Comments (34)

216396May 20, 2023

CivitAI

I'm having trouble getting the best results with the Upgrade. Are there any Tips and Advice?

throwawayjm

Author

May 20, 2023

I would check out the prompts I used in the example images.

216396May 20, 2023

@throwawayjm Ok

MaggieBluxomeMay 26, 2023

CivitAI

Hello! Any tips on having gigantic breasts fully clothed? Most of my outfits have nude breasts bulging THROUGH the clothing. :)

BBBenthusiastMay 26, 2023

I put topless:2 and nipples:2 in the negative prompt and cleavage in the positive prompt. This next one only works sometimes, using clothing lora and textures that can help depending on what they are. Also being very descriptive of the clothes or there being a lot of samples of the clothes in the checkpoint helps. I can get a t shirt to work 90% of the time, button ups about 50% for instance. Thinking about styles of clothes that are already popular for big booba helps.

MaggieBluxomeMay 26, 2023· 1 reaction

@BBBenthusiast Thank you, that sounds about what I'm dealing with. Especially button tops, working about 50% of the time. ;D

I will try your negative prompt tips. :)

throwawayjm

Author

May 26, 2023· 3 reactions

@MaggieBluxome "clothing" and "clothed" are the 2 highest cloth related tags, but I have a feeling because the dataset has more nude breasts than clothed breasts at large sizes, it might be difficult. Maybe in the future I can train a classifier to tag clothed/vs unclothed, but its low on my list of classifiers to train.

Will_MarkerJun 3, 2023

CivitAI

I'm fairly new to this and was wonder what you do with the training images download? I'm guessing it trains the Ai use models that it's trained with but I can't figure out where to put them to get them to work?

throwawayjm

Author

Jun 3, 2023

The Training Data provided is just the tags the model was trained on for informational purposes. The real training data including images is over 100 GB.
Civitai just doesn't provide a place to put tags, so I put them there.

Will_MarkerJun 4, 2023

@throwawayjm Ah okay, thank you for the help!

belover900Jun 22, 2023

CivitAI

what model do you use this with, because im trying to use it on happyaccidents.ai and it just keeps saying All LoRAs and Embeddings must match Stable Diffusion v1.5‘s base model (SD 1.5)

belover900Jun 22, 2023· 10 reactions

nevermind figured it out

enderscar14326Jun 23, 2023· 11 reactions

@belover900 hey, pro tip, when you figure out the solution, rather than just say you found the solution, why not just describe how you fixed it?

1966473Jun 26, 2023

@belover900 I need the solution

MaskedFreakAug 3, 2023

I'd second that solution too

draglorr630Aug 14, 2023

@belover900 HOW?!

1966473Jun 26, 2023

CivitAI

How do I use your loras on happyaccidents.ai? This one doesn’t support any models on there that are SD1.4, and your hyperbreasts one does support SD1.5 but I can’t get it to change anything

throwawayjm

Author

Jun 27, 2023

This LoRA works with SD 1.4 and 1.5, so It sounds like a problem on their side.

walterpaps259Jul 13, 2023

CivitAI

the size thing confuses me can you give a number example?

throwawayjm

Author

Jul 14, 2023

replace <size> with one of: small, medium, big, huge, etc... and give it more weight if you want it more accurate in size.
If the tag docs are not clear enough, let me know

walterpaps259Sep 6, 2023

ok i think i get it

nsfwpersonalaiJul 18, 2023· 1 reaction

CivitAI

No Hyper Pussies?

throwawayjm

Author

Jul 19, 2023· 2 reactions

Not my interest, but it would be easy enough for someone else to make a LoRA of. Only takes 100-200 images.

thebiologist92137Jul 19, 2023· 5 reactions

CivitAI

Suggestion: crotch bulges, hyper genitalia

nsfwpersonalaiJul 28, 2023· 1 reaction

and hyper pussy

jgatsuJul 25, 2023

CivitAI

Hey, I was only able to get v5 to work by using LyCORIS. Same for the hyper breasts extension. Does that seem correct?

throwawayjm

Author

Jul 26, 2023

Normally no. You can use it without the LyCORIS extension, but if you are on certain older versions of A1111 there was a bug loading some LoRAs. A1111 v1.2+ should work as expected.

jgatsuJul 26, 2023· 1 reaction

That was it! Updating fixed it! I can't believe it was something so simple, I feel silly. Thank you!

MaggieBluxomeJul 28, 2023· 1 reaction

CivitAI

Will there be an update for SDXL 1.0?

throwawayjm

Author

Jul 28, 2023· 1 reaction

I'm going to try it eventually. Training v6 on SD1x at the moment

SentenceAug 11, 2023

CivitAI

How should i combine your previous LORAs and this one? Looks like they're just dont work with each other. I've tried to increase or decrease the weights for each one, but it didn't help. Any hints please?

SentenceAug 11, 2023

rn i'm trying to deal with hyperbottom and hyperfusion, but help with others will be useful too.

throwawayjm

Author

Aug 13, 2023

@Sentence set hyperfuison to <:1> and use a much smaller weight for the others <:0.3> to start , along with tags related to what you want. Or just wait a few hours and ill upload the next version of hyperfusion to try it with

SentenceAug 14, 2023

@throwawayjm Thanks for the advice!

LORA

SD 1.4

by throwawayjm

Download (Beta) View on CivitAI