TAME Pony: The Authenticity MachinE
IMPORTANT ANNOUNCEMENT!!! - Please do NOT use anything with the word "Karras" in it to generate images with any of the TAME models. They are NOT compatible, and you will NOT get clean images. Best settings are Euler a with SGM Uniform scheduler.
You can use other samplers, but always choose SGM Uniform or Simple as your scheduler (unless you have read the version information on the right, which provides a fix for other schedulers, depending on the TAME version you are using).
Remember: the showcase images are meant to be reproducible. There are no special techniques used to produce them, just good settings and a mild upscale (1.5x-2x) using highres fix. If you are not getting images that nice, the problem is your settings. I suggest downloading one of the sample images and loading it into your image generator, then using its settings for your own prompts until you find settings that work better for you.
Version 2.5 is here!
Like version 2, the changes in this version have been achieved only through merging.
Since I had a lot of feedback that people missed the vivid colour and Pony knowledge (characters, men) from version 1, I went back and reworked the recipe from the start to try to get as much of that as possible back into the model while retaining the improvements from version 2.
I also improved the diversity somewhat. It's still limited, but at least now you can actually prompt for Asian girls and get them!
I did have to compromise slightly on anatomy and realism, but overall I think it worked pretty well. Let me know what you think in the comments.
Version 2.5 Usage:
I like CFG 5 for this one, but anything from 2-7 generally works well. It has the same wide CFG range as version 2, so feel free to crank it up higher as long as you raise the steps to compensate.
I haven't tested the samplers as thoroughly as I did with version 2, but the DPM samplers are still working well with SGM Uniform or Simple. Still not working out of the box with Karras, but it does work if you set sigma min to 0.1 (note that Version 2 required 0.3). You can set this in options/sampler parameters in A1111, or with a node in ComfyUI.
In my opinion, DPM++ SDE and DPM++ 3M SDE with SGM Uniform scheduler typically give nicer results than Euler a on this model, but experiment and see what you prefer.
As always, please post images and feedback so I can see what everyone is up to!
P.S. This will likely be the last version for a while, as I don't think I can squeeze much more out of merges. Version 3 will come, but not until I manage to do more custom training.
Version 2:
No new training this time, but hundreds of hours of merging, testing, and tweaking to squeeze more quality and realism out of the model. Two more SDXL models (bigASP and NightVision) and two more Pony models (CinEro and One-Trick) were introduced to the mix. Version 2 still has the realism, responsiveness and capabilities of the TAME you know and love, but with improved anatomy, clarity, image quality, sampler compatibility, lighting, and artistic capabilities.
This is NOT just a porn checkpoint. Yes, it can do realistic XXX really well, but there is much more it can do, so look at the example images and experiment.
Version 2 Usage:
Same guidelines as version 1, but with a few extra tips:
CFG 3 will give good, realistic, high quality results, but the output will be less vivid than version 1. If you prefer that bright and colourful feel, increase CFG to between 5-7.
If you want to get artistic, higher CFG, even up to 20+, can give interesting results! But increase the number of steps if you start seeing colourful artifacts or other issues.
All the DPM samplers are now working well, IF you choose SGM Uniform or Simple as your scheduler. If you are using an old version of Auto1111 or something where you cannot independently choose the scheduler, they may not work.
If that is the case, or if you want to use one of the non-working schedulers, you can go to settings, sampler parameters, and set sigma min to 0.3 (may not be the optimal value, but works pretty well for me). This should fix Karras and most other schedulers except KL Optimal. BUT remember to set it back to 0 at some point because it will negatively affect the results if you use schedulers that were already working!
Please post interesting results so that others and I can see what you have been up to with the model, and get some new ideas of what it is capable of!
Version 1:
This model is all about maximum realism and sexiness. It aims to achieve a new level of realism for Pony models. While there are a lot of amazing Pony realistic models out there, most of them suffer from "Ponyness": their Pony heritage is immediately clear when you look at faces or anatomy. TAME certainly has its flaws, some of which I hope to remedy in future versions, but from what I can tell the amount of "Ponyness" is very low.
Many creators are jumping on the Flux bandwagon now, which is understandable. It's a great model. But for those of us stuck with older GPUs I don't think it is the best option (if it is an option at all). I've also noticed a degree of "Fluxness" is present in most/all of the fine tunes.
TAME began with a series of checkpoint merges, using block weight merge to combine a set of realistic PonyXL and SDXL models in a way that maintained the prompt adherence and flexibility of Pony. I then trained the resulting model on my own dataset to further improve the realism.
The Authenticity MachinE will not win any awards for creativity but it is damned good at making realistic pictures of women in any state of dress or undress.
Quick start:
Sampler: Euler a
Steps: 20
CFG Scale: 3
Resolution: 912x1280, 1024x1400, 1280x1536
Hires fix: ESRGAN_4x
Upscale by: 1.5
Hires steps: 10
Denoise: 0.3
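As a rough sketch, the quick start values above can be gathered into one settings dictionary, for example as the body of a txt2img request. The key names follow the AUTOMATIC1111 web API as I understand it; treat them as assumptions and check them against your own install. The `scheduler` entry comes from the announcement at the top of this page (SGM Uniform) and only applies to builds that let you choose the scheduler separately. The function name is made up for illustration.

```python
# Quick start settings for TAME version 1, collected in one place.
# Key names are ASSUMED to match the A1111 txt2img API; verify before use.
def quick_start_payload(prompt, width=912, height=1280):
    return {
        "prompt": prompt,
        "negative_prompt": "",        # not needed unless excluding something specific
        "sampler_name": "Euler a",
        "scheduler": "SGM Uniform",   # from the announcement, not the quick start list
        "steps": 20,
        "cfg_scale": 3,
        "width": width,
        "height": height,
        "enable_hr": True,            # Hires fix
        "hr_upscaler": "ESRGAN_4x",
        "hr_scale": 1.5,
        "hr_second_pass_steps": 10,
        "denoising_strength": 0.3,
    }


payload = quick_start_payload("catgirl,earrings,cat ears,wearing dark fur coat,warm lighting")
for key, value in payload.items():
    print(f"{key}: {value}")
```

Nothing is sent anywhere; the dictionary is only a convenient way to keep the settings together and reuse them across prompts.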
Usage guide:
Score tags: You do not need score tags with TAME. Putting them in may not hurt but it likely won't help either, so why waste the prompt space?
Quality words: You do not need quality words (8k, masterpiece, best quality, etc) with TAME. They are a waste of prompt space.
Negative prompts: You do not need negative prompts with TAME unless there are specific things you want to exclude.
Positive prompts: Prompt length doesn't matter too much, but keep it simple. Words and phrases with commas in between. The model understands Pony style prompts, but does not do well with natural language prompts. TAME usually responds well to gentle prompting (take a look at the example image prompts), so don't use a lot of emphasis e.g., (large breasts:1.8) unless the model is being stubborn. Start by just telling it what you want, then play with emphasis, rearranging words, and more advanced techniques if you aren't getting the right results.
Don't fill up your prompt with nonsense words. Look at this example I copied from another model's gallery (RealVis XL V5.0):
photograph the little catgirl, cat ears, wearing fur dark coat, 50mm . cinematic 4k epic detailed 4k epic detailed photograph shot on kodak detailed cinematic hbo dark moody, 35mm photo, grainy, vignette, vintage, Kodachrome, Lomography, stained, highly detailed, found footage, (masterpiece), (best quality), (ultra-detailed), the little catgirl, cat ears, wearing fur dark coat, illustration, disheveled hair, detailed eyes, perfect composition, moist skin, intricate details, earrings, cinematic still the little catgirl, cat ears, wearing fur dark coat . emotional, harmonious, vignette, 4k epic detailed, shot on kodak, 35mm photo, sharp focus, high budget, cinemascope, moody, epic, gorgeous, film grain, grainy, the little catgirl, cat ears, wearing fur dark coat, detailed, elegant, highly colorful, warm light, sharp focus, beautiful, intricate, expressive, rich deep colors, cinematic, cute, enhanced quality, creative, positive vibrant romantic atmosphere, depicted, perfect background, professional, thought, iconic, best, thoughtful, pretty, attractive, charming, confident, passionate
I can't speak for RealVis because I didn't create it, but I strongly suspect that much of that prompt is doing nothing. What I can tell you is if you use that prompt with TAME you will not get great results. Here is a quick rewrite:
catgirl,earrings,cat ears,wearing dark fur coat,warm lighting
See how simple that is? From 142 words down to 10, that's more than a 90% reduction. Now, observe the difference in the actual renders using TAME:
Long prompt:
Not terrible, but probably not what the user was looking for.
Short prompt:
See the difference? 90% fewer words and a much better image that is likely also closer to what the user was looking for (romantic atmosphere, elegant, etc). In fact, it looks so good that I decided to use it in the model showcase!
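The word-count arithmetic quoted above is easy to check. This is a minimal sketch: the helper name is made up, the short prompt is counted directly, and the 142-word figure for the long prompt is taken from the text rather than recounted.

```python
import re


def count_words(prompt):
    # Split on commas and whitespace, ignoring empty pieces.
    return len([word for word in re.split(r"[,\s]+", prompt) if word])


short_prompt = "catgirl,earrings,cat ears,wearing dark fur coat,warm lighting"
long_prompt_words = 142  # figure quoted in the text, not recounted here

short_words = count_words(short_prompt)
reduction = 1 - short_words / long_prompt_words
print(short_words, f"{reduction:.0%}")  # prints: 10 93%
```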
Adetailer: I do not recommend using Adetailer with TAME. In my experience it makes things worse. Most of the time you are better off with just Hires fix. If you are having trouble with e.g. a really small face then maybe give it a go, but don't expect miracles. Often changing the settings (sampler, prompt, resolution, steps, denoise, etc) will get you a better solution. For example, full body poses often render best at a narrower resolution (square resolutions often make the faces smaller) and tend to work better with Euler a than with DPM++ SDE. Too few steps can make things blurry while too many can cause artifacts and strange eyes. Higher denoise can fix more issues but can also mess things up.
Resolution: This model was trained on a high quality dataset with a max training resolution of 1536x1536. That means you can often generate at up to 1536x1536 without deforming your subjects. The model also works well at pretty much any width-height combination, even extreme ones.
For example the image below was generated at 2048x408:
And this one was generated at 504x2048:
These are just test images done on the fly, without Hires fix, but as you can see neither of them has any obvious deformities or other major issues. The second image would look significantly better with Hires fix and the extra finger could likely be eliminated by trying a different seed.
Basically you have about 2.4 million pixels to work with, and as long as you don't exceed that by much (width x height) the model works. For example, you can crank the width all the way up to 2048 and set the height to 400, 800, or 1200... but if you go much above 1200 you will need to reduce the width or you will start to see weird things happen.
Even a tiny change in the resolution typically has a large effect on the image (pose, etc). So be creative, try different resolutions!
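The pixel budget described above can be turned into a quick check before you generate. This is only a sketch: the 2.4 million figure comes from the text, while the 5% tolerance (my reading of "don't exceed that by much"), the rounding to multiples of 8, and the function names are assumptions.

```python
PIXEL_BUDGET = 2_400_000  # "about 2.4 million pixels", per the text
TOLERANCE = 0.05          # ASSUMED slack for "don't exceed that by much"


def within_budget(width, height, budget=PIXEL_BUDGET, tolerance=TOLERANCE):
    """Return True if width x height stays within the budget plus tolerance."""
    return width * height <= budget * (1 + tolerance)


def max_height_for(width, budget=PIXEL_BUDGET, multiple=8):
    """Largest height, rounded down to a multiple of 8 (assumed), within the strict budget."""
    return (budget // width) // multiple * multiple


print(within_budget(1536, 1536))  # 2,359,296 pixels
print(within_budget(2048, 1200))  # 2,457,600 pixels, slightly over but inside the tolerance
print(within_budget(2048, 1536))  # 3,145,728 pixels, well over
print(max_height_for(2048))
```

Note that `max_height_for` uses the strict budget, so it lands a little below the 1200 mentioned above for a width of 2048.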
Upscaling: I highly recommend using Hires fix, but make sure you choose a good upscaler. My favourite is 4x_NMKD-Siax_200k, but 4x_foolhardy_Remacri and ESRGAN_4x are also good choices. I generally set it to 1.5x resolution with approximately half the number of steps used for the first pass and a denoise strength of 0.2-0.4 (default for me is 0.3). You can of course upscale further if you desire.
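The rule of thumb above (1.5x scale, roughly half the first-pass steps, denoise around 0.3) could be sketched like this. The function name and the choice to round half-steps down are assumptions; rounding down matches the 25 + 12 pairing used elsewhere in this guide.

```python
def hires_settings(width, height, steps, scale=1.5, denoise=0.3):
    """Derive second-pass settings from first-pass settings."""
    return {
        "target_width": int(width * scale),
        "target_height": int(height * scale),
        "hires_steps": max(1, steps // 2),  # about half, rounded down (assumed)
        "denoise": denoise,                 # the text suggests 0.2-0.4
    }


print(hires_settings(912, 1280, 20))
print(hires_settings(1024, 1400, 25))
```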
Samplers: Euler a is highly recommended as it will give you good results with a range of settings. DPM++ SDE can work well with some images (avoid it for full body poses) but it really has to be dialed in or it will look terrible. Other working samplers include DPM++ 2S a, DPM++ 3M SDE, Euler, DPM2, DDPM, and LCM. Most others do not work properly with this model.
Schedulers: I haven't played around much with different schedulers, so you are on your own with that. I use Forge (an offshoot of Auto 1111) which has limited options for sampler/scheduler combos, but I haven't noticed huge differences in any case.
CFG Scale: Typically 1.5-5 is best (I keep it at 3 most of the time), but you can try going higher if you wish. Worst case you get a bad looking image or two, right?
Steps:
Euler a - 15-30 steps + 8-15 hires steps (for best quality I typically use 25 + 12)
DPM++ SDE - 6 steps + 4 hires steps (for pure speed you can even drop it to 5 steps and turn off hires fix, but don't expect mind blowing quality with this sampler)
DPM++ 2S a / DPM++ 3M SDE / Euler - 12 steps + 6 hires steps (these are decent starting points, but I haven't done in-depth testing with these samplers)
DPM2 / DDPM / LCM - I haven't played with these except to test that they work, so you are on your own
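For convenience, the version 1 step counts listed above can be kept in a small lookup table. The names here are made up for illustration. The Euler a entry uses the "best quality" pair from the text, and samplers with no stated guidance return `None` rather than a guessed value.

```python
# (first-pass steps, hires steps) starting points for TAME version 1, from the list above.
STEP_PRESETS = {
    "Euler a": (25, 12),       # stated range: 15-30 steps + 8-15 hires steps
    "DPM++ SDE": (6, 4),
    "DPM++ 2S a": (12, 6),
    "DPM++ 3M SDE": (12, 6),
    "Euler": (12, 6),
}


def starting_steps(sampler):
    """Return (steps, hires_steps), or None if the guide gives no numbers."""
    return STEP_PRESETS.get(sampler)


for name in ("Euler a", "DPM++ SDE", "LCM"):
    print(name, starting_steps(name))
```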
Notes:
On occasion you might notice a watermark appearing. This is due either to one of the models I merged in or to a few mistakes I made when cropping my own dataset. Either way just change the seed and it should go away.
The model is not great at counting fingers and sometimes creates too many or too few, especially in close-ups. If you have the rest of the image dialed in but can't get the hands right, start generating batches using variation seed at a low strength... hopefully one of them will give you the correct number of fingers without drastically altering the rest of the image.
This model can generate very realistic vulvas, including the inside bits...the closer you are to your subject, the more realistic the vulva will be (see examples in the sample images). To get your subject to show you her innermost parts, use the term "spread pussy". Variations might work, but this is the term the model was trained on. You can also use "pubic hair" or "female pubic hair" in the positive or negative prompt or with an emphasis of less than one (e.g. pubic hair:0.5) to dial in the amount of pubic hair. You can try adding clitoris and urethra to the prompt if the anatomy isn't quite right, especially in close-ups, but I'm not sure how reliably this works. The model should also understand "gaping pussy" or "pussy gape" but again I am not sure how reliably.
The model can also do peeing, squirting (to a limited extent), penetration, masturbation, fingering, dildo, vibrator, anal, titfucking, etc. If you are having trouble getting the girl to e.g. stick a cucumber in her ass, don't fill the prompt with different words and phrases. Use something like this: anal object insertion, anal cucumber. Those two phrases, in that order, should do the trick. The same trick works for bananas, bottles, etc. Usually the phrase "cucumber in anus" will render a cucumber, but the girl will not put it in her ass. This is generally true for many other models too, in my experience. If you are having trouble getting cunnilingus or titfucking to work, try altering the positions of your subjects to something that makes anatomical sense (might take a bit of trial and error, giving directions to multiple subjects in one prompt can be a pain in the ass).
Please use this model with care, given its realistic capabilities. I have used only images of adults in the training dataset, but the model may still be capable of generating inappropriate images due to existing content or merged models. I have thus far not encountered anything inappropriate by accident, and I do not have any intention of testing for it. In any case, please do not post anything inappropriate in the gallery. Furthermore, I am not responsible for any misuse that may occur.
Finally, I would like to acknowledge and thank the creators of these other wonderful models, whose work I built upon:
GODDESS of Realism by Oppkllll
CreaPrompt_Lightning_Hyper-SDXL by jice
Another Pony Realistic Merge by Error666
iCatcher Realistic by iCatcher
LEOSAM's HelloWorld XL by LEOSAM
Pony Diffusion V6 XL by PurpleSmartAI
Better Cum - Pony (Lora) by Topplok2
NightVisionXL by socalguitarist
One-Trick Pony XL by DarkDescent
About this version
Version 2.5 - quick start settings:
DPM++ 3M SDE sampler with SGM Uniform scheduler (NOT Karras unless you use the fix mentioned below!), 25 steps, CFG 5, Hires fix 1.5x with ESRGAN 4x
The idea behind this version is to combine the Pony knowledge and vivid colours that many loved from version 1 with as many of the improvements from version 2 as possible. I had to compromise a little on the anatomy, but it is still significantly better than version 1.
Improvements compared to version 2:
More knowledge of Pony characters
Easier to create men
More diversity (you can prompt for an Asian woman and actually get one!)
Better prompt following
Better sampler/scheduler compatibility (still doesn't work properly with Karras, but it's close: sigma min in settings/sampler parameters now only needs to be 0.1 instead of 0.3)
More vivid
Compromises:
Slightly less accurate vulvas (but much better than version 1)
A bit less realistic overall
It still has a very high useful CFG range, good image quality, good faces at a distance, etc.
Notes:
Works with most samplers if you select SGM Uniform or Simple as your scheduler. Best results with:
Euler a (20+ steps)
DPM++ SDE (8+ steps)
DPM++ 3M SDE (20+ steps)
DPM++ 2M SDE (10+ steps)
DPM++ 2M (15+ steps)
If you want to use Karras (this will work with some other schedulers as well), set sigma min to 0.1 (note this is different from version 2, which required 0.3). Go to settings/sampler parameters in Auto1111 or use a node in ComfyUI. However, it will negatively affect working schedulers, so don't forget to set it back to 0 again!
Resolution: 832x1216, 1024x1400, 1280x1400, 768x1400, 1280x1280... these are my go-to standard resolutions, but use whatever you like, it shouldn't matter much.
I highly recommend upscaling (e.g. high res fix) for best quality.
For general use, CFG of 5 gives clean, bright, vivid images. But usually anything from 2-7 looks nice. As with version 2, if you want to get more artistic you can crank it up pretty much as high as you want, even 20+, but you will need to increase the steps.
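The scheduler notes above boil down to a small lookup: SGM Uniform and Simple need no change, while Karras needs a sigma min of 0.3 on version 2 and 0.1 on version 2.5. A minimal sketch, with made-up names; it returns `None` for anything the notes do not give a number for, instead of guessing.

```python
# Sigma min needed to make Karras usable, per TAME version (values from the notes).
KARRAS_SIGMA_MIN = {"2": 0.3, "2.5": 0.1}
NATIVE_SCHEDULERS = {"sgm uniform", "simple"}  # work as-is; keep sigma min at 0


def sigma_min_for(version, scheduler):
    """Return the sigma min to set, or None if the notes give no value."""
    name = scheduler.strip().lower()
    if name in NATIVE_SCHEDULERS:
        return 0.0
    if name == "karras":
        return KARRAS_SIGMA_MIN.get(version)
    return None  # other schedulers: may work with the same fix, but no value is stated


print(sigma_min_for("2.5", "Karras"))
print(sigma_min_for("2", "Karras"))
print(sigma_min_for("2.5", "SGM Uniform"))
```

Remember that a non-zero sigma min hurts the schedulers that already work, so set it back to 0 when you switch back to SGM Uniform or Simple.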