Pony Diffusion V5 is a western cartoon style SD 2.1 768px finetune capable of producing stunning SFW and NSFW visuals of various anthro or feral species, humanoids and their interactions based on simple natural language prompts.
Please join our Discord Server to support development of new versions of this model and get access to free SD bot and check out more examples of this model capabilities on our prompt sharing website or follow the author on Twitter.
Important information
You will need to use either --xformers or --no-half (super slow) to load this model, I am not entirely sure why this is necessary yet.
This model supports a wide array of styles and aesthetics but provides an opinionated default prompt template that allows generation of high quality samples with no negative prompt and otherwise default settings
score_9, just describe what you want, tag1, tag2which can be further refined with negative prompt of
watercolor painting, brush strokesif you prefer "soft shading" style.
You may also specify whether you want no background as by default the model tends to put characters in scenic floral environments.
Other special data selection tags include, 'source_pony', 'source_furry', 'source_cartoon' and 'source_anime' and ratings of 'score_safe', 'score_questionable' and 'score_explicit'.
This model is capable of recognizing many popular and obscure characters and series.
If you are looking specifically for pony style, I recommend using one of the two following templates `anthro/feral pony, rest of the prompt` or `source_pony, rest of the prompt`.
This model is very capable of understanding of natural language so just describing intended result works in most cases, although you can add some tags after the main prompt to boost them.
One side effect of this, is that if you rely only on tags, you may want to add 'solo' as otherwise the prompt may be interpreted as multiple character, i.e.
cute pony, fancy pony, solo (without solo you will get a cute pony and a fancy pony)Using Euler a with 35 steps and resolution of 768px is recommended although model generally can go up to 1024 as long as one of the sides is kept at 768px. Please use Waifu Diffusion VAE.
Special thanks
Iceman for helping to procure necessary training resources
Haru for assistance with captioning efforts
Cookie for technical expertise in training
PSAI Server Subscribers for supporting the project costs
PSAI Server Moderators for being vigilant and managing the community
Technical details
The model has been trained on ~1.3M images aesthetically ranked based on authors personal preferences, with roughly 1:1 ratio between anime/cartoon/furry/pony datasets and 1:1 ratio between safe/questionable/explicit ratings. About 25% of all images has been captioned with high quality detailed captions, which results in very strong natural language capabilities.
All images has been trained with both captions (when available) and tags, artists' names have been removed and source data has been filtered based on our Opt-in/Opt-out program. Any explicit content involving underage characters has been filtered out.
License
This model is licensed under a modified Fair AI Public License 1.0-SD (https://freedevproject.org/faipl-1.0-sd/) license.
The following modifications have been added to Fair AI Public License:
You are not permitted to run inference of this model on websites or applications allowing any form of monetization (paid inference, faster tiers, etc.). This applies to any derivative models or model merges.
If you want to use this model commercially, please reach us at [email protected].
Explicit permission for inference has been granted to CivitAi and Hugging Face.
Description
V5.5 is an updated V5 with same dataset and training length, but improved overall training process resulting in better small details and less blurry artifacts.
FAQ
Comments (24)
Hello,
Error on load
flshattF is not supported because:
device=cpu (supported: {'cuda'})
dtype=torch.float32 (supported: {torch.float16, torch.bfloat16})
tritonflashattF is not supported because:
device=cpu (supported: {'cuda'})
dtype=torch.float32 (supported: {torch.float16, torch.bfloat16})
Operator wasn't built - see python -m xformers.info for more info
triton is not available
cutlassF is not supported because:
device=cpu (supported: {'cuda'})
smallkF is not supported because:
max(query.shape[-1] != value.shape[-1]) > 32
unsupported embed per head: 64
For the record, it does load on ComfyUI just not on A1111. One of the best models I've tested.
hmm, this is unfortunately first time I am seeing this issue, have you tried generic recommendations like using --xformers and updating auto1111 to latest version?
@AstraliteHeart Yeah I run xformers. I'll try another installation of A1111 when I get home. Thanks.
-Edit:
Tried it out on the latest build of A1111 and now I get "TypeError: 'NoneType' object is not callable", clean installation.
getting an error with this one,
As a note both A1111 and ComfyUI start to ship without xformers and the support is dropping, making a model only working with xformers is a mistake on the long run
This "just works" on current Comfy at least in Nvidia GPU mode
So how exactly do you install this? And do you install this WITH the pony diffusion from "suggested Resources". I need file paths please.
I'm gonna assume you're using A111's WebUI. The Pony Diffusion in "Suggested Resources" is just the older version of it, you don't need to get that one.
You would just download the file, and go to wherever you installed the WebUI: \stable-diffusion-webui\, (I'm pretty sure i renamed mines though, so it might not be called the same thing) and then go into the "models" folder, then "Stable-diffusion", and then you would just put the file into there.
TLDR:
"\stable-diffusion-webui\models\Stable-diffusion", put the file in there.
It appears that this model is entirely unmergeable
Update: after using the model I decided that it's too good by itself to merge anyway
it's the curse of Stable Diffusion 2.1
I would say this is the best pony model on the internet.
To those who wants to use control-net,you must choose "prompt first",there is no compatible control-net model for SD2.1768,so both "blance" or "control-net" first will ruin the pic
Please, the author, do it in a future update so that can generate a gradient on the pony's hooves. This is the gradient hooves tag. And the flower in hair tag generates flowers on the background too when it should be an accessory near the ear. It is impossible to limit this in negative prompts
Why is this so ridiculously good at non-pony stuff?!
Byproduct of being good at pony stuff.
quality captions go a long way
dude who made this and all other versions manually tagged and rated every image he used for training so its just a insanely high quality model and v6 is even better
is anyone getting colorful noise when using this with animatediff?
Yes, I'm also getting colorful noise when I ask for "green fur" on a fox and other keywords seem to throw it off a bit. I'm not sure why.
negative strobe lights works a bit
How to use it?I dont konw where to open it
Animatediff and Controlnet don't seem to work at all. Is there a specific motion model that you need to use with Pony? It's not just this model, all pony models don't work for me.
can this be used at 512z768 resolution?
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.

















