Pony Diffusion V5 is a western cartoon style SD 2.1 768px finetune capable of producing stunning SFW and NSFW visuals of various anthro or feral species, humanoids and their interactions based on simple natural language prompts.
Please join our Discord Server to support development of new versions of this model and get access to a free SD bot. You can also check out more examples of this model's capabilities on our prompt sharing website, or follow the author on Twitter.
Important information
You will need to use either --xformers or --no-half (super slow) to load this model; I am not entirely sure yet why this is necessary.
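The cause is not stated above. One possible explanation, given here as an assumption and not a confirmed diagnosis for this model, is that some intermediate values exceed the range of 16-bit floats (maximum 65504), overflow to infinity, and then turn into NaN inside a softmax. The sketch below simulates that mechanism with the Python standard library only; `to_half`, `softmax` and the sample logits are illustrative and not part of any Stable Diffusion tool.

```python
import math
import struct


def to_half(x: float) -> float:
    """Round x to the nearest 16-bit float; out-of-range values become +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack values beyond the half-precision range,
        # so map them to infinity the way floating-point hardware would.
        return math.copysign(math.inf, x)


def softmax(logits):
    """Max-subtracted softmax. A +inf logit makes every output NaN."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]  # inf - inf evaluates to NaN
    total = sum(exps)
    return [e / total for e in exps]


raw_logits = [70000.0, 1.0]  # hypothetical scores, chosen to exceed 65504
full = softmax(raw_logits)  # Python floats stand in for full precision
half = softmax([to_half(v) for v in raw_logits])

print(full)  # finite probabilities
print(half)  # NaN everywhere
```

In this toy case, keeping the values in full precision avoids the overflow. Whether the same mechanism is what --no-half or --xformers works around for this model is not confirmed.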
This model supports a wide array of styles and aesthetics but provides an opinionated default prompt template that allows generation of high quality samples with no negative prompt and otherwise default settings:
`score_9, just describe what you want, tag1, tag2`
which can be further refined with a negative prompt of
`watercolor painting, brush strokes`
if you prefer the "soft shading" style.
You may also specify that you want no background, as by default the model tends to put characters in scenic floral environments.
Other special data selection tags include 'source_pony', 'source_furry', 'source_cartoon' and 'source_anime', and ratings of 'score_safe', 'score_questionable' and 'score_explicit'.
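As a rough illustration, the default template and the source tags can be assembled programmatically. This is a minimal sketch: `build_prompt` and its parameters are hypothetical names introduced here, and placing the source tag right after `score_9` is an assumption about ordering, not a documented requirement.

```python
SOURCE_TAGS = {"source_pony", "source_furry", "source_cartoon", "source_anime"}
SOFT_SHADING_NEGATIVE = "watercolor painting, brush strokes"


def build_prompt(description, tags=(), source=None, soft_shading=False):
    """Return (prompt, negative_prompt) following the score_9 template."""
    if source is not None and source not in SOURCE_TAGS:
        raise ValueError(f"unknown source tag: {source}")
    parts = ["score_9"]  # quality tag goes first
    if source:
        parts.append(source)  # assumed position, see the note above
    parts.append(description)  # natural language description
    parts.extend(tags)  # optional tags appended after the description
    negative = SOFT_SHADING_NEGATIVE if soft_shading else ""
    return ", ".join(parts), negative


prompt, negative = build_prompt(
    "feral pony Princess Luna outdoors",
    tags=["solo"],
    source="source_pony",
    soft_shading=True,
)
print(prompt)
print(negative)
```

With `soft_shading=False` the negative prompt stays empty, which matches the "no negative prompt" default of the template.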
This model is capable of recognizing many popular and obscure characters and series.
If you are looking specifically for pony style, I recommend using one of the two following templates `anthro/feral pony, rest of the prompt` or `source_pony, rest of the prompt`.
This model is very capable of understanding natural language, so just describing the intended result works in most cases, although you can add some tags after the main prompt to boost them.
One side effect of this is that if you rely only on tags, you may want to add 'solo', as otherwise the prompt may be interpreted as multiple characters, i.e.
`cute pony, fancy pony, solo` (without solo you will get a cute pony and a fancy pony).
Using Euler a with 35 steps and a resolution of 768px is recommended, although the model can generally go up to 1024px as long as one of the sides is kept at 768px. Please use the Waifu Diffusion VAE.
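The recommended settings can be captured in a small check. This is only a sketch: `RECOMMENDED` and `is_recommended_size` are hypothetical names, and treating the 768px side as the shorter one (so sizes below 768px are rejected) is an assumption, since smaller sizes are not discussed here.

```python
BASE_SIDE = 768   # side that should stay fixed
MAX_SIDE = 1024   # upper limit for the other side

RECOMMENDED = {
    "sampler": "Euler a",
    "steps": 35,
    "width": BASE_SIDE,
    "height": BASE_SIDE,
}


def is_recommended_size(width: int, height: int) -> bool:
    """True if one side is 768px and the other is between 768px and 1024px."""
    short, long = sorted((width, height))
    return short == BASE_SIDE and long <= MAX_SIDE


for size in [(768, 768), (768, 1024), (1024, 1024), (512, 768)]:
    print(size, is_recommended_size(*size))
```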
Special thanks
Iceman for helping to procure necessary training resources
Haru for assistance with captioning efforts
Cookie for technical expertise in training
PSAI Server Subscribers for supporting the project costs
PSAI Server Moderators for being vigilant and managing the community
Technical details
The model has been trained on ~1.3M images aesthetically ranked based on the author's personal preferences, with a roughly 1:1 ratio between the anime/cartoon/furry/pony datasets and a 1:1 ratio between safe/questionable/explicit ratings. About 25% of all images have been captioned with high quality detailed captions, which results in very strong natural language capabilities.
All images have been trained with both captions (when available) and tags. Artists' names have been removed and the source data has been filtered based on our Opt-in/Opt-out program. Any explicit content involving underage characters has been filtered out.
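To get a feel for the dataset proportions described above, the arithmetic can be worked through as follows. All results are rough estimates derived from the approximate figures in the text ("~1.3M", "roughly 1:1", "about 25%"), not exact dataset counts.

```python
TOTAL_IMAGES = 1_300_000  # "~1.3M images"
SOURCES = ("anime", "cartoon", "furry", "pony")
RATINGS = ("safe", "questionable", "explicit")
CAPTIONED_FRACTION = 0.25  # "about 25%"

# Equal split across sources and, separately, across ratings.
per_source = TOTAL_IMAGES // len(SOURCES)
per_rating = TOTAL_IMAGES // len(RATINGS)
captioned = round(TOTAL_IMAGES * CAPTIONED_FRACTION)

print(f"per source: ~{per_source:,}")   # ~325,000
print(f"per rating: ~{per_rating:,}")   # ~433,333
print(f"captioned:  ~{captioned:,}")    # ~325,000
```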
License
This model is licensed under a modified Fair AI Public License 1.0-SD (https://freedevproject.org/faipl-1.0-sd/).
The following modifications have been added to the Fair AI Public License:
You are not permitted to run inference of this model on websites or applications allowing any form of monetization (paid inference, faster tiers, etc.). This applies to any derivative models or model merges.
If you want to use this model commercially, please reach us at [email protected].
Explicit permission for inference has been granted to CivitAi and Hugging Face.
Comments (20)
It just throws an all-NaNs error without generating anything. The previous version worked very well.
Try with either --xformers or --no-half (super slow); I am not entirely sure why this is necessary.
Going to try this out soon. Can it produce 3d styled characters like clopician?
I tried to make a 3D Twilight Sparkle but it didn't work out so well. With 2d V5 it works very well.
It can do various styles of 3d, but not sure about being able to replicate specific artist style: https://postimg.cc/xqkV2hx5
You might want to train a LoRA for a specific artist's style.
This is borderline non-functional for me. I always use xformers anyway, and I tried no-half too.
I can make images if I'm using no resources other than Pony Diffusion. If I load a Lycoris or LoRA, I get errors. Even with no other resources this checkpoint eats through my VRAM like nothing else. I can't make images as big as I do with other checkpoints.
Finally, I tried to merge this with some other models and my computer completely ground to a halt before my webui crashed.
Something went wrong during the creation of this checkpoint. I hope you can figure out what happened and fix it.
I have not heard about anyone else having such issues, but it would help to know what your hardware is. This model is pretty standard in terms of training, so it is unlikely that anything is different, but 2.1 768px is just a bigger model. Can you load other 2.1 768px models fine?
We had some users on Discord report similar issues with --no-half, but it worked well for them with --xformers, so perhaps try to reinstall xformers and make sure it is enabled?
This model was trained on base SDv2.1-768; old LoRAs trained on 1.4 or 1.5 will not work, and likewise model merges with 1.4 or 1.5 base models will not work.
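The compatibility rule above can be written down as a small lookup. This is a sketch only: the base-model labels and the `compatibility` helper are hypothetical, and treating a resource trained on the same base as compatible is an assumption. Bases that are not mentioned in this thread return `None` instead of a guess.

```python
MODEL_BASE = "sd-2.1-768"                 # base this checkpoint was trained on
KNOWN_INCOMPATIBLE = {"sd-1.4", "sd-1.5"}  # bases named as not working


def compatibility(resource_base: str):
    """Return True or False where the rule is known, otherwise None."""
    base = resource_base.strip().lower()
    if base == MODEL_BASE:
        return True  # assumed: same base, same architecture
    if base in KNOWN_INCOMPATIBLE:
        return False
    return None  # not covered by the statement above


for base in ("SD-1.5", "sd-2.1-768", "sd-2.0"):
    print(base, compatibility(base))
```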
@AstraliteHeart This is all my bad then. I didn't read closely enough and missed that this is SD 2.1.
I have a 3060Ti and a Ryzen 5950X and I haven't really done anything with other 2.1 models so I don't really know what to expect.
I call into question my own sanity that I was using v4 the entire time instead of v5
Are you sure you're using V5? Looking at the metadata on https://civitai.com/images/1312053?period=AllTime&periodMode=published&sort=Newest&view=categories&modelVersionId=101779&modelId=95367&postId=339677, civitai claims it was generated with V4.
Also, your prompt seems kind of weird? I.e., "score_explicit" isn't a score, the score isn't the first token, etc.
Well, let's untangle this mess.
Your images' metadata indicates they were generated using V4 (a 1.5-based model which uses "derpibooru_p_95" in the prompt and "derpibooru_p_low" in the negative prompt) and not V5 (which only needs "score_9" in the prompt).
If I correct your prompt to use V4 quality tags and remove `score_explicit` as it is not a valid tag, i.e.
`outdoors, solo, (derpibooru_p_95:1.1) (princess_luna) pony, beautiful` and `derpibooru_p_low, (lowres worst, low :1.3), bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, , jpeg artifacts, signature, watermark, username, blurry 3d, dark, blurry, mutated, deformed, ugly, bad proportions, glitch, watermark, signature,` I get
If I simplify the prompt to just `outdoors, solo, (derpibooru_p_95:1.1) (princess luna), beautiful` and the negative to `derpibooru_p_low, bad anatomy, text, jpeg artifacts, signature, watermark, username, blurry 3d, blurry, deformed, ugly, bad proportions` (there were a bunch of hand-related tags) and switch to Euler A with 35 steps (generally works better for PD), I get
If I switch to V5, with a prompt of `(score_9:1.1), feral pony Princess Luna outdoors`, no negative prompt, and Euler A at 35 steps and 768x768 (the model reacts best to this size), I get
or with `watercolor painting, brush strokes` in the negative prompt to bring it closer to the typical V4 look.
So in retrospect, you used the wrong model with the wrong prompt.
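The difference between the two versions can be summarised in a short sketch. `QUALITY_TAGS` and `with_quality_tags` are hypothetical names introduced for illustration, and putting the quality tag at the very start of the prompt is an assumption for V4 (the V4 prompts quoted in this thread place it later).

```python
QUALITY_TAGS = {
    "v4": {"prompt": "derpibooru_p_95", "negative": "derpibooru_p_low"},
    "v5": {"prompt": "score_9", "negative": ""},
}


def with_quality_tags(version, prompt, negative=""):
    """Add the quality tags that match the given model version."""
    try:
        tags = QUALITY_TAGS[version.lower()]
    except KeyError:
        raise ValueError(f"unsupported version: {version!r}") from None
    full_prompt = f"{tags['prompt']}, {prompt}"
    # Skip empty pieces so V5 keeps an empty negative prompt by default.
    negative_parts = [p for p in (tags["negative"], negative) if p]
    return full_prompt, ", ".join(negative_parts)


print(with_quality_tags("v5", "feral pony Princess Luna outdoors"))
print(with_quality_tags("v4", "princess luna, outdoors, solo", "blurry"))
```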
yes I am indeed retarded
the breakdown for v5 and v4 is appreciated
still waiting for an answer to the question in my review thread
Sorry, several noob questions.
What does the score_9 tag do?
What does --xformers do? I know it enables the aforementioned xformers, but what are they?
Also, how do you sign up on purplesmart.ai?
1. When the model was trained, all images that went into it were tagged with various score_X tags based on how 'good' the image looks, so score_9 allows you to make images nicer by steering towards the images with a score of 9. This sometimes tends to generate paint-like images, but you can fix that with a negative prompt.
2. --xformers enables the xformers module, which generally makes things faster and is required for some models, i.e. 2.1-based ones. How you enable it depends on the app you use, but Auto1111 has instructions for xformers on its install page.
3. http://discord.gg/94KqBcE (you sign up via discord)
@AstraliteHeart thanks! And the last question, will xformers work with sdp-optimisation for RTX4090?
I think I did something wrong? I'm getting images that are just a black square