Ponydiffusion is an excellent model for 2d content, but it seems rather inconsistent with 3d. This model is designed to more consistently produce photorealistic 3d images of a variety of subjects. Currently, the beta version still produces a more CGI effect as I do not believe I have enough sample images, but hopefully future versions will be more realistic. I would recommend checking the description of each version to see what it does and what its drawbacks are for the time being for more detailed info.
Description
While I am calling this release 2.1, theres a fairly significant number of changes to it. For one, ive shortened the trigger tag to "source_photo" for ease of use (though I could probably just drop it to just "photo" in the future), and used Booru tags instead of sentence captions for the images. Its still capapable of processing sentence style prompts, obviously, though it does seem to like booru tags when specifying backgrounds. This version also doesnt need an excess of negatives like the prior versions, and seems to function best with a CFG scale at 4-5 and the DPM++ 2M Karras sampler at around 15-17 steps. After some additonal testing, I have determined that a CFG around 7 with a Euler A Automatic sampler at 30 steps also yields very good results, though it takes a bit longer on lower end hardware. Depending on the character, you may need to add more negative and positive tags to enforce realism and discourage it from doing a 2d or CGI effect. I also recommend putting
"plastic,plastic skin,overexposure,blurry" in the negatives for most prompts unless you like the shiny skin effect that most realistic AI models seem to give people.
FAQ
Comments (2)
Why do I get the weirdest skin textures with this. It all comes out like white people who have almost frozen to death. All bluish and splotchy...
I think its due to the fact that pony is still trying to make the images anime/cartoon style and giving them light, flatly shaded skin. It seems to be a pretty consistent issue with all the realism LORAs for pony.
I found that using higher sampling counts with Euler A (around 30) and a lower CFG fixes the issue a bit, though it still may be a bit overblown. Alternatively, you can use DPM++ SDE Karras with a CFG around 4 and around 14 steps for quicker images that arent as overblown, but you will need to to fix the faces with something like ADetailer.

















