Update 1.5: You fool! That wasn't even my final form! finally cut out the burn effect that hurt some prompts at normal CFG like 7, for what I hope will be once and for all, and reduced the darkness even more. No more random super dark prompts and way fewer super saturated images, while keeping the model's personality intact. Done with MBW with Vodka v4.0 and Centerflex v1.0. Oh yeah, I also have an RTX 3090 now, so expect some fun training from me in the future.
1.5 update quick start info:
<neg-sketch-2> negative embedding highly recommended for realism and 3D style images (among others).
When prompting for paintings, I suggest using
framed, borders, photoas your negative prompt to get fullscreen images and cut out any weird 3D-like people.When prompting for illustrations, I like to use
photo, realisticas my negative prompt.When prompting for realism, I normally use a negative prompt of
<neg-sketch-2>, anime, render, pixar, illustration, sketchat 1.2 weight.Garbage-bin LoHa recommended for some any silliness ;)
Example v1.5 images were generated using latent upscaling from close to ~512x resolution up a few hundred pixels each side at 0.55-0.65 strength w/ DDIM. This was followed up with an ESRGAN model upscale, then converting the image to latents and using ControlNet Tile in a latent to latent stage at 0.2-0.4 strength w/ DDIM.
My personal merge of Stable Diffusion 1.5 custom models using the noise offset to improve contrast and dark images. An inpainting model is provided to make inpainting in the model’s styles and detail easier.
I like to think of this model as being like base SD 1.5 hyped up on energy drinks and ADHD. Maybe some artist names aside (I don't use them ever, anyway), it has a lot of 1.5's general-purpose power, but has a much higher baseline detail and quality and can be very responsive, provided you're using appropriate settings.
This model is meant to be:
Artistic and elegant
Drop-dead easy to work with
Good at making cool characters and landscapes
Not bound or leaning towards any single style
Killer at digital and conventional art in many aesthetics
And above all, fun
It’s not so great at explicit sexual content and anime*, including anime-based embeddings. There’s a million other models for those if that’s what you’re after.
*There is some ability to bring out a neat anime aesthetic when you prompt for 'anime style', which I find to be quite cool to look at, although it can be a bit finicky. If you try to make anime-esque art with this model, do not put 'portrait' in your negative prompt, or use 'close' or 'closeup' in your positive prompt, as those seem to force it into a 3d-like style even if you add more weight on the anime style.
I want to also bring attention to whosawhatsis' verisimilitude, which is great at readily making wallpaper-quality photorealistic stuff.
I also want to shoutout coreco and his seek.art MEGA v2, which was responsible for much of the composition of V1.3-V1.4, and is an excellent update to his Mega model.
Example images were generated in Invoke AI. This means unless you use Invoke AI, you likely won't be able to recreate my images exactly. Just learn from the prompts and modify the weighting in prompts as needed for the UI you use (if you use the A1111 UI, any (plus sign)+ is equal to one set of parentheses).
By downloading, you agree to the creativeml-openrail-m and Dreamlike-art licenses.
Credits (V1.4 / V1.3.5):
Roboetic’s Mix – Roboetic
seek.art MEGA v2 – coreco
RPG V4 – Anashel
HeStyle V1.5 - krstive
Movie Diffusion - Dalle2Pictures
Analog Diffusion and Portrait+ - wavymulder
RealSciFi - AIfriend
Foto-Assisted Diffusion - Dunkindont
fantasy-art-style v1.8 - kasukanra
Vintedois Diffusion - Predogl and piEsposito
noise offset – Nicholas Guttenberg
Description
V1.4.5 without the baked Waifu Diffusion 1.4 VAE. Only download this if you know what you're doing with it.
FAQ
Comments (15)
<neg-sketch-2> links to a differently named file that is a BIN file. am I missing something?
The author didn't rename or convert the embedding after they trained it. It still works fine, though you may want to rename it to something more identifiable. If you use the A1111 UI or a fork of it, the trigger should be whatever you rename the file to, as I recall.
@526christian thank you I assumed that might be the case, and I tried it (in auto1111), but I get an error when trying to use it saying it is corrupt. any way you can upload it to this model as an addition?
@egormly That's odd. I re-downloaded the .bin and booted up A1111 quick to try it in case I needed to let the author know there was something wrong, and it worked fine on my end.
https://i.imgur.com/usq7YZc.jpg
You downloaded just the .bin and put it in your embeddings folder, right? No overlapping filenames?
@526christian I got it to work and realized at the same time I am an idiot. I must have downloaded the file as an html file (wrong click I suppose), thus the corruption, thank you for taking the time to respond to me sorry to waste your time.
@egormly Happens to the best of us, lol. But at least it's working!
This is my favorite model. Responds to prompts about color and painting with enthusiasm; lots of Checkpoints are very much alike, but 526 mix is very distinctly different, produces beautiful and atmospheric. A particularly strong response to prompts referencing the painter Maxfield Parrish. . .
What does the + and ++ mean in certain prompts?
closeup b&w (photo)++ of a (tired)+ (firefighter)+ (filthy and covered in soot)+, sitting and leaning forward, upper body, skin pores, Kodak Tri-X 400tx, exhausted
Negative prompt: <neg-sketch-2>, (illustration, sketch, painting, pixar, render)+
Seed: 1768081972, Steps: 30, Sampler: DDIM, CFG scale: 6
Ah. That's the weighting syntax the UI I use works with. +'s are equal to an extra .1 weight, or a set of parentheses in most SD software, and -'s are .1 less weight. I realize it's caused more confusion than I previously expected, so when I finish this model's rework I'm going to use syntax that's what most are familiar with.
@526christian Thanks, seems to be something specific to your model rather than a stable diffusion prompt :)
Is there a way to remove the pink/purple shade that appears practically every time on cheeks, particulary with illustrations (and very strongly with artists like Frazetta)?
I've tried everything, but no can do...
Definitely a deal breaker for me.
And that's too bad, because for the rest, your merge is stunning! :'(
A simple example :
pos prompt -> "girl, closeup, fantasy landscape, illustration"
neg prompt -> "3D digital, photo, low quality, worst quality"
"seek.art mega" have the same problem, but to a lesser degree.
That's interesting... Could you upload a pic of that somewhere so I can see what you're referring to? I'm in the middle of hardware upgrades and my Linux installation with all my SD stuff got broken somehow, so I can't test at the moment.
Sure. Thanks for your answer. :)
By the way, look at the nose too.
As you can see, I gradually put a stronger negative prompt, but no luck...
https://temp-file.org/AIace9wMwB8IIIl/preview
https://temp-file.org/l0xLYeRvNQN3kLM/preview
https://temp-file.org/FWGbdn0STVYWk1V/preview
https://temp-file.org/NpmBtPGPhu0LaJM/preview
A last one, by Frank Frazetta :
https://temp-file.org/QsaA2gmMsVuAVEt/preview
The color saturation is really high.
@SixtyFourthNinja Thanks. Sorry it took me a bit, I've been having to reinstall stuff on Windows.
I was able to cut the pink blush makeup effect with "rosy cheeks, blush" in the negative prompt with more effect from "rosy cheeks", so you might wanna give that a try and adjust the weighting as needed.
As for the saturation, this model is... a bit overbaked from my overzealous use of add-difference merging, so saturation tends to be a bit elevated from the sort of "burn" or "crisp" effect. You'll probably want to try a CFG of like 5 or 6 -- it varies from prompt-to-prompt.
I do know a good way to fix the model so it looks better at normal CFG and has more balanced saturation. It'll just be tricky to pull off well while maintaining the model's existing personality and breadth. I'll probably get around to it after I finish a project or two (I have an RTX 3090 now as of a day ago, so I can finetune stuff myself now!). But in the meantime, CFG is the way.
Or, if you're feeling adventurous, I have a model at 526christian/526RealForTune on HuggingFace that's pretty much a "cleaned up" version of this mix. I created it to be a realism-focused training base, but it seems to perform okay for your prompt too - https://imgsli.com/MTkyNTU2
@526christian
Happy to hear that you haven't lost interest in your model.
I did try "blush" and a lower cfg before, I just forgot to tell you. :)
"Rosy cheeks", even at 1.5, doesn't really alleviate the prob and gives a completely different style (and composition: the girl often doesn't even appear).
I tried "526RealForTune" with good results. (Of course I feel adventurous, that's the whole point. ;P)
The pink is still there, but more like "normal pink". Still a bit much with a Frazetta style, but I can live with it.
You should put this model on civitai!
Anyway, thanks for your help. I hope to see a newer version soon!
Details
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.
