Intro
I made it just for fun and as experiment to build a model good for augmenting professional photographs. I am using Nikon camera with bunch of vintage lens. I expect to build an SD model which is able to produce moody, cinematic pictures with nice smooth bokeh and "analog style". Please note, that I don't plan to train this model on any hardcore nsfw. Don't expect / request it from "cinero" models ;) My preference is art, beauty and emotions.
Some tips on Prompting
Few examples:
"[grayscale : [dimmed colors : vibrant color splashes : 16] : 8]" - I call it "temporal trick". What it does is just make your prompt depending on current step. With this prompts SD will use "grayscale" on steps 1..7. SD will use "dimmed colors" on steps 8..15. On further steps SD will use "vibrant color splashes". I believe that there is no strict limits on nesting level. What you can do with it? You can effectively reduce the number of tokens SD process on each step (reduce the length of active prompt). On the first steps there is no sens to specify fine details. You only need to specify the scenery roughly. On the later steps there is no sense to spend tokens on describing the composition and lighting (I suspect). So, in theory, with this trick and big number of steps you can keep your prompt short and have build very rich prompt at the same time.
PS: this prompt above force the SD to draw the scene with very little colors and super vibrant segments (I showed grayscale images where a subject has few vibrant hair curls or clothes parts). Probably, you can reverse this effect by making whole picture colored with some parts made grayscale.[Audrey Hepburn : Milla Jovovich : 16] - you can have fun with smooth transition from one face to another with XYZ plot script in Automatic1111. Also, this particular temporal trick with face / body helps my model to render most realistic and correct anatomy. I suspect you can also implement dynamic LoRa weighting with this trick. If LoRa don't have a trigger word you can just put the LoRa token like [ <lora: ...:0.42> : <lora: ...:0.99> : 16] or you can use multiple levels of nested "trigger words" from different loras.
"shot on %Brand Name% %Lens Mark Name% vintage lens" - if you find the vintage lens names which SD have in its memory, then you have a chance to improve an "analog style" of your picture. I used to use "Carl Zeiss Sonar", "Nokton", "Helios 44-2", but cannot confirm that each particular lens model gives unique effect. If you have you own list of confirmed lens models, then please consider to share it with community in comments to this model [%PICTURE OF LEELOO saying HELP%]
In near future I plan to build a training dataset with many images shot on beautiful vintage lenses to bring old-school photography soul into this model. I will use some unique trigger word for that or will use a "vintage lens" (not sure yet).
use "perfect anatomy", "anatomically correct body", "anatomically correct hand", "perfect hands", "anatomically correct fingers", "perfect limbs anatomy" and similar anatomical phrases to increase the chance to get correct anatomy.
Us words smooth bokeh, swirly bokeh, depth of field, smooth background to increase the separation of main subject and scenery.
Use "turbulent fog", "mist" and "haze" with "mystical lighting" to get nice atmospheric picture with super noticeable depth of scene. Also use "early morning" and "blue hour" phrases if you want to get cold morning vibes.
Use "scary face expression", "surprised expression", "inviting expression", "lustful face", etc to increase the chance to get noticeable emotions on face and visible "body language". It works, but not yet very noticeable.
Priorities of this model
Cinematic photo-realistic pictures of female character (sfw, softcore nsfw)
Natural body, skin texture, [to be improved] environment (dirt, dust, stuff on floor, retro furniture and devices)
Realistic optical / photo effects (smooth swirly bokeh, analog film grain, aberrations [in progress]) of vintage lenses (Carl Zeiss Sonar, Jupiter 37a, Helios 44-2)
[To be improved] Urbex, abandoned, decaying interiors, depressive vibes, dimmed colors, fog, mist, vapor
How it was created
It is based on few merges of Analog Madness, URPM, Cyber Realistic, epiCRealism, ICBINP, Cine Diffusion with coefficients in 0.18..0.35.
It was trained with two datasets of carefully selected art photos with similar features (cinematic mood, atmospheric, charming anatomy, soft core / ero, retro interiors, morning outdoors, etc.). Total number of images in datasets: 600-700.
Trained as LoRa with 20 steps per image using Kohya_SS then merged with coefficient ~0.3 into Merge of mentioned Checkpoints. Better to use with my LoRa with the same name to amplify the effect.
Further improvements
By priority:
[done] Fix / Improve hand and fingers generation
[in progress] Improve gloom, bokeh, chromatic aberrations, spherical aberrations, light leaks and old analog film features
Fix / Improve feet and toes generation
[in progress] Add more urbex, abandoned, vandalized interiors and lost / forgotten outdoor scenery (suggest me good datasets pls ;)
Fine tuning / improvements of eyes and anatomy
Feedback appreciated...
Description
FP16 Pruned version of v1.2 beta fp32
FAQ
Comments (18)
I love how this model brings a new and unique atmosphere and style to the images, it really stands out in the sea of ordinary sameness here. Thanks for sharing with us!
I was afraid it won't make any unique additions. Glad to see these kind words. I hope to add more by training with dataset containing the photos made with vintage lenses.
Your (beta) V1.2 fp16 has directly started into my alltime top7 of all realistic models.
I added a Post with a comparison to my actual most realistic used models.
I am not using / compare to the newer strains of the realisticvision and epic versions, because after rv 3.0 mostly all my tested models are suffering from arm-merging / conjoining and missing fullbody poses.
My Testfactors are - and your model:
Nsfw / sfw respect: good
Fullbody respect : nearly good (min. 80% are fb)
Asian influence in Western Bias: Zero (Good)
Skin Texture / Age resepect: good
Creativity (eg. Mr. Lagerfeld) : good
Arm merging prevent: good
Missing poses plague: nearly Zero (Good)
Sunshine reflection : good
Teeths: good
Eyes: good
Epoche stability: not tested
This Version works really good with my prompts (lowlevel promting (und 75 tokens) .. no Lora, No TI, no hiresfix, no facerestore, ..). Thank you very much for this great work.
Owww I feel realy shy to get so much "Goods". I am glad that my first line of Trained Models gives a good result.
What you feel about fingers? I feel the fingers is still a big trouble...
hi, tbh i am not a fan of prompting out "bad hands, fingers etc." and limit the quality and creativity of a diffusion. If a generation has six fingers and it dont look like a ape hands, i am accepting this as a creation feature ..(may this person has really 6 fingers :-) ) ..
Actually i started with your model. Normally i take 20+ hours of testing and adapting the "sweetspot" of cfg, Steps, prompts positions to the models need. I tested in the last eight month nearly 400 (realistic based) models and only 13 has found the way to the"extensive" tests. Your first release had on my prompt-sets many anatomy problems, and the skin texture didnt blow me up ..
After testing in the last hour your 1.2 beta version more i am getting more and more satisfied. ..
After so many tests i am very tired / fague of the "always the same look and feel" of Faces, Clothes, Hairs, Breasts, Backgrounds, Bias, etc. .. After tweaking some creation parameters i found some very interessting and insane developments in your Model ! WOW .. don´t be shy, your model is a hidden king.
With the creativity forced i am also running now in arm merging / conjoining / posing errors.
Have you merged "realisticVision" higher than 3.0 and / or "epicRealism" higher than newEra ? Those are often "my" culprits for this effect. Epicrealism at most, where the "major portrait" to more flexible poses / fullbody / midbody was effected with many issues.
If you are interested in this test prompt, i can send you a png with the creation datas (discord, email ..).
@joge25 >> ..(may this person has really 6 fingers :-) ) ..
I like this point too...
"How we call animal has two tails, six eyes and 7 legs? Elementary, Watson... Two-tailed 7even-legged six-eyegle" )))
@joge25 As for merges.. I used to merge several models with low factors like 0.192...0.250 (CyberRealistic, CineDiffusion, Opiate) just for injection of disturbance (shift weights back to mainstream before training) and then long... long training with 100+ carefully selected and captioned images (60% captions from WD model in Kohya SS and 40% - my manual phrases).
You can send me png to asm.luden@gmail com
I am always curious what solutions other engineers have to learn from.. exciting.
@homoludens thank you for the contact possibility. i noticed it. may for privacy you can take it out.
Hope i will find a silent hour mid / end this week to make the "package" :-)
Actually i am in the scene testing and i think there is a deeper issue inside.
i added a post with generation data (hope the tos censor will allow), and may you see the problem.
On all tested models this scene / seed is generating a more or less like the instructed scenery, the 1.2 beta is often very creative (it does not (always) make a boring spreaded legs (like other models) with p*ssy in focus) but it runs in anatomy issues.
May this helps to find the (merge?, cliperror?, traindata?, idk.) to get over this problem.
I am using SD Webui 1.5.1 with default settings and xformers 0.20. No optimize params or compatibility params changes on Nvidia 4090s.
It hapens mostly on landscaped generations. 40% - 60% more than on other models.
@joge25 will look into anatomy more. I am preparing for hands training. Should add some anatomical pics into dataset. Thanks for notice about problems.
@homoludens
Thumb Up Smiley ;.) ..
if you compare the seed results with ok-ish model generations, you may see the "missing" / problematic poses. First look:
Landscaped: Laying, sleeping, Relaxing, Horizontal, half Laying, upper body 45 degrees, ..
In Portrait Mode, Standing, upright .. etc. the issues are much more less..
Hope you find here some trainingdatas and i am sure, the result is insane. :-))
@joge25 Noted... THX
Funny thing, this model is also great for animals.
Great Job!
great stuff! hope you'll make a SDXL version, too :)
I tried to make Lora for SDXL... And it seemed that my lora for SDXL didn't make any difference. I decided to train myself first (to become more confident in what I'm doing) on SD1.5 and then go PRO with SDXL. It will be a challenge with my 3060 w/ 12GB of VRAM. So, we need to be patient....
@homoludens good luck! really hope you can make it!
@sandro609 I started to train LoHa for vanila SDXL.. 25 hrs passed and 64 hrs to go... ))
If it will show itself baked good - will make a release for XL. Be patient.
@sandro609
experimental LoCon model released (find it in recommendations)
There was few failed attempts. This one is fairly unstable. But it gives interesting results.
Feel free to try it and review it. Constructive critics are welcome.
@homoludens thank you very much! I'll try it out!
Details
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.



















