Consider following my work on Patreon, where the free tier unlocks everything. Models, Workflows, Research, Process.
Watch a deep dive of the training process here
A year ago, I released version 3, and was surprised to see the volume of both support and criticism. I still stand by my belief that we can not only take control of the technology's potential by training on our own imagery, but also that we can bring an empowering version of the post-AI-visual-sphere into realization through publishing tools made by individuals, not just corporations, and that are accessible to anyone with a laptop, not just those with industy credentials or formal education. The democratic nature of free, open-source tools will inherently create a lot of slop, but I feel expanding the reach of any medium is a net positive.
You can support me on Patreon and get everything for free :)
https://www.patreon.com/CalvinHerbst
Aesthetic Properties of the model:
HerbstPhoto_v4_Flux2 produces intensely imperfect images that feel candid and alive. The model creates analog degradation micro-textures that break past the plastic look by introducing filmic softness, emulsion bloom & hailation, optical artifacts - such as lens flares, light leaks, chromatic aberration, barrel distortion - and grain that behaves naturally across exposure levels. Compositions are moody and take form in chiaroscuro light, with dark regions that blanket the frame to create asymmetry and bright slivers that form hotspots to maintain balance. The contrast curve is aggressively low latitude, embracing clipped highlights and crushed shadows, while preserving a high black point to feel true to the celluloid nature of the images the model was trained on.
Version 4 is trained for Flux 2 Dev from @Black Forest Labs because I beleive it’s the best image diffusion model, however it’s a heavy and can take several minutes to generate a single high-res image, so I will also be releasing an updated version for Z-image, Flux 1 Dev, and SDXL in the coming weeks for those who are looking to use less compute or create faster.
Best Practices using the model:
Prompts: Include “HerbstPhoto” in the prompt. Though the Flux 2 Model can handle prompts that are long and complex thanks to its incorporation of the minstral_3_small_fp8 text encoder from @Minstral AI I tuned this LoRA to produce dramatic effects even with simple language writing that does not include style, texture, and lighting tokens.
LoRA strength: 0.4 - 0.75. (0.73 sweet spot) 0.8-1.0 for less prompt adherence and max image texture/degradation.
Resolution: 2048x1152, though the model also produces good results across aspect ratios and sizes up to 2k.
Schedulers and Samplers: I tested every combination of Schedulers and Samplers for Flux 2 (378 total) and can recommend a handful of combinations that I tested on a Pro 6000 WK GPU @ 1024x1024 @ 20 steps that each have different aesthetics and render speeds.
dpmpp_2s_ancestarl + sgm_uniform: Best balance of texture & fidelity. 160 sec. Render
er_sde + ddim_uniform: Good balance of texture & fidelity. 60 sec. render
dpmpp_sde + simple: Softer focus, lower contrast, less artifacts, brighter. 130 sec. Render
dpmpp_3m_sde_gpu + simple: higher contrast, brighter, more chromatic aberrations. 60 sec. render
Ipndm + simple: Higher clarity, less softness, fewer artifacts, cooler. 60 sec. render
dpmpp_sde + ddim_uni: higher saturation, color shifting. 130 sec. Render
Training Process Overview:
I used AI Toolkit from Ostris on an H200 GPU cluster from Runpod to train over 100 versions of the model, all using the same dataset + simple captions. For each run, I changed one parameter to get a clean A/B tests and figure out what actually moves the needle. I’ll share the full research soon :) After lots of testing, I am happy to finally release HerbstPhoto_v4_Flux2.
Coming soon:
HerbstPhoto_v4.1_Flux1Dev
HerbstPhoto_v4.2_ZImage
HerbstPhoto_v4.3_SDXL
HerbstPhoto_v4.4_Flux2_DarkAbyss
HerbstPhoto_v4.5_Flux2_FishEye
HerbstPhoto_v4.6_Qwen_ImageEnhancer
Description
FAQ
Comments (10)
Nice, just played with the Flux2 version and I love it. Works well with edit-tasks as well!
Love to hear more about your training effort, and what settings you ended up with and what learnings you have made :)
Just started training Flux2 LoRA myself, and it's a bit of a hit-and-miss experience at the moment – some turn out really well, while others don't.
Hey there I'm happy you like the model :). I'm going to release a documentation of the research soon. The training depends on what you are trying to capture. The dataset and captioning is just as important as the training parameters. you just need 20-40 images that are extreme cases of what you are trying to capture, and to craption them in a way that does not describe the attribute you want the model to learn. It sounds counter intuitive but, for example, if you want to train on a style of lighting, you would caption the data to describe everything except the lighting. As far as training parameters, I would start with Network Dim (linear rank) 128 and Alpha of 64. Try decreasing the decay by 2-10x. Train at a lot of steps, 5-10k, save every 500 steps and test each epoch against each other. Some models have a really narrow sweet spot and the only way to find is by AB testing the epochs.
@Calvin_Herbst Thanks. Yes, that has been my experience as well, some concepts seem to converge already at 500 steps, while others still struggle to grasp the concept at 3000.
Found any sweet spots for the LR?
Flux2 LR was super sensistive and destroyed images if it was turned up, which was surprising because with most other models I found it was usually good to increase. Not sure why this is but the default 0.0001 seemed to be a good spot.
Any plans for z image?
Yeah, It's tricky thought becauase this style is really about fine textures which is hard to train into Z because of it's distillation, but I have done a few rounds of trainings and tests and will push something out as soon as I feel it's good enough.
@Calvin_Herbst awesome,I hope you can take a look at chroma someday also
@Neon_signs Stll no luck with Z, but I recently released a Flux2-Klien-9B version that is more friendly for consumer hardware! Free on patreon: https://www.patreon.com/posts/herbstphoto-v4-149156503?utm_medium=clipboard_copy&utm_source=copyLink&utm_campaign=postshare_creator&utm_content=join_link
@Calvin_Herbst hey there 👋🏻,that's too bad ,will take a look but, today z image releases the base version,maybe one last try with it?
@Neon_signs yes great idea!!









