V4
Alright, so what’s new in this version? I cranked up the aesthetic dial, added more diversity in ages, and improved how it handles Asian features. But - because there’s always a but - I did notice the hands got a little wonkier. Eh, can’t win ‘em all.
I highly recommend pairing this with my LoRAs, like the realism amplifier, 2000s analog core, and others, since this checkpoint works best as a base for stylized LoRAs. Might do one more version (because, let’s be real, I kinda scuffed v3 and v4 a bit), but first, I’m diving into fine-tuning Flex.Alpha.
This time available versions: bf16, fp8, q8_0 - pruned fp16 name and q4_k_m - pruned fp8 name
P.S: Don't use my UltraRealPhoto LoRA with this checkpoint - it has a huge impact on style, so image become overbaked. If you're using the UltraReal Fine-Tune, go with Realism Amplifier instead for the best results. UltraRealPhoto LoRa was created to fix crappy shadows, light and faces, but all that stuff already baked inside checkpoint, you can just add amplifier for better realism
V3 Update (Experimental)
This release marks a step forward, although it’s still very much a work in progress. I focused on improving several key aspects, such as nudes, feet, and lower body anatomy. While the results are better than before, they’re not yet at the level I’m aiming for. That said, this version brings noticeable quality and texture enhancements, offering more detailed and refined outputs compared to the previous versions.
Recommended Settings:
CFG Scale: 3 (instead of 2.5 used in earlier versions)
Steps: 50 (helps with stability, though some minor instability remains in hands and fingers)
CFG 0.9 vs. 1.0: Lower CFG on 0.1 or even 0.2 may sometimes improve some details (may not improve, so feel free to expriment with this too), though it might take longer to generate.
Regarding nudes: they are still not working as intended, but I’m actively working on this issue and expect to address it in the next version.
The good news is that I already have the datasets prepared for V3.5, which I aim to release much faster than the gap between V2 and V3. With more experience and feedback from this version, I’m confident the next update will deliver significant improvements.
As always, I truly appreciate your support and feedback - it’s invaluable as I continue refining this project ❤️
P.S.: I feel like the more I fine-tune Flux, the more it degrades in other areas. Also i thought about trying finetune Flex Alpha (project looks very promising)
What's New in v2.0?
Enhanced Anatomy: Hands, feet, and poses have seen major improvements, offering more natural and accurate results. Say goodbye to overly distorted limbs!
Improved Textures & Quality: Upgraded skin details, richer textures, and sharper results overall. Blurred images still happen occasionally, but much less frequently than in the previous version or when using LoRAs alone.
Improved Text Rendering: Efforts have been made to improve the generation of text in images, and it’s much better than before. However, artifacts can still occur, and strange symbols might sometimes appear instead of readable words. This remains a work in progress.
Expanded Dataset: A larger and more diverse dataset (1800 images) introduces better balance across styles, lighting, and compositions.
Added Checkpoint Variations
To ensure compatibility with different workflows, I’ve included multiple checkpoint variations:
BF16
FP8
Quant 8 (Q8)
Quant 4 (Q4)
NF4
From my testing, I’ve noticed Quant 8 (Q8) offers slightly better quality than FP8, providing finer details while maintaining manageable resource requirements, but other works nice too. Pick the version that works best for your setup
Known Limitations
NSFW Capabilities: Still a weak area in this version. However, a minor fine-tune focusing specifically on NSFW content is already in the works.
Text Rendering: While text generation is better, occasional artifacts like odd symbols or incomplete words may still occur. But noticied usage of t5xxl fp16 instead of fp8 helps a lot with text
Tips for Optimal Results
Sampler: Use DPM++ 2M samplers for smooth and consistent outputs.
Steps: Aim for 30–50 steps to capture finer details without over-processing.
Scheduler: Beta Scheduler remains the best choice for this checkpoint.
Prompting TipsThe best prompting style involves complex prompts with clear, comma-separated phrases. While you can get creative with storytelling prompts, unnecessary descriptions like “this crap added more vintage to her style” won’t improve the results. Keep it concise and descriptive, focusing on essential visual details for the best output.
Future Plans
I’m committed to further developing this fine-tune. The next update will likely focus on:
Expanding NSFW capabilities
Enhancing edge cases like dynamic poses and lighting scenarios
Improving text rendering for sharper, more accurate results
P.S: If you still don't have realistic effect, then try add my ultrareal lora, usually helps me a lot
Ultra-Realistic Flux Fine-Tune v1
This is my first experiment in fine-tuning a checkpoint, built upon the foundations of my UltraReal LoRA and expanded with an extended dataset. The aim? To push realism to the next level, finding that sweet spot between amateur aesthetics and professional, high-quality visuals.
While this is only the first version and I see room for further refinement - the results are good, but not ideal (hands and feet can be broken sometimes, but situation is not critical, still better then defaul flux). This fine-tune isn’t just about amateur-quality outputs; it shines with professional-grade images, offering exceptional detail, lifelike shadows, and lighting. It’s a versatile model designed to unlock a wider range of realistic image generation possibilities.
This is very much a work in progress, and I’m sharing it to gather feedback and see how others use it creatively. If you test it, I’d love to hear your thoughts or see your results!
Also i uploaded both versions: fp16 (in ComfyUI it's better to use with e5m2) and fp8 and Q4_0
🌟 What’s New in This Fine-Tune?
Expanded Dataset: Nearly double the dataset size of the original LoRA, covering a diverse range of styles, lighting, and compositions.
Improved Realism: Sharper details, richer textures, and more natural lighting, bridging the gap between AI-generated and real-world imagery.
Versatility: From casual amateur-style snapshots to cinematic, professional-quality renders, this fine-tune adapts to a variety of creative needs.
Enhanced Anatomy: Better hands, limbs, and more natural poses compared to the base Flux model.
💡 Tips for Best Results
Use DPM++ 2M samplers for smooth and consistent outputs.
Aim for 30–50 steps for finer details without overdoing it.
Select the Beta Scheduler for optimal rendering performance.
⚡ Why Fine-Tune?
This fine-tune was crafted to overcome some of the limitations of the default Flux model. It enhances its ability to handle complex scenes while maintaining consistent quality across a range of prompts. The goal is simple: make ultra-realistic image generation accessible, reliable, and visually stunning, without requiring endless adjustments.
P.S: i plan to train this model more to make ultimate checkpoint with best anatomy and realism. This version is not very good with nsfw (this will be fixed in next version)
P.S.S: so far you can randomly get a low resolution image (dunno what exactly trigger this one, but will search for fixes). But seems like using high-resolution in prompt helps
Description
FAQ
Comments (42)
I tried one of your samples and it came out mostly the same except the face on yours is much prettier, curious what I'm doing differently. are you using the fp16 version or the fp8?
Hi. I use fp8 version with default option in Load Diffusion Model. Dunno, maybe you can provide me examples that u got?
On tensor please. Wow, it's amazing man
How does it work with a persistant Lora caracter ?
I tested with some celebrities LoRA and it works good (but i'm not guarantee that all LoRas will work good. Also pulid is working fine too
because all my images come out backwards, I mean if it's a girl it comes out of swords and like that almost anything, I'm using the fp8 version
You are legendary, before I had to use with my models a lot of different loras for hands and realism, now it is not necessary anymore, keep improving it, then maybe try to make a sd 3.5 large and 3.5 medium version, nice work!
Hi, glad that u liked it=) it's just a matter of money to improve it more (on checkpoint for example i spent 100+ dollars, for LoRa 50+), cause dataset is not a big problem. Also i would like with a pleassure to fine-tune 3.5 (cause model is good, but it sucks so in anatomy)
Please add the GGUF format too, Q4.
I thought about it when release a new version (plan it on next week)
@Danrisi Thank you, it will help users with low equipment specifications to enjoy your work.
@karldonitz28599 Uploaded. Full Model nf4 (6.33 GB) here it is. i choose nf4, cause i didn't see quants option, but this is q4
@Danrisi I have tried the nf4 version and it works very well and has better anatomy than the base model.
@Danrisi Sorry to ask this but from where I can download nf4 version, can't see link or version for that.
@printyparadiseart312 sorry for misleading in this thread. i don't have nf4 version at this moment (i labeled q4_0 with nf4, cause there is no option to label it properly). So i changed label nf4 to fp16
I am curious, what cards do people use for 22Gb+ Flux models? I have a 4060 Ti 16GB VRAM and it's rough ;)
i'm using 3090 =) But i have also fp8 version and also making q4 gguf
What do you mean by rough? ... like it takes too long?
if yes... how long?
because I have an RTX 4070 12gb vram and 32gb ram, so I to wonder if its good for me lol
@Lucii_Flynn speed is ok, but vram is an issue. but that's what quantized models are for
@kunde2 i know that you feel. I used 6600xt before and it was a big pain in ass
me an rx580 user ¯\_(ツ)_/¯
Can you add Q4 K M quants because the NF4 version does not support LoRA?
Hi, thanks for pointing that out! I’ve actually already uploaded the Q4 quant version - it’s just labeled as NF4 because Civit doesn’t currently have an option to mark it specifically as Q4. Feel free to give it a try, and let me know if you run into any issues
I decided to label it as FP16 instead, as it’s less confusing (just search for the GGUF version, and it’s the one that weighs less)
Thanks.
will this be good for my RTX 4070 12gb vram and 32gb ram?
with q4 checkpoint version i had 14.8gb vram consumption. but i think with quant t5 clip will be ok even with 12gb
okay, i remembered that in ComfyUI you can just use Force/Set Clip Device and Force/Set VAE Device to CPU and will have vram consumption 10.3gb and 19 on ram. (i used q4 quant checkpoint and fp8 clip)
specially for you i made workflow for comfyui with 10.5gb vram usage (u can find it on my image with the sign about 10.5gb)
Your work is great, but when I try to make a picture of a woman with a full bust size, it can't be achieved with this model. It can be achieved if using Lora, but if I use Lora it can reduce the quality of the resulting image. Do you have any suggestions for that?
Hi, thanks! 😊 Are you saying the checkpoint doesn’t produce great close-up ass? Yeah, LoRAs generally work best with the checkpoint they were specifically trained on. My LoRA was trained on the default flux.dev, so it performs better with that setup. The checkpoint I trained is sort of an analogue to my LoRA but with slightly better quality - though it’s not perfect yet and still needs more training. Also there are some light nsfw images in dataset, so i think after the next training butts will be much better (boobs too)
@Danrisi The only problem is the size of the breasts and buttocks is too flat and small. Also, I have a bit of trouble making a woman with a voluptuous or hourglass body. But I must commend that your model can produce good anatomy, especially the hands and feet, which are much better than any model I have ever used.
@chatgptkdanstore10561 Yes, I know there are some issues with that, as I didn't originally plan the model as for nsfw content at all and in the dataset initially everyone was clothed. Then now I realized that I need to change that
@Danrisi I tried to make a picture of a woman with a voluptuous body but wearing clothes. But the result is like I said earlier.
seems to go both genders, yes? What about ages, 0-99 or is it more like a lora for specific things only?
i didn't have any problems with generating middle-aged people or 90 years old like people
Q8_0 quant?
As u wish =)
BTW, have better quality imo then fp8
@Danrisi Thanks!
Please make SANA version of this model.
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.

















