Please consider buying my two albums Sisters of the Wicked and Danger Ahead, my book Dracula and also please check out my YouTube channel. Civitai won't support creators, so any purchase is a real help. Thanks.
DRACULA https://www.amazon.com/dp/B0FM4DXYPL
Sisters of the Wicked https://amazon.com/music/player/albums/B0DSJY3ZHY?marketplaceId=ATVPDKIKX0DER&musicTerritory=US&ref=dm_sh_CdyeuxIBNBDOjZ1hpB1PPJxT0
Danger Ahead https://www.amazon.com/music/player/albums/B0DB2DV354
Music In The Machine https://www.youtube.com/@musicinthemachine/videos
Lora Creation: https://www.upwork.com/services/product/design-flux-lora-for-creation-of-ai-art-1909718528813429477?ref=project_share
For the De-Distilled Model, Distilled CFG Scale should be set to 0. Then use actual CFG. For me 3.5 to 8 seem good.
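For anyone unsure what "actual CFG" means here: it usually refers to classifier-free guidance, where the sampler takes a prediction made with the prompt and one made without it, then pushes the result toward the prompted one. Below is a minimal sketch of that blend. It assumes short lists of numbers stand in for the model's predictions, and the function name is made up for illustration; it is not the code ForgeUI or Comfy actually run.

```python
def apply_cfg(uncond, cond, cfg_scale):
    """Blend two predictions with classifier-free guidance.

    uncond: prediction without the prompt (toy list of floats)
    cond: prediction with the prompt (toy list of floats)
    cfg_scale: 1.0 returns cond unchanged; higher values push further
               away from uncond in the direction of cond
    """
    return [u + cfg_scale * (c - u) for u, c in zip(uncond, cond)]


# Toy values only, not real model output.
uncond = [0.0, 1.0]
cond = [1.0, 3.0]

low = apply_cfg(uncond, cond, 3.5)   # low end of the range mentioned above
high = apply_cfg(uncond, cond, 8.0)  # high end of the range mentioned above
```

A higher scale follows the prompt more strongly, which is why the usable range is worth testing per model.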
If you like the models, please follow. The page is getting crowded as I keep releasing my model in as many formats and options as I can.
V1.1 Hyper 8-Step is now available and works in ForgeUI as well as Comfy.
Schnell V1 is much faster and gets good results with 1-4 steps. Quality is not quite on the level of the Dev model, but so far text seems more consistently good.
A new GGUF version of the Schnell model is up now. Thanks to pkmngotrnr for converting them for me. Much appreciated. So far it works in Comfy only, at least for me.
DEVfp8V1.1GGUF is now up as well.
The V1 version has European Flux mixed in at what seemed like a pretty low percentage, but it turns out to be a bit too strong. If you like this look, feel free to use this one.
This is my initial FLUX model. It is based on the flux1-dev-fp8 model. This is a merge only in the sense that I merged in some Loras that I like to make it more realistic. As with all of my models, the goal is the best photorealism that I can achieve. If it does other things well, that is a bonus. Until I can learn a lot more about FLUX and see whether true merges are possible at some point, this is a pretty good start. I don't know about compatibility with all software. I use ForgeUI and I am able to run it fine. Most gens take about a minute and a half for me at 1024x1328. I use Euler with the Beta or Simple scheduler. I haven't experimented much yet with other samplers. I typically use 24 to 28 steps.
Description
This is the Schnell version. It gets good results with 4 steps if you want speed. I believe more steps still yield better results, but not hugely better. The Euler sampler with the Simple, Beta and SGM Uniform schedulers seems to work best. I tend to run at 1024x1280 or 1024x1328, but I think lower resolutions likely work fine as well if you have a lower-end PC.
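The Schnell settings above can be written down as a small preset, which is handy if you script your generations. This is only a sketch: the class name and fields are hypothetical and do not match any ForgeUI or Comfy API, the default scheduler is an arbitrary pick from the three listed, and the lower resolution in the second example is an assumed value, not one tested in the description.

```python
from dataclasses import dataclass

# Schedulers named in the description; exact labels in your UI may differ.
SCHNELL_SCHEDULERS = ("Simple", "Beta", "SGM Uniform")


@dataclass(frozen=True)
class SchnellSettings:
    """Hypothetical container for the Schnell settings described above."""
    sampler: str = "Euler"
    scheduler: str = "Simple"
    steps: int = 4
    width: int = 1024
    height: int = 1328

    def __post_init__(self):
        # Reject schedulers the description does not mention.
        if self.scheduler not in SCHNELL_SCHEDULERS:
            raise ValueError(f"scheduler not listed above: {self.scheduler}")
        if self.steps < 1:
            raise ValueError("steps must be at least 1")


# Defaults follow the description: Euler, 4 steps, 1024x1328.
fast = SchnellSettings()

# Assumed lower resolution for a lower-end PC (illustrative values only).
low_end = SchnellSettings(scheduler="Beta", width=768, height=960)
```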
Comments (19)
The new Schnell version uses the same merged Loras as the Dev version except for the European Flux. It gets good results with very few steps. The lowest I have tried myself is 2 steps and that was pretty good. I have done several others in the original sample gallery varying from 4 to 20. I do think a few more steps get better results but not by a huge amount. In a few of my tests it seemed it may not play as nicely with Loras, but I didn't try very many. In all of my test shots I didn't have even one instance where it produced wrong text. Admittedly it was not a huge sample size, but so far it seems better at text than the Dev version. You all can throw whatever you want at it and see how it does.
As someone who actually prefers the Schnell model to Dev, I look forward to each new "true" Schnell release (rather than the Dev-Schnell merges). This one seems excellent so far in my testing.
I actually like to use 2 steps with Schnell and sometimes even 1 because you can get some crazy creative results that still adhere to the prompt. Then I do an upscale and refine to clean up the pixelation and minor body horror and add better detail.
Some Schnell models don't do well with any attempt to make them more photorealistic but this one does very well, if you look at my first images with it. It'll take a little trial and error to figure out which settings it likes best but I barely had to change anything in my default workflow to achieve near-photorealism.
Well done @Seeker70 !
P.S. So far it seems to work best with a Euler/Beta combo.
Yeah, on both Dev and Schnell so far I have found Beta to be my go-to. I haven't used Schnell a lot yet. Since my primary goal is photorealism, I have focused on Dev more, but Schnell isn't awful, at least with the realism Lora and a couple of others merged in. My first attempt at adding in my own character Loras didn't come out too well, but I haven't messed with any strength settings on them or anything. I was just trying to get it all uploaded before my AD&D game started tonight.
@Seeker70 I don't use any Loras for realism. I just do a quick 1-3 step base generation with Schnell and then upscale and/or refine with a SD 1.5 model like LazyMix. I find it gives more control over the final result than with a realism Lora where all you can really do is set the strength.
Have to say, I'm very impressed by this Schnell variant. It's very flexible and has a wide diversity of facial features that it outputs. And you don't just get fish-lip model faces with gigantic boobs (unless that's what you want). You get real looking people that display emotion on their faces, even in subtle ways.
It seems to like Euler/Beta the best and results are good with steps as low as 2. If you want it to output text then you'll need at least 3 steps in my testing. Any less and it outputs misspellings or just plain gibberish.
I think a lot of people who merge Loras into models do them at too high a weight. Like they leave everything at 1. I tend to set most of them closer to .3 or .4 depending on the type of Lora. For this one only the realism Lora was set at 1. Same for my Dev version.
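To show why the merge weight matters, here is a rough sketch of what baking a Lora into a model does to a single weight matrix: the Lora's low-rank update is multiplied out, scaled by the weight, and added to the base. It assumes tiny hand-written matrices and made-up function names, and it leaves out the alpha/rank scaling that real Lora files also carry, so treat it as an illustration and not the tool actually used for this model.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def merge_lora(base, lora_up, lora_down, weight):
    """Return base + weight * (lora_up @ lora_down) as a new matrix."""
    delta = matmul(lora_up, lora_down)
    return [[w + weight * d for w, d in zip(base_row, delta_row)]
            for base_row, delta_row in zip(base, delta)]


# Toy 2x2 base weight and a rank-1 Lora (illustrative numbers only).
base = [[1.0, 0.0], [0.0, 1.0]]
lora_up = [[1.0], [2.0]]    # shape 2x1
lora_down = [[0.5, 0.5]]    # shape 1x2

full = merge_lora(base, lora_up, lora_down, 1.0)    # everything left at 1
gentle = merge_lora(base, lora_up, lora_down, 0.3)  # closer to .3
```

At weight 1 the base matrix moves by the full Lora update; at .3 it moves by less than a third of that, which leaves more of the base model's behavior intact when several Loras are stacked.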
As for the text, I didn't try much, if anything, below 4 steps. What surprised me though is that I didn't get even one that had the wrong text in my tests. In that respect it was quite a bit better than my Dev version. Now so far, the only text I have done is the text on the shirt for the sample pics, but on the Dev version I had to try 4 or 5 times to get a really good text output.
I don't have much reason generally to go below 4 steps though. In the tests I did at 2 and 3 steps, they were noticeably lower quality and 4 steps only takes 15 to 20 seconds to generate, so for the most part I will stick to 4 or higher.
@Seeker70 I've never made a model or Lora but I would say, based on my testing, whatever you did to make this one was pretty much perfect.
As for steps, if I wasn't further refining the image then I found 6 steps was enough to give excellent results. When I am refining (which is almost always) then 2 steps worked best without text and 3 steps with text. With a refiner you want to work with the lower quality noisy image because the refiner can inject realistic details into the noise rather than change anything about the base image, if that makes sense.
I'm pretty stunned by the quality of some of these images, especially this one - https://civitai.com/images/26987899. I barely had to refine that and it came out in only 2 steps. It's hard to get that kind of dynamic realistic emotion out of a Dev imo.
DPM_2/Beta also works well. It probably provides a bit more color saturation but loses a little realism. Everything has its tradeoff.
Can these models run on A1111?
Vlad has been adding support for flux gradually on his fork. Per the wiki entry, sounds like it should work with this particular finetune out of box by now (but I haven't tested, waiting for it to stabilise more), albeit without sampler / scheduler selection, controlnet etc. Vanilla a1111, I have yet to see signs going in that direction. High demand, low supply.
I use WebUI Forge. It is a fork of Automatic 1111. The same good old UI but much better performance with SDXL models and native Flux support. You should try it too, if you don't like ComfyUI.
https://github.com/lllyasviel/stable-diffusion-webui-forge/releases
This won't work on 8GB VRAM right?
I doubt it but I can't say for sure. I have 16GB and I can run it fine but I can't do batches of more than 1 or 2.
you can run the gguf Q8_0 version of this model on 8gb vram. if @Seeker70 is fine with it i will post the link to my converted gguf model here and if he/she wants to, you can then also provide the gguf model in your model post here in civitai ;)
Actually, it works. I use a GPU with 8 GB and generate one image at 1024x1328 in 2 minutes with WebUI Forge.
@Akalabeth That's good to know. Thanks for answering that one.
with the schnell checkpoint i can do an image around 30-45 seconds in 4 steps. RTX 4060 8gb
How do I use this in Forge? I get a CLIP error. Please advise.
You are missing some files. Use this guide:
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050