Deep under a mountain lives a sleeping giant, capable of either helping humanity or causing destruction...
A Colossus arises...
After my SDXL series, it's time for the FLUX series of this project... This time I trained this thing from the ground up. For training I used my own images, which I created with my schnell Flux model DemonFlux/Colossus Project schnell plus my SDXL Colossus Project 12 as refiner.
This Flux checkpoint can produce nearly everything. Colossus is very good at creating extremely realistic pictures, anime and art.
If you like it, feel free to give me some feedback. If you want to support me, you can do so here. I have spent some good money to build a computer that is actually capable of training Flux models. Training and testing also take a lot of time and electricity.
https://ko-fi.com/afroman4peace
Version V12 "Hephaistos"
Publishing this checkpoint makes me happy and sad at the same time. V12 will be the last checkpoint of this series. The main reason is the upcoming EU AI laws. Another reason is the license of Flux.1 DEV itself. Thank you all for the support! I have sunk a lot of time into this project over the last year. Now it's time to move on to a different project.
Anyway, I will end this series on a high note...
V12 is built on V10B "BOB" but has basically the best parts of this series block-merged into this one checkpoint. (It was the result of a new merge method which took about 1.5 hours to merge and used up my entire 128GB of RAM.) I also enhanced the face and skin textures in comparison to V10. The eyes are much more realistic and more "alive" than before.
Test it out yourself. "Thanks" to my slow internet connection I will upload the FP8_UNET first, then the FP8 "all in one" version, and then the FP16_unet and FP16_BEHEMOTH. I will also try to get it converted into int4 and fp4 (wish me luck on that matter).
As always, give me some feedback about V12.
Version V12 "Behemoth" (AIO)
This "all in one" model is the best of my V12 series.. well, and the biggest in size of course :-)
The Behemoth has a custom T5xxl and Clip_l baked into the model. If you prefer quality over quantity, this is the checkpoint for you!
Version V12 FP4/int4
Thanks to Muyang Li from Nunchakutech, who did the quantization of V12: https://huggingface.co/nunchaku-tech and their amazing Nunchaku!
This version is truly mind-blowing, combining quality with speed never seen before.
ATTENTION!
There are two versions: FP4 and int4. FP4 is for Nvidia 50xx graphics cards only, while int4 works with 40xx and below (you need at least a 20xx series graphics card).
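The rule above can be written down as a tiny lookup. This is only a sketch: the function name and the idea of passing the series number (20, 30, 40, 50) are my own illustration, not part of Nunchaku or ComfyUI.

```python
def pick_v12_quant(rtx_series: int) -> str:
    """Map an Nvidia RTX series number (20, 30, 40, 50) to the V12 4-bit variant.

    Hypothetical helper for illustration only.
    """
    if rtx_series == 50:
        return "fp4"   # FP4 is for 50xx cards only
    if rtx_series in (20, 30, 40):
        return "int4"  # int4 covers 40xx and below, down to 20xx
    raise ValueError(f"no V12 4-bit variant listed for series {rtx_series}")


for series in (20, 30, 40, 50):
    print(f"RTX {series}xx -> {pick_v12_quant(series)}")
```

Anything older than the 20xx series raises an error, because no 4-bit version is listed for those cards.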
You can also download both versions directly here: https://huggingface.co/nunchaku-tech/nunchaku-flux.1-dev-colossus
INSTALL GUIDE and WORKFLOW
Here is a quick install guide and WIP workflow.
https://civarchive.com/articles/17313
DETAILED GUIDE for the Workflow
https://civarchive.com/articles/17358
I am still working on my new workflows for Nunchaku, so the following workflow is still very much WIP (work in progress). I will add a detailed article at the weekend.
Version V12 FP16_B_variant
Thanks to a small mistake I made late at night (2 AM), I renamed and uploaded the "wrong" checkpoint. It's a very experimental checkpoint that was never meant to be published. It hasn't been tested much, but it performed really well when I created the showcase. It might be better than the standard version.
It tends to lean more towards Asian faces. That is because I wanted to test mixing in a side project I am still working on. Tell me your experience with this checkpoint :-)
Version V12 AIO FP8
This version is an all-in-one version of V12. This means that all clips are baked into it. It will give the same output as the FP8_unet with my custom clip_l.
Version V12 GGUF Q5_1
This version was a request. It's not bad in quality..
Version V10B "BOB"
This is an alternative version of V10. I created it to improve the FP8 version of V10. In general the FP8 version is more precise and the colors are better. Sadly I haven't had much time recently (RL comes first). That's why this took so long. Let me know if you prefer this version. I do have an FP16 version of "BOB" too. Depending on the feedback I will also consider publishing an int4 version.
WORKFLOW:
Here is the workflow for V12 and V10: https://civarchive.com/articles/17163
Version V10_int4_SVDQ "Nunchaku"
First I want to say thanks to theunlikely https://huggingface.co/theunlikely, who converted the FP16_Unet into int4_SVDQ. Go visit his page and leave a like.
This version is more or less equal to the FP8 version. Even in the normal mode of my workflow this thing is about 2x-3x faster than the regular model. With the "fast mode" of the workflow I can render a 2MP image in around 19 seconds on my 3090 Ti.
What is SVDQ "Nunchaku"?
This new quantization method makes it possible to shrink Flux models (in this case a native FP16 model) from 24GB to about 6.7GB. But that's not all: you can run generations faster than ever before without losing too much quality. Sure, you will see a small difference compared to my 32GB Behemoth, but for that thing you need a lot more VRAM/RAM to even run it.
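To put those two sizes into perspective, here is a quick back-of-the-envelope calculation. It only uses the 24GB and 6.7GB figures quoted above; the variable names are my own.

```python
fp16_size_gb = 24.0  # native FP16 model size quoted above
svdq_size_gb = 6.7   # int4 SVDQ size quoted above

ratio = fp16_size_gb / svdq_size_gb
saved_percent = (1 - svdq_size_gb / fp16_size_gb) * 100

print(f"about {ratio:.1f}x smaller, roughly {saved_percent:.0f}% less disk space")
```

So the quantized file is roughly 3.6 times smaller than the FP16 one.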
For more information visit: https://github.com/mit-han-lab/ComfyUI-nunchaku?tab=readme-ov-file
Installation: Please visit my workflow/install guide: https://civarchive.com/articles/15610
Version V10 "Behemoth" (FP16_AIO)
This version is still experimental. The main focus was to get more realistic results. I also managed to reduce some "Flux lines". This thing is based on Colossus Project V5.0_Behemoth, V9.0 and another project I call "Ouroborus Project".
The FP16 version is very stable. I am also releasing an FP8 version soon. That version is also very good but not as stable.
I'll let you experiment with it, though. Tell me what you think of this version.
Have fun creating :-)
Version V9.0:
Well, I have to explain a lot. First, why is it even V9.0?
I recently moved into a new flat, and because of some errors by the internet provider I had no real internet connection. So while doing the whole moving stuff, I left my computer running. The result was that I created a lot of (mostly broken) checkpoints. I do have some very good V8 versions, though, which I might publish as well.
What changed?
I trained new faces and skin textures into the model by taking basically the best results of V5.0. The model also got feet/legs training for better anatomy. The V5.0 versions sometimes clipped the head and feet. I think that I managed to fix some of those issues.
In addition I trained it with more of my own landscape images. And yes, I did all that while moving into a new flat... I think the overall training time was about 2 weeks of computing time, which isn't exactly cheap (every hour basically costs me around 25 cents in electricity).
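As a rough estimate of what that means, here is the arithmetic written out. It assumes the machine ran around the clock for exactly 14 days at the quoted hourly rate; the currency is whatever the 25 cents above refers to.

```python
HOURS_PER_DAY = 24
training_days = 14    # "about 2 weeks" of computing time (assumed to be exactly 14 days)
cost_per_hour = 0.25  # "around 25 cents" per hour; currency not stated

training_hours = training_days * HOURS_PER_DAY
electricity_cost = training_hours * cost_per_hour

print(f"{training_hours} hours of training, about {electricity_cost:.2f} in electricity")
```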
Anyway, I hope that you like this version. If you want to support me: post some nice images, or maybe even tip me with Buzz or on Ko-fi.
Tell me what you think of it :-)
Version 5.0:
V5.0 is actually based on V4.2 and V4.4 (which will also be released soon). It got additional training on skin details and on anatomy in general, which mostly fixed stuff like hands and nipples. The face details are much better. I also tried to fix some minor Flux lines.
In general this version is more realistic than V4.2 and better with smaller details. Like Version 4.2, this version is also a hybrid de-distilled model. You can use it with basically the same settings as V4.2.
Here is also a new Workflow to play with: https://civarchive.com/articles/11950/workflow-for-colossus-project-flux-50
Tell me what you think of this version compared to 4.2 or V2.1..
Version 4.4 "Research":
I have added this version just for completeness. It's slightly more realistic than V4.2 and is the base of Version 5.0. You can try it if you want. You can also use the workflow for V5.0 and V4.2.
Version 4.2:
This version is basically a further development of Demoncore Flux and Colossus Project Flux. The goal was to get a more stable outcome with better skin textures, better hands and more variety of faces. So I trained it on a hybrid model which is partly Demoncore Flux. I also enhanced the nipples and NSFW a bit. Tell me if you prefer V4.2 over version 2.1 :-)
For the showcase images: I have only used native images at SDXL resolution or 2MP resolution (for example 1216x1632). This model can handle even higher resolutions. I have tested this checkpoint at up to 2500x2500, but I only recommend going to around 2000x2000.
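If you want to compare these resolutions in megapixels, a small helper does the job. The function name is just my own illustration.

```python
def megapixels(width: int, height: int) -> float:
    """Return the pixel count of an image in megapixels (millions of pixels)."""
    return width * height / 1_000_000


for width, height in [(1216, 1632), (2000, 2000), (2500, 2500)]:
    print(f"{width}x{height}: {megapixels(width, height):.2f} MP")
```

The 1216x1632 example lands just under 2MP, the recommended upper limit of 2000x2000 is 4MP, and the tested maximum of 2500x2500 is 6.25MP.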
For the settings I recommend using about 30 steps and a cfg of 2-2.5. I mostly use 2.2 or 2.3 in my workflow. For the showcase I used DPM++ 2M with the Simple scheduler.
I will add more versions soon, but I don't have much time before Christmas.
Settings
I will add a new dedicated Comfy workflow soon. For now you can always download and open the showcase images.
The "All in One" version also works well with Forge.
Basically it works with the same settings as Version 2.1 (see below).
Give it 20-30 steps with a cfg of around 2.2.
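For reference, the V4.2 numbers from this section can be collected in one place. This is just a sketch: the key names are my own and are not ComfyUI or Forge parameter names.

```python
# Recommended V4.2 settings as quoted in the text (key names are illustrative only).
V42_SETTINGS = {
    "steps": 30,            # 20-30 steps work; the showcase used about 30
    "cfg": 2.2,             # regular cfg, recommended range 2-2.5
    "sampler": "DPM++ 2M",  # sampler used for the showcase
    "scheduler": "simple",
}

CFG_RANGE = (2.0, 2.5)
STEP_RANGE = (20, 30)


def in_recommended_range(steps: int, cfg: float) -> bool:
    """Check whether steps and cfg fall inside the ranges recommended above."""
    return STEP_RANGE[0] <= steps <= STEP_RANGE[1] and CFG_RANGE[0] <= cfg <= CFG_RANGE[1]


print(in_recommended_range(V42_SETTINGS["steps"], V42_SETTINGS["cfg"]))
```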
Version 2.1_de-distilled_experimental (MERGE)
This version is completely different and actually works differently from a normal Flux model!
It's an experimental merge between my version 2.0 and a de-distilled version: https://huggingface.co/nyanko7/flux-dev-de-distill. This happened a bit by accident, but the results are mind-blowing: you get extreme detail, and it also follows prompts extremely well. So the next thing I am going to do is to train on the de-distilled model directly. I have already done some test LoRAs with it. This is highly experimental, so please let me know if you find errors which are not listed below. If you have good images, post them. Post the bad ones too, this can help improve things :-). Maybe also try version 2.0 and tell me which type of checkpoint fits you best.
!Attention!
The normal Flux workflow isn't working with this version. YOU NEED to download my workflow for it!
You can also figure something out yourself, but please don't blame me for bad images. Also, this is a highly experimental model... check the downsides below.
UP- and Downsides of this checkpoint:
Well, this checkpoint can create extreme details. This comes at a price: it's slow compared to the normal Flux checkpoints. The upside is that you often don't need an additional upscale anymore. Instead of the Flux guidance this model uses the cfg scale, which also means that it will not work with standard workflows.
You can use negative prompts! This helps to get stuff out of the image you don't want.
Sometimes artifacts can appear. You can solve this with a small and simple upscale (I am working on this). Here is an example. Strangely, this doesn't happen with every seed. UPDATE: This is not an issue with the model itself, more a workflow one. I am working on a fix for it. If this happens you can try setting the first upscale to 1.14 instead of 1.2.


Settings and Workflow V2.1:
Here you can find the workflow for it: https://civarchive.com/articles/8419
Settings: unlike normal Flux, it doesn't need the Flux guidance scale. Use the cfg instead. I mostly use a cfg of 3 for the workflow. Some images may require lower cfg scales.
The most important thing is probably to shut off the Flux guidance scale.
Without the workflow I have tested it with 30 steps and a cfg of 2-3. These are probably also the settings for Forge. Try to experiment here.
I recommend using the word "blurry" in the negatives.
Sampler and scheduler:
You can pick from a range of working samplers:
Euler, Heun, DPM++ 2M, DEIS and DDIM are working great.
I mostly used "simple" as scheduler
If you find better settings tell me.. :-)
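Here is the V2.1 setup from this section summarized as a small sketch. The key names and the helper are my own illustration, not ComfyUI or Forge parameters; the values are the ones quoted above.

```python
# V2.1 settings as described in the text (key names are illustrative only).
V21_SETTINGS = {
    "steps": 30,
    "cfg": 3.0,                  # regular cfg; some images may need lower values
    "use_flux_guidance": False,  # the Flux guidance scale should be shut off
    "negative_prompt": "blurry",
    "scheduler": "simple",
}

WORKING_SAMPLERS = ["Euler", "Heun", "DPM++ 2M", "DEIS", "DDIM"]


def is_listed_sampler(name: str) -> bool:
    """Return True if the sampler is one of those reported as working well."""
    return name.strip().lower() in {s.lower() for s in WORKING_SAMPLERS}


print(is_listed_sampler("euler"))
```

Samplers that are not in the list may still work; they are simply not among the ones tested here.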
For Forge I recommend using the AIO model. Here is an example setting for Forge:

Version 2.0_dev_experimental
Well, this is an experimental version. The goal was to create a more coherent and faster model. I trained in some additional LoRAs of my own and then merged the resulting models in a special way (tensor merge). It has a custom T5xxl which I modified with "Attention Seeker". To gain speed and additional quality I merged in the Hyper Flux LoRA from ByteDance. This shifted the working range of the model. Let me show you what this means. Here is the main title image:
16 steps V 2.0
30 steps V 1.0
Downsides:
Well, first: this version is a bit bigger than the last one. Second, I still have to create the UNET-only version. I will update this when it's done.
Settings and Workflow V2.0:
You can now run the model with fewer steps: 16 steps equal 30 steps on the old model.
I still recommend using around 20-30 steps because it will get you more quality in most cases.
Sampler: I prefer Euler with Simple as scheduler. The guidance can be set from 1.5-3 (feel free to test it outside this range of course). A guidance of 1.8 still works well for realistic images. You can also test out other samplers. DPM++ 2M and Heun also work great.
Workflow 2.0:
I have created a new workflow for V2.0 and V1.0. It has the new Flux Prompt Generator. Additionally I got the second upscaler stage working. https://civarchive.com/articles/7946
Forge:
I have also tested this model with Forge and it worked very well. The images may differ between ComfyUI and Forge, though.
Version 1.0_dev_beta:
This model is my first entry in the series. So please give me some feedback and post some images. This helps me to improve this project further. There are several versions to choose from. The best model regarding quality is the FP16 version. However, the FP16 version is huge and needs a beefy graphics card and lots of RAM. The FP8 version is the one I consider a good compromise between quality and performance. If you want a GGUF version, download the Q8_0. The GGUF Q4_0/4.1 versions were a request. They are small in size but you will lose some quality.
There are basically two types of my models. The "All in one" models only need one file to download: the Clip_l, T5xxl fp8 and the VAE are baked in (look down below). Place this file inside your checkpoints folder.
The other versions are the UNET-ONLY ones. Here you need to load all files separately.
In any case you need to download my Clip_L for those to get them working right.
It is also important to choose the right T5xxl clip. For the FP8 version it is the fp8_e4m3fn T5xxl clip. For the FP16 it is the FP16 clip. Make sure to select the default weight type (down below is an example image for the fp8 version).
For the GGUF version you need the GGUF loader!
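The file requirements above can be sketched as a small lookup. Everything here is illustrative: the function and the labels are my own, and the separate VAE for the UNET-only versions is an assumption based on "load all files separately". GGUF is left out because the text only says that it needs the GGUF loader.

```python
# Which T5xxl text encoder goes with which UNET-only version (from the text above).
T5_FOR_UNET = {
    "fp8": "T5xxl fp8_e4m3fn",
    "fp16": "T5xxl FP16",
}


def files_to_load(variant: str) -> list:
    """List what has to be loaded for 'aio', 'fp8' or 'fp16' (hypothetical helper)."""
    key = variant.lower()
    if key == "aio":
        # Clip_l, T5xxl fp8 and the VAE are baked into the all-in-one file.
        return ["all-in-one checkpoint"]
    if key not in T5_FOR_UNET:
        raise ValueError(f"variant not covered by this sketch: {variant}")
    # UNET-only: load everything separately; the separate VAE is an assumption.
    return [f"{key.upper()} UNET", "custom Clip_L", T5_FOR_UNET[key], "VAE"]


print(files_to_load("fp8"))
```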
Some known things for now regarding V1.0:
This is just the first model of the series, so at the moment it might struggle with some prompts or styles like art. The next version will receive more training. Let me know some things the model can't do.
Settings and Workflow:
I have tested it with around 30 steps, Euler with Simple as scheduler. The guidance can be set from 1.5-3 (feel free to test it outside this range of course)
The guidance of 1.8 works well for realistic images.
Feel free to experiment with those settings.. If you get good results, please post them.
I have added the showcase images as training data. The workflow for Comfy is inside them. Here is the workflow for download: https://civarchive.com/articles/7946
"All in one" model:

UNET_only:
You need to download the clip_L as well. It's the 240MB file.
GGUF: I have added the workflow for GGUF here: https://civarchive.com/articles/7946
Important:
The dev model is not intended for commercial use. For that I will publish the "schnell" model in a different place. The dev model is intended more for personal or scientific usage.
LICENCE:
https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
Credits:
theunlikely https://huggingface.co/theunlikely (thank you again)
Version 2.1/V4.2/5.0: Flux_dev_de-distill from nyanko7
https://huggingface.co/nyanko7/flux-dev-de-distill
From V2.0: Hyper Lora from ByteDance https://huggingface.co/ByteDance/Hyper-SD
Black Forest Labs for their amazing Flux model https://huggingface.co/black-forest-labs
Comments (43)
Do you plan to finetune chroma when is ready?
I am always looking for new challenges.. but well I am taking a break soon.. Maybe after my vacation.
I will upload some other versions soon. This time the FP16 version is the best version. The FP8 version will be uploaded tomorrow.
To be honest, I prefer Unet versions. You already have the clips and VAE on the SSD anyway. And you can also choose which clips you want to use.
Thanks a lot for the fp16 version. And I'd agree, the unet version for ComfyUI users saves a lot of bandwidth and space on the SSD.
@wolfdd87 Hi today I will upload the unet only versions. This will take some time however. My internet connection isn't very fast. So uploading both versions will take around 7-9 hours
@Afroman4peace Thank you! I will try it.
@Daedalus_7 In Forge Unet versions can be used as well. You just have to add the Clips and VAE on the top of the screen.
@wolfdd87 I have published the fp8 and the fp16 unet
@Afroman4peace Thanks a lot! I appreciate the help
Some quick v10 feedback (early testing):
- flux lines, that were present in v9 when generating at higher resolutions, are gone - well done
- v10 seems to have less detail (skin especially) than v9, comparable with v5 but anatomy seems better than v5
-v10 seems to perform better when negative prompt is empty
I would really really like the behemoth version of v9. v9 without flux lines would be so incredible.
Thank you for your hard work on this model line! @Afroman4peace
Hi, thanks for testing it out and for your feedback. I do have more versions. V9 is still amazing but I hated the flux lines. This is unfortunately not fixable with V9.0 itself. I had to rebuild everything from V5.0_Behemoth.
I might bring out an alternate version of V10. The main problem with Flux is finding a sort of balance. If you go for more realism, artifacts can be more present. I have some ideas left. In my opinion this version has some good skin details..
@Greywolf666 I have decided to upload a very interesting checkpoint after I have uploaded the unet versions. It's a checkpoint I actually made just two days ago. I created it while I was a bit high, to be honest. So I called it "BOB". It's very experimental, but the results are very good so far.. stay tuned
@Afroman4peace well, now I’m curious ;)
@Afroman4peace can you write an article about what you have learned about training along the way? I guess it'll be a very interesting read for everybody else trying to train
@Afroman4peace Did more testing. With Euler Beta skin detail is a lot better. Euler SGM Uniform and DPM++2M SGM Uniform produce sometimes nicer skin detail, but anatomy takes a hit. DPM++2M Simple doesn't perform well, skin becomes very plasticy (at least on my machine with different prompt strategies tested). Tried a lot of other Sampler-Scheduler combinations, nothing worked particularly well. Euler Beta and no negative prompt seems the best combo for me with v10 and makes v10 truly amazing. Still curious about "Bob" though ;)
@Greywolf666 Hi I will test this setting with it. I got very good results with DPM++ 2M Simple but I used 30 sampling steps with it. In theory the model can work with and without Flux guidance. In my current workflow I am using no flux guidance at all.. You can download all images if you want... This week I will (try) to upload the new BOB version. This can take some time.. I only have an upload speed of about 1.5 MB/s.
@Afroman4peace I use 30 steps as well, no distilled cfg but 2.2 regular cfg (as in your workflow)
@Greywolf666 Here is a WIP workflow for Colossus Project Flux V10 Also this post shows the difference between V10 and V10B "BOB" This checkpoint is a bit special about the used Ksampler.. Sounds weird but it's true. I thought about your idea using no negative prompt. This also accelerates the workflow. The downside of that workflow is that you can't change the cfg. Here is the image post https://civitai.com/posts/17783365
@Afroman4peace Nice efficient workflow. Seems faster than before. ComfyUI now clearly beats Forge for me by a large margin when it comes to speed on 5090. Btw.: Only the v10 image in the post has a workflow attached, the BOB images don't.
@Greywolf666 Hi, I accidentally hit the hide prompt thingy, now it should work. The workflow is very basic right now. Today I have started to test the svg quant model of V10. Combined with the faster workflow it's awesome. There is a downside of course: with the fast mode you can't change the cfg. In my finished workflow there will be a fast mode option. Disable it and it works again with cfg, just a push of a button :-)
@Afroman4peace now I understand: the new v10 workflow already operates in „fast mode“? It’s quite significant. How did you achieve that? Can’t figure out why it’s so much faster.
After a lot more testing I can say: v10 outperforms both v9 and v5. Anatomy is better than v5, sharpness and Details are on par with v9, but without the flux lines. But: That’s only when you use it with the dedicated workflow by @Afroman4peace (which in its newest version is very speedy). In Webui Forge v10 is not as sharp/detailed as v9, but anatomy remains better than v5. I don’t recommend running it on Webui Forge, comfy workflow will be faster and produce better results. Overall v10 is an amazing model!
👀👋
Hô ! Looks amazing buddy !!
I'm tempted to switch to Flux for the very first time.🥂
Thanks for your kind words. I hope you like the checkpoint. It's a bit demanding on the hardware side for now. I am planning to release some interesting quantizations in the near future. This will make it more hardware friendly.
could you integrate nsfw next
Sadly no. First this checkpoint is meant to be a base model. Second I may stop training on base of Flux dev soon.
There are several reasons for this. First I don't get enough money to even pay my expenses. The licence doesn't allow selling the model. Second the EU AI laws are getting more restrictive soon. I basically have to publish everything I have by August.
From this point I have to follow the new rules. From what I know, all checkpoints I have uploaded up to this point are basically safe for two years. Then they also have to follow the guidelines.
This doesn't mean I will stop training entirely. I will just shift my focus to other, not so realistic models. I also want to start a completely new project with Unreal 5.
@Afroman4peace thanks man , flux will not ever do realistic nsfw except if super heavely trained from scratch , every nsfw flux reduces how original model works ...
is it possible to make an NF4 version of v10
Hi, This is what I want actually. I have to figure out how to do it first. I also want to test it as SVG quantification. Today I will publish the FP8 and FP16 Unet.
I also have an alternative version of V10 laying around. Uploading any version takes a lot of time for me because my internet connection isn't fast at all. So that 32GB version took around 7 hours to upload ;-)
@Afroman4peace did you mean svdq int4? That's my favorite right now. Under 8gb and just as accurate as FP16
@pychobj2001741 yes. I have to look into it. Still I don't know how I can convert it
@pychobj2001741 is it working in Forge?
@Afroman4peace jibmix creator is very helpful w/ doing that QUANT, reach out to him.
gguf version?
@pychobj2001741 the svdq int4 version is now in the making :-). I asked the guy who can create it.
Is there any chance to make a V10 GGUF Q8.0 version ?
Sure, that's also possible. The upload will take some time though. I also have to test it before.
I will probably do a couple of quants in the morning just to help out. Recommend checking out the K_quants, though. They're a bit more optimized. Q6_K is just a hair below Q8_0 in terms of quality but much better on VRAM usage.
Here's the Quants. Took me a few more days to get to it than I meant to. Images on the model page were generated with the GGUF's to show they work.
https://civitai.com/models/1652245
Hoo man! StorageFull, message: "There is not enough space on the disk." I would need to buy another hard drive for this one XD
you can also download the FP8_unet instead :-)
Great model! Poses and camera angles are far more dynamic and backgrounds no longer unnaturally symmetrical 10/10 times! Amazing work! 👏
Do you have plans for HiDream as well? I read somewhere that it's supposed to be far more susceptible to training without being a PITA like Flux. At least in regards to LoRA training I believe.