CivArchive
    SPEED FP8 - e5m2
    NSFW

    ✨ What Makes This Model So Special? ✨

    • 🔄 Perfect fusion of Flux1-dev and Flux1-schnell strengths

    • 💾 Full FP8 version (includes CLIP and VAE)

    • 💻 Runs smoothly on modest 12GB GPUs

    • 🖼️ Premium quality outputs rivaling resource-hungry monsters

    🛠️ How To Get The Most Out Of It:

    📂 ComfyUI/
    ├── 📂 models/
    │   ├── 📂 checkpoints/
    │   │   ├── speedFP8_e5m2.safetensors
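    The placement above can be reproduced with a few lines of Python. This is only a sketch: it runs in a scratch directory with a dummy file standing in for the real download, and `root` is an assumption you should swap for your actual ComfyUI install location.

    ```python
    import shutil
    import tempfile
    from pathlib import Path

    # Scratch directory with a dummy file standing in for the real download --
    # in practice, replace `root` with the folder holding your ComfyUI checkout.
    root = Path(tempfile.mkdtemp())
    ckpt = root / "speedFP8_e5m2.safetensors"
    ckpt.touch()                                   # stand-in for the downloaded file

    dest = root / "ComfyUI" / "models" / "checkpoints"
    dest.mkdir(parents=True, exist_ok=True)        # create the folder tree if missing
    shutil.move(str(ckpt), str(dest / ckpt.name))  # move the checkpoint into place

    print((dest / "speedFP8_e5m2.safetensors").exists())  # True
    ```

    After the file is in place, restart (or refresh) ComfyUI so the Load Checkpoint node picks it up.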

    👏 Special Thanks

    • Big thanks to @Dunc4n1dah0 ! 🙌

    👨‍💻 Developer Information

    For additional tools and updates, check out the OllamaGemini Node: GitHub Repository

    Description

    I found a slight enhancement using e5m2 quantization!

    FAQ

    Comments (85)

    Dunc4n1dah0Aug 8, 2024· 3 reactions
    CivitAI

    Please remake with the e5m2 weight type; it's closer to fp16.

    AbdallahAlswa80
    Author
    Aug 8, 2024· 1 reaction

    Thanks, I will try it as soon as possible.

    cybermiawAug 8, 2024· 1 reaction
    CivitAI

    Exactly what I was looking for. I will test this asap. Thanks.

    amazingbeautyAug 8, 2024· 3 reactions
    CivitAI

    What are e5m2 and e4m3?!

    AbdallahAlswa80
    Author
    Aug 8, 2024· 2 reactions

    Those are types of AI model quantization (they make the model smaller in size while keeping roughly the same quality). Sometimes e5 is better and sometimes e4 is better; it depends on the model, so choose whichever looks better to you. I think e5 is more accurate.

    amazingbeautyAug 8, 2024· 1 reaction

    @AbdallahAlswa80 well explained , thank you

    NanashiAnonAug 8, 2024

    It's a level in Heretic.

    samineimpactnet801Aug 8, 2024· 4 reactions
    CivitAI

    I had to remove some nodes, but it worked almost flawlessly on an Intel i5, a GeForce 3060 with 6GB VRAM, and 32GB RAM.
    Thank you, sir! I offer you my first comment on Civitai.

    AbdallahAlswa80
    Author
    Aug 8, 2024

    That's awesome! Please let us see what you have.

    karnasAug 10, 2024

    This gives me hope, since the first time I tried Flux on my 12 GB card, it froze the whole PC for half an hour.

    samineimpactnet801Aug 10, 2024

    @karnas the very first time, I got a BSOD! But then everything went almost smoothly. Rendering time is decent.

    cathylevermanAug 9, 2024· 1 reaction
    CivitAI

    Awesome! I finally got FluxDev to work.

    CratesifyAug 9, 2024· 1 reaction
    CivitAI

    The new version actually does seem to give me better results too. Thanks for the update

    raidmachine132017712Aug 9, 2024
    CivitAI

    Where does this go, the unet folder or checkpoints? And if it goes in checkpoints and is loaded with a checkpoint loader, how do I change the weight type?

    velantegAug 9, 2024

    Use the workflow from the description; just remove the Gemini node and write the prompt yourself.

    AbdallahAlswa80
    Author
    Aug 9, 2024· 1 reaction

    These models are prepared to be used like any other model; just put them in the checkpoints folder and start creating.

    marhensaAug 9, 2024· 4 reactions
    CivitAI

    E5M2. NICE! Output-wise, it's great. Also, there's no need for a double CLIP, and not having to choose the weight type is a bonus. It works like a charm on the RTX 3060 12GB, taking only about 30 seconds, with sharp and vibrant output. If you want to copy the workflow, search "what the flux is schnell" below in the gallery, click the copy-nodes icon in the info popup, then Ctrl+V in a blank ComfyUI window.

    A niche note: somehow it's better than the original Flux-1 Schnell at producing facial combinations of mixed ethnicities. For example, "woman with mixed ethnicity of Malay Indonesian Singaporean Persians Irish Dutch Norwegians Ukrainian" produces a face that really combines those ethnicities and nationalities, while the original Flux-1 Schnell struggles with this and weirdly produces East Asian-like facial features.

    serikenhikAug 12, 2024

    With your workflow on my RTX 3060 12GB the process takes about 250 seconds; what am I doing wrong?

    marhensaAug 13, 2024

    @serikenhik has it dropped into lowvram mode? See here:

    Tips Avoiding LowVRAM Mode (Workaround for 12GB GPU) - Flux Schnell BNB NF4 - ComfyUI (2024-08-12) : r/StableDiffusion (reddit.com)

    I'm now using Schnell BNB NF4 btw; it's a lot faster.

    content_for_inte4086Aug 9, 2024
    CivitAI

    I don’t understand what e5m2 is, please explain. And what is the generation time?

    AbdallahAlswa80
    Author
    Aug 9, 2024· 1 reaction

    This is one kind of quantization; it gives accurate results.

    @AbdallahAlswa80 thank you, I understand what you're talking about; I wish I knew how much faster this model is.

    AbdallahAlswa80
    Author
    Aug 9, 2024

    @content_for_inte4086 there are some good comments about that; it depends on your GPU device, there are many factors.

    ThisIsAName343Aug 11, 2024

    Exponent 5, Mantissa 2, Sign 1

    kunde2Aug 9, 2024
    CivitAI

    Damn, great job! This is working really well, and thank you for the simple instructions! This model barely but perfectly fits into my 16GB VRAM. 768x1280 takes 8.5s on a 4060 Ti with 4 steps.

    AbdallahAlswa80
    Author
    Aug 9, 2024

    Also tested on 6GB VRAM. Show me your art!

    dani229mk824Aug 10, 2024

    @AbdallahAlswa80 How long does it take to generate after you change the prompt? I mean, I get that the first generation takes a long time to load the model, but is it like that every time?

    pigeliAug 10, 2024
    CivitAI

    Is this a fusion model of dev and schnell fp8, so the license is still non-commercial? Am I right?

    AbdallahAlswa80
    Author
    Aug 10, 2024

    Yeah, I used them as sources, so the license stays the same.

    tazztoneAug 10, 2024
    CivitAI

    So the t5xxl is also fp8? Baked in?
    I think for my 24GB VRAM it's better to use fp8 Flux with fp16 t5xxl.

    AbdallahAlswa80
    Author
    Aug 10, 2024

    It's not noticeable when using LCM as in this model; adding one step gives roughly the quality of t5xxl in fp32.

    tazztoneAug 10, 2024

    @AbdallahAlswa80 OK, I just read on reddit that it's a very bad idea to use fp8 t5xxl; it causes major prompt-adherence loss.
    Check out this comparison:
    prompt: "Miku playing golf with Michael Jordan"

    https://files.catbox.moe/uqsii5.png

    And this is the picture with T5 fp16 and the exact same seed:

    https://files.catbox.moe/olsf64.png
    SOURCE:
    https://www.reddit.com/r/StableDiffusion/comments/1el79h3/comment/lgqqmv8/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

    AbdallahAlswa80
    Author
    Aug 10, 2024

    @tazztone try it yourself; the article didn't detail which Flux version he used. Anyway, as the prompt becomes more complex, the difference between fp8 and fp16 starts to appear, so if you're smart about writing prompts the results will come out the same. Also, the scheduler here is "LCM".

    somerandomguyAug 10, 2024· 1 reaction

    @tazztone even though I'm using your workflow (T5 fp8) with the same seed, it produces Miku and Michael Jordan playing golf just fine. I'm using Flux Schnell FP8 + T5 FP8. Here's the workflow: jpec89.png (2145×1066) (catbox.moe) (it's a PNG; it can be dropped into ComfyUI for you to try). I've heard many times that T5 FP8 is bad, and it should be since it's smaller, but it's not THAT bad.

    tazztoneAug 10, 2024

    @somerandomguy then maybe it was a cherry-picked example :shrug:

    MysticDaedraAug 10, 2024· 1 reaction
    CivitAI

    8GB performance?

    AbdallahAlswa80
    Author
    Aug 10, 2024

    I think it works. How much RAM do you have?

    DrossProwlerAug 16, 2024

    RTX 4060 mobile 8 GB VRAM + 32 GB DDR5 SDRAM took ~142 seconds.

    amazingbeautyAug 10, 2024
    CivitAI

    I tried it. Usually it takes less than 1 minute per step on my CPU with any large model, but with this one a step took over 12 minutes. Would you share your thoughts on what's really wrong? I tried both my own setup and the ComfyUI workflow, and both ran into this very slow 12 minutes per step while sampling. What should I do to figure this out?

    AbdallahAlswa80
    Author
    Aug 10, 2024· 1 reaction

    Unfortunately, with this model ("flux") one step equals 6-10 steps of other models. That's normal, sorry.

    amazingbeautyAug 11, 2024

    @AbdallahAlswa80 thanks, then there's nothing wrong with my setup. They should have mentioned that Flux requires more hardware power, but they didn't. So if I want to use any Flux model, I'll have to get better hardware.

    amazingbeautyAug 11, 2024

    @AbdallahAlswa80 is that true for all Flux models? Should I save my time and not try any other merge?

    AbdallahAlswa80
    Author
    Aug 11, 2024· 1 reaction

    @amazingbeauty I think it would help if it were converted to ONNX or another format, but that's not supported yet.

    singultusAug 11, 2024
    CivitAI

    The file name says e4m3; shouldn't that be e5m2? Maybe I don't get it...

    AbdallahAlswa80
    Author
    Aug 11, 2024

    Sorry to hear that. What is your opinion based on?

    singultusAug 12, 2024

    @AbdallahAlswa80 my mistake, sorry, forget about it.

    NowhereManGoAug 28, 2024

    For those who are confused by this: there are two commonly used ways to "compress" a 16-bit float to an 8-bit float:

    https://new.reddit.com/r/FluxAI/comments/1ej3uga/what_is_the_difference_between_fp8_e5m2_and_fp8/
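    To make the e5m2 vs e4m3 trade-off concrete, here is a minimal pure-Python sketch. `quantize_float8` is a hypothetical helper, not from any library; it ignores NaN/Inf, subnormals, and the reserved encodings of the real e4m3fn format, so its exact clipped values differ from hardware fp8.

    ```python
    import math

    def quantize_float8(x: float, exp_bits: int, man_bits: int) -> float:
        """Round x to the nearest value representable with the given exponent
        and mantissa bit budget (normal numbers only; no NaN/Inf handling)."""
        if x == 0.0:
            return 0.0
        m, e = math.frexp(x)                  # x == m * 2**e with 0.5 <= |m| < 1
        scale = 2 ** (man_bits + 1)           # implicit leading bit + fraction bits
        m = round(m * scale) / scale          # round the mantissa to the bit budget
        bias = 2 ** (exp_bits - 1) - 1
        e = max(min(e, bias + 1), 2 - bias)   # clamp the exponent to the format's range
        return math.ldexp(m, e)

    # e4m3 keeps more mantissa precision for small in-range values...
    err_e4m3 = abs(quantize_float8(0.1, 4, 3) - 0.1)   # 0.1015625 -> error ~0.0016
    err_e5m2 = abs(quantize_float8(0.1, 5, 2) - 0.1)   # 0.09375   -> error ~0.0063

    # ...while e5m2 covers a much wider exponent range before clipping.
    big_e5m2 = quantize_float8(40000.0, 5, 2)   # stays near 40000 (40960 here)
    big_e4m3 = quantize_float8(40000.0, 4, 3)   # collapses: the exponent is clamped
    ```

    This is the whole story of the linked thread in two numbers: e4m3 spends its bits on precision, e5m2 on dynamic range, which is why e5m2 tracks fp16 (also 5 exponent bits) more closely.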

    Wilderness_19Aug 11, 2024
    CivitAI

    Does this still go in unet folder or checkpoints now?

    AbdallahAlswa80
    Author
    Aug 11, 2024· 1 reaction

    The checkpoints folder.

    cathylevermanAug 11, 2024
    CivitAI

    This is the best FluxDev model on Civitai right now. It works great in SwarmUI for me.

    If we could only uncensor it, this would defeat not only SD3 but also Pony :D

    AbdallahAlswa80
    Author
    Aug 11, 2024

    It used Schnell as a base!

    pihlawrkr738Aug 16, 2024

    Yeah, right now Flux is cool as a concept or as an alternative to some other things; but if you want NSFW material, cartoons, or good character LoRA functionality, all of that only comes from SD and Pony.

    cathylevermanAug 19, 2024

    I'm having some issues with upscaling, inpainting, and img2img using this (in SwarmUI).
    Let me know if anyone else managed to make it work well. The upscaling distorts more than it improves quality, and the inpainting is hard to control.

    lesjoDec 9, 2024

    @pihlawrkr738 That aged like fine milk. It took a while and will always be a work in progress (as is the case with Pony, it took a while to get third party extensions), but there are a decent number of good options for Flux now, enough that I don't reach back to the older models. And of course it stacks up against SD3, they exfiltrated the training data for SD3 and included it in Flux's base training set.

    pihlawrkr738Dec 10, 2024

    @lesjo Yes, stuff improves over time. Eventually something better than both will come around. Right now I've been preferring Illustrious for most things; flux still is not great for what I personally want to generate.

    No need to be so sour about it; and there is no need to get so attached to any one model. Things are always changing as models and the technology improve.

    lesjoDec 11, 2024

    @pihlawrkr738 Oh I know full well that Flux is just the current new hotness, just as Pony was, and XL, and so on and so on. I just thought it would have aged better if it had said "WHEN we can uncensor it" because it was a clear selling point from the start that Flux is not actually censored, just untrained in certain areas. Then kohya and OneTrainer added Flux LoRA support, and the floodgates started to open.

    GrehgyHilsAug 13, 2024
    CivitAI

    This did not seem to be any faster for me on a 2080 Ti with 64 GB of RAM...

    100%|████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:50<00:00, 12.66s/it]
    Requested to load AutoencodingEngine
    Loading 1 new model
    Prompt executed in 98.21 seconds

    Did I do something wrong in my ComfyUI flow? I put the e5m2 file into my checkpoints directory and loaded it as a checkpoint, using the model's linked workflow.

    AbdallahAlswa80
    Author
    Aug 14, 2024

    Who said it's faster, and which model are you comparing it with? The fp8 format just uses less GPU memory so that low-VRAM GPUs can run it. With your setup I think most of the model loading falls to the CPU, which lowers speed; the time needed grows roughly exponentially with the share offloaded to the CPU.

    akshaydixit007Aug 15, 2024· 5 reactions
    CivitAI

    Amazing speed for e5m2: about 60s per image for the same prompt on my RTX 3060 6GB laptop (i7 12th gen).

    polloloco769Sep 5, 2024

    For me it's 3 minutes with a Ryzen 9, 32GB RAM, and an RTX 3060 Ti 8GB...

    I must have the wrong workflow. Which workflow do you use, with which CLIP loader and VAE, please?

    puma0000Aug 15, 2024
    CivitAI

    Can I use this in Forge?

    AbdallahAlswa80
    Author
    Aug 15, 2024· 1 reaction

    Yes, you can use it as NF4 or fp8 e5m2, dev or schnell, as shown in this video: https://civitai.com/images/24458443

    albihanyAug 17, 2024
    CivitAI

    OK, this seems great on paper. I want to ask before downloading: will this be faster than the normal NF4 on Forge? The other question, which is the whole reason I want FP8: 1) my other FP8 model only goes in the unet folder, which Forge doesn't support; 2) currently most LoRAs only support the FP8 version. Is that the case with this custom model?

    AbdallahAlswa80
    Author
    Aug 17, 2024· 1 reaction

    This model supports LoRAs, and works in ComfyUI and Forge.

    SuzanneAug 28, 2024· 6 reactions
    CivitAI

    Very good model and very fast: only 26 seconds for an 896 × 1152 px image.

    AMD Ryzen 5 5600X 6-Core Processor 3.70 GHz

    Installed RAM 32.0 GB

    NVidia RTX 3070 8G Vram

    polloloco769Sep 5, 2024· 1 reaction

    For me it's 3 minutes with a Ryzen 9, 32GB RAM, and an RTX 3060 Ti 8GB...

    I must have the wrong workflow. Which workflow do you use, with which CLIP loader and VAE, please?

    SuzanneSep 6, 2024· 1 reaction

    @polloloco769 I got this performance with Forge; I found this model slower with ComfyUI, but I'm just starting out with the latter.

    I advise you to try the GGUF Q8 versions; they are fast enough for me:

    https://civitai.com/models/647237/flux1-dev-gguf-q2k-q3ks-q4q41q4ks-q5q51q5ks-q6k-q8?modelVersionId=724149

    Here's a basic workflow for this model :

    https://civitai.com/images/27892896

    Good luck!

    544221Sep 2, 2024· 1 reaction
    CivitAI

    THE BEST!

    eraticmuscleSep 2, 2024· 1 reaction
    CivitAI

    Suggestion: Adding some pointers to run this would have been awesome!

    thagsau621Feb 2, 2025
    CivitAI

    How do I use this model in diffusers' FluxFillPipeline Python code, guys?

    AbdallahAlswa80
    Author
    Feb 2, 2025

    It's normal Flux, not Flux-Fill.

    tuantpa925Feb 3, 2025

    @AbdallahAlswa80 yeah, I mean how to use it in diffusers Python code. Do you have any example code? Anything would be a big help, bro.

    AbdallahAlswa80
    Author
    Feb 3, 2025

    @tuantpa925 it needs to be converted to diffusers; if you prefer diffusers, I'll send you a useful link.

    tuantpa925Feb 4, 2025

    @AbdallahAlswa80 thank you

    AbdallahAlswa80
    Author
    Feb 4, 2025

    @tuantpa925 there's an approach called loading from a single file: load the transformer from the checkpoint file, then build the normal flux-dev pipeline around it.

    import torch
    from diffusers import FluxPipeline, FluxTransformer2DModel

    HF_TOKEN = "hf_***"
    flux_repo = "multimodalart/FLUX.1-dev2pro-full"
    ckpt_path = "https://huggingface.co/Comfy-Org/flux1-dev/blob/main/flux1-dev-fp8.safetensors"

    transformer = FluxTransformer2DModel.from_single_file(ckpt_path, subfolder="transformer", torch_dtype=torch.bfloat16, token=HF_TOKEN)

    pipe = FluxPipeline.from_pretrained(flux_repo, transformer=transformer, torch_dtype=torch.bfloat16, token=HF_TOKEN)

    This works for a single model file, not a full checkpoint; search around to see if there's a way.

    tuantpa925Feb 4, 2025

    @AbdallahAlswa80 so you mean: convert this checkpoint (SPEED FP8) to diffusers, then load it as a transformer (FluxTransformer2DModel) and use the other pieces (tokenizer, text encoder) from Flux.1-dev?

    AbdallahAlswa80
    Author
    Feb 4, 2025

    @tuantpa925 no, you can load the model directly that way, but this is a checkpoint, which contains 3 parts: the transformer (main model), the text encoder, and the VAE. Try it!

    Dunc4n1dah0Feb 28, 2025· 1 reaction
    CivitAI

    Thank you for mentioning me. I really appreciate it :-)

    basharbj915Apr 2, 2025· 1 reaction
    CivitAI

    Very good model

    Checkpoint
    Flux.1 S

    Details

    Downloads
    2,288
    Platform
    CivitAI
    Platform Status
    Available
    Created
    8/8/2024
    Updated
    5/13/2026
    Deleted
    -

    Files

    speedFP8_e5m2.safetensors

    Mirrors

    HuggingFace (1 mirror)
    CivitAI (1 mirror)

    Available On (1 platform)

    Same model published on other platforms. May have additional downloads or version variants.