SDXL-ProteusSigma Training with ZTSNR and NovelAI V3 Improvements
- [x] 10k dataset proof of concept (completed)[link](https://huggingface.co/dataautogpt3/ProteusSigma)
- [x] 500k+ dataset finetune (completed) [Note: not aesthetically tuned whatsoever]
- [ ] 12M million dataset finetune (planned)
Proteus and Mobius datasets
CUSTOM INFERENCE IS REQUIRED FOR BEST RESULTS!
https://github.com/DataCTE/SDXL-Training-Improvements/tree/main/Comfyui-zsnrnode
use this comfyui custom node from the training repo.
and the workflow here: https://github.com/DataCTE/SDXL-Training-Improvements/blob/main/Comfyui-zsnrnode/ztsnr%2Bv-pred.json
Model Details
Model Type: SDXL Fine-tuned with ZTSNR and NovelAI V3 Improvements
Base Model: stabilityai/stable-diffusion-xl-base-1.0
Training Dataset: 500,000 high-quality images
License: Apache 2.0
Key Features
Zero Terminal SNR (ZTSNR) implementation
Increased σ_max ≈ 20000.0 (NovelAI research)
Training Details
Training Configuration
Learning Rate: 4e-7
Batch Size: 8
Gradient Accumulation Steps: 8
Optimizer: AdamW
Precision: bfloat16
Epochs: 80
Performance Improvements
47% fewer artifacts at σ < 5.0
Stable composition at σ > 12.4
31% better detail consistency
Improved color accuracy
Enhanced dark tone reproduction
Repository and Resources
GitHub Repository: SDXL-Training-Improvements
Training Code: Available in the repository
Documentation: Implementation Details
Issues and Support: GitHub Issues
Citation
@article{ossa2024improvements,
title={Improvements to SDXL in NovelAI Diffusion V3},
author={Ossa, Juan and Doğan, Eren and Birch, Alex and Johnson, F.},
journal={arXiv preprint arXiv:2409.15997v2},
year={2024}
}Description
FAQ
Comments (8)
Cool stuff man
Okay this is awesome, imagine at 200k or even 12 million 😮💨
No hints on prompting?
honestly not sure yet on that front tbh. testing the 10k on prompt currently but I am fairly sure it will change drastically when scaled up.
its a big question mark until the 200k is done cooking
i don't know what you did in this but the results a 100% random. Some look like partial noise. Some became blurred cartoons. Some are normal others are drawings. It can't seem to make up its mind what style it wants to be in. And any other results are blurred and mangled.
this is a very early concept of model. not surprising
What settings are you using? I only get a swing between cartoon and realism, and when I try to force realism on a cartoon output (by adding realism, photographic, etc) it pushes the cartoon to semi-real.
Aside from that I really like the results of this model so far.
i hope this become the next frontier in realistic models because they have been lagging behind anime ones.
Details
Available On (2 platforms)
Same model published on other platforms. May have additional downloads or version variants.







