CivArchive
    UnCanny (Photorealism Chroma) - v1.3
    NSFW
    Preview 115746134
    Preview 114425545
    Preview 114425544
    Preview 116753427
    Preview 114425787
    Preview 114425797
    Preview 114426208
    Preview 114426209
    Preview 114426216
    Preview 114426210
    Preview 114426215
    Preview 114426213
    Preview 114426205
    Preview 114426207
    Preview 114425805
    Preview 114426204
    Preview 114522850

    This finetune aims to improve reliability in realistic/photo-based styles while preserving Chroma’s broad concept knowledge. The v1.3 flash version has the rank-256 lora (from here) baked in. bf16 and fp8 available --->. GGUFs on HuggingFace.

    Prompting: Simple prompts describing what you want to see in natural language works well (example1, example2). Chroma prompts work well. Examples of captioning style used in training: woman sitting, waterfall, wolf, woman in dessert. Negative prompts don't work at CFG one, but above one negative prompts can be important.

    Example settings (not necessarily optimal) using Comfy's default Chroma workflow:
    Base: Steps:: ~30-40. CFG: ~3.5 (best settings depend on steps/CFG/sampler, etc.).
    Flash (lora rank-128 or 256): Steps: 15-17. CFG 1. (Depends on lora rank, sampler, etc.)
    Sampler: Examples use res_2m, dpmpp_sde, or exp_heun_2_x0_sde. Others are also good.
    Scheduler: I like bong_tangent | beta & beta57 and others are also good

    Support:
    Have too much money? Want to support further training? https://ko-fi.com/dawncreates

    Training Details
    The model was trained locally, using Chroma-HD as the base. Each epoch included images at 3–5 different resolutions, though only a subset of the dataset was used per epoch. Except for the extra resolutions, OneTrainer's default config for 24gb Chroma finetuning was used. The dataset consists almost exclusively of SFW-images of people and landscapes, so to retain Chroma-HD's original conceptual understanding, several layers were merged back at various ratios. All the juice, compositions, subjects, and concepts come from Chroma itself, my model just nudges it towards realism. Honestly, this version is a showcase of how good Chroma is. So get to work on Chroma finetuners - it has so much potential!

    All images were captioned using JoyCaption: https://github.com/fpgaminer/joycaption

    The model was trained using OneTrainer: https://github.com/Nerogar/OneTrainer

    Description

    Tries to fix the main issues people have reported having with v1 and v1.2