CivArchive
    ThinkDiffusionXL - v1.0
    Preview 3062343
    Preview 3050461
    Preview 3050454
    Preview 3050460
    Preview 3050455
    Preview 3050457
    Preview 3050785
    Preview 3051479
    Preview 3050456
    Preview 3050459

    ThinkDiffusionXL (TDXL)

    ThinkDiffusionXL is the result of our goal to build a go-to model capable of amazing photorealism that's also versatile enough to generate high-quality images across a variety of styles and subjects without needing to be a prompting genius.

    You can find it preloaded on ThinkDiffusion.

    Read more about the model, click here

    Please leave a review if you're happy with it, this will encourage us to create more and improve on it.

    The work

    • Data source: TDXL is trained on over 10,000 diverse images that span photorealism, digital art, anime, and more. The smallest resolution in our dataset is 1365x2048, but many images go up to resolutions as high as 4622x6753. In total, our dataset takes up 42GB.

    • Training: With 1.8 million steps, we’ve put in the work. For comparison, Juggernaut is at 600k steps and RealVisXL is at 348k steps

    • Hand-captioned images: Each image is carefully captioned by hand, enhancing the model's ability to generate accurate and high-quality results from minimal prompts.

    • NSFW capabilities: The model includes over 1,000 tastefully curated NSFW images.

    Our thoughts

    • Detail and quality: Most XL models in the Realistic category suffer from poor detail, especially in the background and even in basic features like eyes, teeth, and skin. We believe TDXL outperforms in these areas due to its large, high-quality dataset. For comparison, Juggernaut has about half the image material, and RealVisXL has only 1,700 images. Ultimately, TDXL simply possesses much more "knowledge".

    • Less-Bias: We made sure to use an equal number of images for each style, gender, etc. Other models we tested over the past few months had some kind of bias, sometimes it was bias toward portrait shots, gender bias, certain ethnicities, etc. For instance, Juggernaut has a bias in the Close-Up area, and the Cinematic Light is quite dominant in that model. RealVisXL also has a bias towards Portrait shots. On the other hand, TDXL gives you what you want: Landscape, Midshot, Full Body, Close-Up, Portrait, Sideview, Backview, Action Shots, Cinematic...whatever you want without always being pushed in a certain direction due to a bias.

    • Versatile base: Because of its large balanced quality dataset, TDXL is versatile to serve as a base model for future trainings. You can create new finetunes in entirely different directions, add LoRAs to fill in missing concepts, or do additional trainings with more balanced quality data.

    Description

    FAQ

    Comments (73)

    ostrisOct 20, 2023· 11 reactions
    CivitAI

    You hand captioned 10,000 images?

    I think that's BS myself, unless they have outsourced it, I don't think whoever wrote that realises just how long it would take to hand caption 10,000 images. It would be like a full time job for 2 or 3 people over 3 or 4 weeks.

    FloyoAI
    Author
    Oct 24, 2023· 3 reactions

    Yes. This model has been in the works for quite some time. We knew early on that it was of the utmost importance to focus on having the best dataset we could possibly attain. So we felt it was a worthwhile effort to invest in.

    CCgeOct 21, 2023· 8 reactions
    CivitAI

    so good!

    mmajk75Oct 21, 2023· 7 reactions
    CivitAI

    i love it.
    Thank you for your hard work. The result is evidence of real quality work.

    Lolly_8888Oct 21, 2023· 2 reactions
    CivitAI

    Thank you for your hard work, really appreciated

    CSUGenfanOct 21, 2023· 4 reactions
    CivitAI

    my new go to. dope guys 👊🏾🔥

    hycnmpah712Oct 21, 2023· 1 reaction
    CivitAI

    Looks great. Do you have any recommendations for CFG, Samplers VAE etc? Do you recommend using the refiner for this model? Do you have any recommended comfy workflows?

    hycnmpah712Oct 21, 2023· 2 reactions
    CivitAI

    Looks great. Do you have any recommendations for CFG, Samplers VAE etc? Do you recommend using the refiner for this model? Do you have any recommended comfy workflows?

    tomwinOct 21, 2023· 1 reaction

    I think everything just works with defaults for example
    CFG: 5-10 ( i use 7 on default)

    Sampler: DPM++ 2M Karras

    VAE: Normal Stability VAE

    Refiner: Not needed

    ayahareedyya808Oct 21, 2023· 15 reactions
    CivitAI

    With this excellent model, you can achieve excellent results without the use of lora or refiner.

    kubilayanOct 24, 2023

    Would you still recommend using <lora:sd_xl_offset_example-lora_1.0:0.3> ?

    ayahareedyya808Oct 24, 2023· 1 reaction

    @kubilayan I use only this model and sdxl_vae.safetensors · stabilityai/sdxl-vae at main (huggingface.co) as vae for all of my generated images.

    9ballOct 21, 2023· 4 reactions
    CivitAI

    Any recommended settings?

    josephduffy1969390Oct 22, 2023· 3 reactions

    From the creator on Reddit:

    https://www.reddit.com/r/StableDiffusion/comments/17d17gr/comment/k5up1vp/?utm_source=share&utm_medium=web2x&context=3

    CFG: 5-10 ( I use 7 on default)
    Sampler: DPM++ 2M Karras
    VAE: Normal Stability VAE
    Refiner: Not needed

    rocketguyishereOct 23, 2023· 4 reactions
    CivitAI

    Looks like this is my new fav model!!! Thanks for all the effort guys!

    mpr9348378Oct 24, 2023· 1 reaction
    CivitAI

    Anyone able to make this work with Animatediff?

    CsetiOct 24, 2023

    no because it is an SDXL model. Animatediff doesn't work with SDXL yet as far as I know

    black_jack_5223Oct 24, 2023· 2 reactions

    use hotshotXL for SDXL models

    mpr9348378Oct 24, 2023

    @Cseti @black_jack_5223 I really appreciate your guys helpful comments! I'm a complete newbie to this so I'm sorry for sounding so dumb!

    LouDioxOct 25, 2023· 1 reaction

    @mpr9348378 don't ever apologize for not knowing something. At some point, everyone came through this and asked for help before

    shubhOct 25, 2023· 3 reactions
    CivitAI

    Why does TDXL results are less detailed compare to RealisticStockPhoto_v10 or other fine-tuned models. What's the way to get results with high detailed? Adding "highly detailed" prompt is not very effective...

    studioffan408Oct 25, 2023

    Friend, what models can you suggest to me for photorealism?

    simartem07Oct 26, 2023

    @studioffan408 canon dslr camera

    ComradeMittensOct 29, 2023

    You have to prompt TDXL a bit differently than other SDXL models, go look at the devs example pictures and the prompts they used, thats a good place to start anyway.

    shubhNov 3, 2023

    @ComradeMittens I looked in the devs example. What exactly do you mean by differently? Can you elaborate please?

    ComradeMittensNov 3, 2023· 3 reactions

    @shubh The devs made this model using 10k hand captioned images, a lot of other SDXL models either use less images or they use machine captioning. In the case of machine captioning a tag approach to prompting tends to work better. But this isn't one of those models, so if you want more detail out of your images you should try using short descriptive sentences, and not too many, alongside very few descriptive tags like "short black hair" for the little details (compare the TDXL example prompts to the realvisXL ones to see this clearly). I also find that having just one quality tag like "best quality" in the positive prompt helps, but not as much as in other models. The last thing to keep in mind is that this model is trying to create real, imperfect, looking people, not supermodels, so unless you specify that, you wont be getting the same results as in other realistic models and if you do, you will be removing the detail in the skin especially. I hope this helps you!

    shubhNov 4, 2023

    @ComradeMittens That's super helpful. Thanks!!

    Endless_Oct 25, 2023· 1 reaction
    CivitAI

    You can try the embedding I trained for the sdxl model, which is particularly effective for models with excellent realistic performance.

    Endless_Oct 25, 2023· 1 reaction
    CivitAI

    You can try the embedding I trained for the sdxl model, which is particularly effective for models with excellent realistic performance.

    eglor66Oct 25, 2023· 7 reactions
    CivitAI
    simartem07Oct 26, 2023

    it doesnt matter because generated images will carry the model signatures

    AkOZoOmOct 30, 2023

    @simartem07 wov! interesting ! .. also you say about marked images and detectable after generation ? so ? as even the containing artists names used and referenced ? May be far useful in a way.

    eglor66Oct 31, 2023

    @simartem07 the seaart pics doesn't carry that info

    simartem07Oct 31, 2023· 2 reactions

    @eglor66 the point is, this is an open-source world, all models are variants of each other and nobody knows which rights may have been violated in training process of each model which are based on real-world generations of photo artists and all hand-made digital arts, where all the models are already variants of only few authentic originated unique model. If you give me any image created with any ai-model (including the HASH) , i will convert and embed any data you want it to carry. Generate me any image with MidJourney, i will change the embedded data into Dall-E, or remove all embeddings and put only EXIF info.. This is not something easily preventable in today's conditions.. Unfortunately..

    YuugenMagenOct 26, 2023· 2 reactions
    CivitAI

    Truly amazing. Needless to say I love playing around with this model and never stopped relying on it! Thank you for your hard work!

    abacabbmk3Oct 26, 2023· 2 reactions
    CivitAI

    Mine works until last frame. When it finishes, the color burn. I've tried using a plugin "anti burn" but didn't fix it! I don't know how to fix it

    MidlazOct 27, 2023
    you can't use SD VAE ,after putting mine in none it started to work
    MidlazOct 27, 2023

    and is a 1024x1024 model not a 512x512

    ComradeMittensOct 28, 2023

    what is your cfg value and are you using any loras? also use the SDXL vae if you are not already, you need it for inpainting/img2img with this model anyway.

    ironcloudOct 28, 2023
    CivitAI

    Anyone has issue with inpainting and ADetailer? The masked areas become slightly desaturated for me. 😔

    ivanbonefacicOct 28, 2023

    same thing here, it's very irritating, it's the only problem I have so far

    ivanbonefacicOct 28, 2023
    ComradeMittensOct 28, 2023· 6 reactions

    If you are not already, use the SDXL vae instead of the model's vae (most SDXL models do not have baked vae's for inpainting/img2img)

    ironcloudOct 29, 2023· 1 reaction

    @ComradeMittens Damn~ Thanks a lot! This solves it!

    @ivanbonefacic You can get the sdxl vae here https://huggingface.co/stabilityai/sdxl-vae/tree/main

    bissonfrederic199Oct 30, 2023

    Same for me. And I'm using the SDXL VEA.

    ericmaengNov 2, 2023· 4 reactions
    CivitAI

    Nice work Think diffusion, One of the top notch realistic SDXL model at the moment. Really appreciate your efforts.

    I am Eric, I run a Gen AI startup leveraging SDXL Loras and want to suggest a collaborative research opportunities making 'Midjourney for human portraits'

    Let's jump on a quick call/chat and I would cover detail + opportunities we could make together.

    many credits, reputation and compensation is assured.


    Find me on the contact below:
    - Discord : eric_sdxl
    - Email: [email protected]

    Ash_LoveNov 3, 2023· 1 reaction
    CivitAI

    "A tensor with all NaNs was produced in Unet" is what I get in img2img, no one else has this issue?

    TrafficMeanyNov 19, 2023

    I was here to post something else assuming this is resolved by now. Answer is check your VAE settings

    marjon94Nov 13, 2023· 1 reaction
    CivitAI

    Not working with TensorRT? I get a

    RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1

    AIAIXLIINov 14, 2023

    Hi, I'm not shure already but I think I had the same problem with another checkpoint. Try to change or deactivate the VAE, if you use one. Maybe this could help.

    Rick_68Nov 15, 2023
    CivitAI

    I'm dying to try It out, but It won't load.

    It keeps loading back the previous model.

    tmobleycpc514Nov 22, 2023

    having the same issue. did you find a solve?

    deadgremNov 30, 2023

    Same was happening for me once with another model and I ended up re-downloading the model as a fix.

    gibiluxDec 29, 2023

    That's a weird thing that happens sometimes in a1111. It never happened to me in comfyui

    TrafficMeanyNov 19, 2023
    CivitAI

    Question about the vocab.json! I noticed in the tokenization vocab. Many of the words were duplicated with the end of word string attached the second time, some words are not duplicated and either have the string or do not with no rhyme or reason. (((for anyone that doesn't know: in subword tokenization methods such as Byte Pair Encoding (BPE) used in NLP, </w> is used to indicate the end of a word when a word is split into subword units. This kind of tokenization is often used in tasks like machine translation or language modeling)) I don't know if that is an error in logging the tokens or if the tokens are actually effected by the </w>. I plan on testing the prompts both ways but I'd like to know if that was just an error in transcribing the file or if its actually in the model.

    28565Nov 22, 2023
    CivitAI

    Does this model require the refiner?

    AI_Art_LoverNov 27, 2023· 3 reactions

    You don't need to use the refiner with this model

    nicoakira1126Nov 23, 2023
    CivitAI

    it takes HUGE time while using controlnet, anyone have same issue?

    alfaranko69Nov 23, 2023· 2 reactions
    CivitAI

    I have only 4GB of RAM, and it takes a lot of time to generate one image

    SuzanneDec 17, 2023

    @alfaranko69 use Fooocus UI, it's very faster

    dmytro40uahDec 22, 2023· 1 reaction

    @Suzanne focus is better than automatic1111 ? I use automatic1111 and generate images on all XL models takes a lot of time. I have 3060ti +16 ram +i5 11th gen

    SuzanneDec 22, 2023

    @dmytro40uah yes, you're right

    I've got an RTX 3070 8Go Vram, and images that used to take more than 3 minutes on A1111 now only take 30 seconds with Fooocus.

    I love it and don't need a turbo model to do that... 😊

    acdseeunlearnedDec 23, 2023

    Try putting --medvram in your command line arguments of your webui-user.bat file. I have a 1080 GTX with 8GB and my renders take less than a minute or so on XL models

    MadTuneBKMar 1, 2024

    well use tiled vae encode/decode and tiled ksample

    tomwinNov 25, 2024

    @Suzanne thanks! 

    acdseeunlearnedDec 23, 2023· 11 reactions
    CivitAI

    Are there "recommended" settings for the model? i.e. CFG scale, Sampling steps, etc. for certain types of image creation?

    AI_Art_LoverJan 5, 2024· 5 reactions

    Here were the original recommended settings when the model was released

    CFG: 5-10 ( I use 7 on default)
    Sampler: DPM++ 2M Karras
    Sampling steps: 25 to 35
    VAE: Normal Stability VAE
    Refiner: Not needed

    acdseeunlearnedJan 6, 2024

    @AI_Art_Lover Thank you so much for this. I appreciate you.

    designing839641Apr 5, 2024· 1 reaction
    CivitAI

    any upcoming versions in future??

    yofoton174609Apr 20, 2024· 2 reactions
    CivitAI

    probably the best model for pagan/viking photorealistic characters right now :D

    ManofDoom94Jun 4, 2024
    CivitAI

    Hey you can use this offline on your phone but u need 12gb ram minimum phone. U can use fp8 in comfyui I have an guide on installing ComfyUi on Termux in android

    https://github.com/KintCark/COMFYUI-ANDROID-TERMUX

    JanetSep 3, 2024· 1 reaction
    CivitAI

    This was a great model, I hope you make it for Flux!!!

    Checkpoint
    SDXL 1.0

    Details

    Downloads
    30,070
    Platform
    CivitAI
    Platform Status
    Available
    Created
    10/20/2023
    Updated
    5/13/2026
    Deleted
    -

    Available On (2 platforms)

    Same model published on other platforms. May have additional downloads or version variants.