HiDream-O1-Image (codename Peanut) is an 8B text-to-image foundation model from HiDream.ai, built on a Pixel-level Unified Transformer (UiT) that operates end to end on raw pixels with no external VAE or separate text encoder. The same checkpoint handles text-to-image, instruction-based editing, and multi-reference subject personalization natively at up to 2,048 x 2,048.
Originally released by HiDream.ai on Hugging Face. All credit for the model goes to the HiDream.ai team. Civitai is hosting a mirror so creators can run it on-site - head to the original repo for weights, updates, the technical report, and to follow the project directly.
Built by
- HiDream.ai - upstream organization and authors of the technical report.
Versions mirrored on Civitai
Two checkpoints are mirrored, both as fp8 SafeTensors:
- Standard - full 50-step model. Best quality. Guidance scale 5.0.
- Dev - distilled 28-step model. Faster, guidance scale 0.
HiDream also publishes a 200B+ Pro variant upstream, but weights are not public, so it is not mirrored here.
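For orientation, here is a minimal sketch of what running either checkpoint could look like, assuming the model loads through diffusers' generic DiffusionPipeline entry point; that integration and the repo id are assumptions, so check the upstream repo for the supported loader.

```python
# Sketch only: assumes HiDream-O1-Image loads via diffusers' generic
# DiffusionPipeline entry point. Verify against the upstream repo.
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "HiDream-ai/HiDream-O1-Image",  # upstream Hugging Face repo id
    torch_dtype=torch.bfloat16,
).to("cuda")

# Standard checkpoint: 50 steps, guidance 5.0 (per the list above).
image = pipe(
    prompt="a lighthouse at dusk, long-exposure photograph",
    num_inference_steps=50,
    guidance_scale=5.0,
    width=2048,
    height=2048,  # native 2K synthesis, no upscaler
).images[0]
image.save("lighthouse.png")
```

For the Dev checkpoint, swap in num_inference_steps=28 and guidance_scale=0.0.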
One model, three tasks
The same checkpoint handles text-to-image, instruction-based editing with a single reference image, and multi-reference subject-driven personalization with up to ten reference images. Mode is selected by what you pass at inference - no separate adapters or LoRAs needed.
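As a rough illustration of that input-driven mode switching, the sketch below reuses the pipe object from the snippet above; the image and reference_images keyword names are guesses for illustration, not a confirmed API.

```python
# Illustrative only: kwarg names ("image", "reference_images") are
# assumptions about how the unified checkpoint exposes its three modes.
# `pipe` is the DiffusionPipeline loaded in the earlier sketch.
from PIL import Image

# 1) Plain text-to-image: pass a prompt, nothing else.
out = pipe(prompt="a ceramic teapot on a wooden table")

# 2) Instruction-based editing: one source image plus an edit instruction.
src = Image.open("teapot.png")
out = pipe(prompt="make the teapot cobalt blue", image=src)

# 3) Multi-reference personalization: up to ten subject references.
refs = [Image.open(f"subject_{i}.png") for i in range(3)]
out = pipe(prompt="the subject hiking in the Alps", reference_images=refs)
```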
Native 2K and multilingual text
Direct synthesis up to 2,048 x 2,048 without upscaling. Strong long-text rendering in both English and Chinese (LongText-Bench 0.979 EN / 0.978 ZH), 0.90 on GenEval for compositional prompts, and 89.83 on DPG-Bench for dense prompt alignment.
Reasoning-driven prompt agent (upstream only)
The HiDream repo ships a separate "thinking" prompt agent (Gemma-4-31B or an OpenAI-compatible API) that rewrites raw instructions into self-contained prompts before generation. That agent is not part of the Civitai mirror - if you want it, run upstream locally.
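For anyone wiring that up themselves, the OpenAI-compatible path would look something like this sketch; the endpoint, model name, and system prompt here are placeholders, not the repo's actual values.

```python
# Sketch of the prompt-rewrite step against an OpenAI-compatible server.
# base_url, model name, and the system prompt are placeholder assumptions.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="none")

def rewrite_prompt(raw_instruction: str) -> str:
    """Expand a terse instruction into a self-contained generation prompt."""
    resp = client.chat.completions.create(
        model="local-llm",  # whatever model the server exposes
        messages=[
            {"role": "system",
             "content": "Rewrite the user's instruction as a detailed, "
                        "self-contained image-generation prompt."},
            {"role": "user", "content": raw_instruction},
        ],
    )
    return resp.choices[0].message.content

print(rewrite_prompt("make it look like winter"))
```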
Links
- Hugging Face: HiDream-ai/HiDream-O1-Image
- GitHub: HiDream-ai/HiDream-O1-Image
- Technical report: PDF
- License: MIT
Comments (7)
If it's an Open Source product I'll definitely try it.
Cool, but I'll let you in on a secret as to why it might not be successful. This one, and other new models too. I think I'll be posting this with every new model.
NO INFORMATION ON HOW TO USE IT!!
What does it work with? Forge, Nano, A1111? Just ComfyUI? If it's Comfy only, please provide full instructions on where to place which files, what nodes are required, and so on. A lot of people started out with A1111 and SD models, and switching to Comfy's spaghetti code is unacceptable to them, which is why they still use SDXL or Illustrious: you just download those models from Civitai and they work.
If you want your model to be successful, prepare a simple tutorial on YouTube, etc., explaining what to do and how to do it.
Translated with DeepL.com (free version)
Wait, so there's no official prompting guide for this model? That's a shame.
It's simply not ready yet: https://huggingface.co/Comfy-Org/HiDream-O1-Image/tree/main
If you read the Hugging Face page, they specify that the model has no VAE, and I don't think a CLIP either, but it uses Gemma as a prompt enhancer. So... just the checkpoint in the diffusion-models and text-encoders directories. It wouldn't run anyway, but that may be to your point.
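If that reading is right, the placement being described would presumably be the standard ComfyUI folders; the filename below is a guess, and as noted, it likely won't load yet.

```
ComfyUI/models/
├── diffusion_models/
│   └── hidream-o1-image-fp8.safetensors   # the mirrored checkpoint (name is a guess)
└── text_encoders/                          # per the card, no separate text encoder ships
```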
Also, there is information provided. They ship an app.py that opens a Gradio server with a prompt helper. I mean, you're right that they don't provide detailed prompt examples, but there are a few.
idk someone posted a workflow, maybe it does work? https://civitai.red/models/2618821
You know, you could just learn ComfyUI ONCE, and not have any of these problems.
@brnfd24434343d I use Comfy when I have to; I just don't like this type of interface ;)
An AIO model? I wonder how you'd fine-tune it. But at least it stays within reach of the community without needing a RunPod instance.
