Ideogram 4 GGUF Workflow for ComfyUI
A simple workflow for running Ideogram 4 in GGUF quantized format on ComfyUI, optimized for systems with limited VRAM.
Tested on RTX 3060 12GB with 16GB RAM
β οΈ Don't forget to update your ComfyUI first!
Available Versions:
- v1.0 - Standard (2 UNET GGUF models) - Default traditional workflow
- v2.0 - TurboTime (Single UNET GGUF) - β‘ Faster, better quality, super low VRAM
---
π What's New in v2.0
- β‘ Optimized workflow: Removed redundant nodes (Dual Model CFG Guider, CFG Override)
- π TurboTime LoRA support: 2-step generation with CFG=0.0
- πΎ Low VRAM mode: Works on 8GB+ GPUs with Q4_0 quantization
- π― Cleaner structure: Added BasicGuider for proper CFG handling
- π Faster generation: ~1-2 seconds per image on modern GPUs
---
π¦ Required Models
1. UNET Models β ComfyUI/models/unet/
Download unet files from HuggingFace (Recommended):
- Repository: https://huggingface.co/molbal/ideogram-4-gguf
- Main Model (choose desired quantized version):
- ideogram4-transformer-q4_0.gguf (5.64 GB) - β Best for low VRAM
- ideogram4-transformer-q4_1.gguf (6.21 GB)
- ideogram4-transformer-q5_0.gguf (6.77 GB)
- ideogram4-transformer-q5_1.gguf (7.33 GB)
- ideogram4-transformer-q8_0.gguf (10.1 GB) - Best quality
- Unconditional Model (not needed in workflow v2.0):
- ideogram4-unconditional_transformer-q4_0.gguf (5.64 GB)
- ideogram4-unconditional_transformer-q4_1.gguf (6.21 GB)
- ideogram4-unconditional_transformer-q5_0.gguf (6.77 GB)
- ideogram4-unconditional_transformer-q5_1.gguf (7.33 GB)
- ideogram4-unconditional_transformer-q8_0.gguf (10.1 GB)
π‘ Note: The repository includes inference speed and memory usage charts to help you choose the best quantization for your system.
Alternative link for unet files (Civitai - Q4_0 only):
- Download from: https://civarchive.com/models/2681714/ideogram-4-gguf
- Note: File names may differ slightly from HuggingFace
2. Text Encoder β ComfyUI/models/clip/
- Qwen3-8B-Q4_K_M.gguf
- Source: https://huggingface.co/Qwen/Qwen3-8B-GGUF
3. Prompt Enhancer (optional) β ComfyUI/models/clip/
- gemma4_e4b_it_fp8_scaled.safetensors
- Source: https://huggingface.co/Comfy-Org/gemma-4
- Automatically converts natural language to JSON format
4. VAE β ComfyUI/models/vae/
- flux2-vae.safetensors
- Download: https://huggingface.co/Comfy-Org/flux2-dev/resolve/main/split_files/vae/flux2-vae.safetensors
---
βοΈ Custom Node Required
Single GGUF custom node by molbal (for both UNET and CLIP)
- Repository: [molbal/ComfyUI-GGUF](https://github.com/molbal/ComfyUI-GGUF)
- Install: git clone https://github.com/molbal/ComfyUI-GGUF.git
- Nodes: UnetLoaderGGUF + GGUFCLIPLoader
β οΈ Important: This is a fork of city96/ComfyUI-GGUF with Ideogram 4 support. If you have city96 version installed, remove it first (same folder name causes conflicts).
Installation steps:
1. Stop ComfyUI
2. Delete the entire ComfyUI-GGUF folder in custom_nodes (not just the files inside it)
3. Clone: git clone https://github.com/molbal/ComfyUI-GGUF.git
4. Restart ComfyUI
π‘ Optional: If you encounter any import errors, install requirements:
pip install -r ComfyUI-GGUF/requirements.txtβ That's it! This single extension handles both UNET and CLIP loading.
---
π Choosing the Right Quantization
The HuggingFace repository includes helpful charts for:
- Inference Speed: How fast each quant generates images
- Memory Usage: How much VRAM each quant requires
General Guidelines:
- Q4_0: Lowest VRAM (~5.5 GB), fastest, good quality
- Q5_0/Q5_1: Balanced VRAM (~6.8-7.3 GB), better quality
- Q8_0: Highest VRAM (~10 GB), best quality, slower
π‘ Check the charts in the repository for detailed comparisons!
---
ποΈ JSON Prompting (Optional)
Ideogram 4 works best with structured JSON prompts. This workflow uses Gemma 4 to automatically convert your natural language prompts into JSON format for:
- β Better text rendering and typography
- β More accurate composition control
- β Consistent results across generations
Just write a simple prompt like "A poster for a coffee shop" and let Gemma 4 handle the rest!
---
π‘ Quick Tips
- For TurboTime LoRA (v2.0): Use only ideogram4-transformer-q4_0.gguf (no unconditional model needed)
- TurboTime LoRA: Download from: https://huggingface.co/ostris/ideogram_4_turbotime_lora
- For standard mode (v1.0): Use both main and unconditional models
- CFG Settings: 0.0 for TurboTime, 7.0 for standard mode
- Scheduler: mu=0.5, std=1.75 for TurboTime | mu=0.0, std=1.5 for standard
- Quantization: Q4_0 for lowest VRAM, Q8_0 for best quality
π Credits
- Ideogram 4 Model: [Ideogram AI] https://ideogram.ai/
- GGUF Conversion & Custom Node: [molbal] https://github.com/molbal/ComfyUI-GGUF
- TurboTime LoRA: [ostris] https://huggingface.co/ostris/ideogram_4_turbotime_lora
- Workflow: [dvdufo] https://civarchive.com/user/dvdufo
Description
v2.0 - TurboTime Support & Low VRAM Optimization β‘
If you want to test Ideogram 4 with a single GGUF file in just 2 steps, here's the optimized workflow!
What's new:
- π TurboTime LoRA support (2-step generation, CFG=0.0)
- πΎ Single model mode - no unconditional model needed
- β‘ Cleaner workflow structure (removed redundant nodes)
- π― Works on 8GB+ GPUs with Q4_0 quantization
Tested on: RTX 3060 12GB with 16GB RAM
Requirements:
- Latest ComfyUI version
- Ideogram 4 TurboTime LoRA
- Only one GGUF model (Q4_0)
- Custom node: molbal/ComfyUI-GGUF
Enjoy! π
FAQ
Comments (6)
this worked, thanks! quick note to correct in your instructions.
- Install: git clone https://huggingface.co/molbal/ideogram-4-gguf
β οΈ Important: This is a fork of city96/ComfyUI-GGUF. If you have city96 version installed, remove it first (same folder name causes conflicts).
Installation steps:
1. Delete/rename existing ComfyUI-GGUF folder (city96 version)
2. Clone: git clone https://huggingface.co/molbal/ideogram-4-gguf
you need to change it to "git clone https://github.com/molbal/ComfyUI-GGUF.git"
Thanks! π You're right - I fixed it now.
Correct command: git clone https://github.com/molbal/ComfyUI-GGUF.git
Good catch! π
@dvdufoΒ The fork had an error where it prevented _K quant text encoders from loading - I fixed it, sorry about that.
Also, I have all supported Ideogram quants here if you need them: https://huggingface.co/molbal/ideogram-4-gguf
@molbalΒ Thanks for the fix! π
I just tested the latest version - deleted the old ComfyUI-GGUF folder, cloned fresh from GitHub, and everything works perfectly now. Both UNET and CLIP load from the same custom node without any issues.
Really appreciate you maintaining this fork and adding support for all the quant variants. I'll update the workflow description to simplify the custom nodes section and include your HuggingFace link.
Great work! π
@dvdufoΒ I deleted the ComfyUI-GGUF folder in Custom Nodes folder but then got this error message when trying to clone as instructed
git clone https://github.com/molbal/ComfyUI-GGUF.git
fatal: destination path 'ComfyUI-GGUF' already exists and is not an empty directory.
π Workflow Files Updated!
Both workflow files (v1.0 and v2.0) have been updated to use only the molbal custom node.
If you downloaded before: Please re-download the workflow files to get the simplified version.
- Now only ONE custom node is needed: [molbal/ComfyUI-GGUF](https://github.com/molbal/ComfyUI-GGUF)
- It handles both UNET and CLIP loading
- If you have city96/ComfyUI-GGUF installed, please remove it first
TurboTime LoRA Support (WF v2.0):
- super fast 2-step generation (4 step for optimal quality)
- Only needs 1 single UNET model
All Quantization Variants Available Now:
- Check HuggingFace for Q4_0, Q4_1, Q5_0, Q5_1, Q8_0
- Includes speed and memory usage charts
Thanks to molbal for maintaining the custom node and SCHADENFREUDE__ for catching the installation error! π
Check the full description for detailed instructions.
Happy generating! π¨




