๐ฎ Transform Your Images with Natural Language Instructions
The Perfect Companion for Seamless Editing
This guide outlines a complete workflow using FluxFill-Dev with ICEdit-MoE-LoRA for powerful natural language-based image editing.
๐ Overview: The Future of Image Editing is Here
ICEdit-MoE-LoRA represents a breakthrough in AI image editing - a lightweight Low-Rank Adaptation (LoRA) module that brings professional-grade, instruction-based editing capabilities to large diffusion transformers. By intelligently fine-tuning only 1% of parameters using just 0.5% of the original training data, it achieves extraordinary editing capabilities that match or even surpass premium commercial solutions.
๐ Complete Workflow
This workflow combines FluxFill + ICEdit-MoE-LoRA for editing images with natural language instructions. For enhanced results, diffusion models like LEOPARD , Bernoulli or SPEED_Q8 with the Gemini auto prompt node in 4-8 steps.(i used very light flux model which need less steps)
A diptych with two side-by-side images of the same scene. On the right, the scene is the same as on the left but "your edit"(prostyle included just put ur edit)
๐ ๏ธ Complete Installation Guide
๐ Setup Structure
๐ ComfyUI/
โโโ ๐ models/
โ โโโ ๐ diffusion_models/
โ โ โโโ (basic)๐ fluxfill-dev.safetensors # https://civarchive.com/models/1108146/fluxfill-devgguf
โ โ โโโ(option1)๐ LEOPARD.gguf # https://civarchive.com/models/1534861
โ โ โโโ(option2)๐ Bernoulli.gguf # https://civarchive.com/models/682369
โ โ โโโ(option3)๐ SPEED_Q8.gguf # https://civarchive.com/models/661102/speedq8
โ โโโ ๐ text_encoders/
โ โ โโโ (basic)๐ clip_l.safetensors
โ โ โโโ (option1)๐ t5xxl_fp16.safetensors
โ โ โโโ (option2)๐ t5xxl_fp8_e4m3fn.safetensors
โ โ โโโ (option3)๐ t5xxl_fp8_e4m3fn_scaled.safetensors
โ โโโ ๐ vae/
โ โ โโโ ๐ ae.safetensors
โ โโโ ๐ lora/
โ โโโ ๐ ICEdit-MoE.safetensors
๐ Essential Components
โจ LoRA Models - The Magic Touch
๐ค Text Encoders - The Brain Behind Natural Language Understanding
Note: You only need to choose ONE of the T5XXL options below based on your hardware capabilities
T5XXL Options (choose only one):
๐ญ VAE - The Visual Artist
๐ผ๏ธ Diffusion Models - The Imaging Engine
##(You only need to choose ONE of those or use flux-dev itself its up to u)
LEOPARD.safetensors - Advanced diffusion model for refined outputs
Bernoulli.safetensors - Alternative diffusion model for enhanced generation
SPEED_Q8.gguf - Enhanced flux-dev model for improved results
Note: Any of these diffusion models can be used for efficient editing with fewer steps (4-8 steps recommended). Use flux-dev-fill for editing and flux-dev for enhancing results.
๐ฌ Example Instructions You Can Try
"Add a majestic castle on the mountain in the background"
"Change the color of the car from red to electric blue"
"Remove the person from the image and replace with a flowering tree"
"Transform the style to watercolor painting while maintaining all details"
"Add dramatic sunset lighting to the scene"
๐จโ๐ป Developer Information
This workflow guide was created by Abdallah Al-Swaiti:
For additional tools and updates, check out the OllamaGemini Node: GitHub Repository
๐ Community Recognition
A special thank you to sanaka87 for revolutionizing open-source image editing by releasing the ICEdit-MoE-LoRA checkpoint and demos. Your contribution empowers creators everywhere and accelerates innovation across the AI art community!
Description
FAQ
Comments (8)
just what I need rn!
now lets return to video
The Lora for ice-edit is no longer available.
corrected
thanks for sharing.
is ICE working at low res (lets say half flux res) then need upscaling, right? all tests i made i tryed to push it at higher res but results were not good. i wonder if is me or is just how ICE works
if u check the workflow all images downscale to have 512 in width , i think already its work with half flux res !
ERROR lora diffusion_model.img_in.weight shape '[3072, 64]' is invalid for input of size 1179648. why this erros using Bernoulli checkpoint. Whats the issue here can you please tell
may this related to text encoders , firstly be sure u use gguf model loader latest comfyui update , or try change text encoders !



