Folder Image Captioner with Qwen-VL
This ComfyUI workflow batch-captions entire folders of images in a single run.
It loads images from a selected folder, resizes them if needed, and generates detailed captions using Qwen-VL-Mod (Qwen3-VL-8B-Instruct-Abliterated). It then saves each image together with a caption file that has the same base filename (e.g., photo.jpg + photo.txt).
Ideal for creating training datasets for LoRAs, character fine-tuning, or any project that requires consistent captions.
Features:
Batch processing directly from folder
Saves image + caption with the same name
High detail and accuracy thanks to Qwen-VL
Captions reflect the pose, camera angle, lighting, and location of the original image
Required Custom Nodes:
ComfyUI Custom Nodes
Qwen-VL-Mod (or Qwen3-VL-8B-Instruct-Abliterated)
Resize Image v2
Load Image Dataset from Folder
Save Image and Text Dataset to Folder
Created by: bobgus39
Original profile: https://civarchive.com/user/bobgus39
Usage: Select your image folder and run the workflow. Each caption reflects the original pose, camera angle, lighting, and background/location of its image, which makes the output well suited to training consistent characters or scenes.
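After a run, it can be worth confirming that every image in the output folder has a matching caption file before starting training. The following is a minimal Python sketch of such a check, not part of the workflow itself. It relies only on the naming convention described above (photo.jpg + photo.txt). The function name find_unpaired_images and the list of image extensions are assumptions made for illustration.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to match your dataset.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}


def find_unpaired_images(folder):
    """Return names of images in `folder` that lack a non-empty caption.

    A caption is expected next to the image, with the same base
    filename and a .txt extension (photo.jpg -> photo.txt).
    """
    missing = []
    for image_path in sorted(Path(folder).iterdir()):
        if not image_path.is_file():
            continue
        if image_path.suffix.lower() not in IMAGE_EXTENSIONS:
            continue
        caption_path = image_path.with_suffix(".txt")
        # Treat a missing file or a whitespace-only file as "no caption".
        if not caption_path.is_file():
            missing.append(image_path.name)
        elif not caption_path.read_text(encoding="utf-8").strip():
            missing.append(image_path.name)
    return missing
```

Calling find_unpaired_images on the output folder returns an empty list when every image has a caption. Any names it does return are images to re-run or caption by hand.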