Folder Image Captioner with Qwen-VL WF - CivArchive (CivitAI Archive)

Folder Image Captioner with Qwen-VL

This ComfyUI workflow allows you to batch caption entire folders of images quickly and efficiently.

It loads images from a selected folder, resizes them if needed, generates high-quality detailed captions using Qwen-VL-Mod (Qwen3-VL-8B-Instruct-Abliterated), and saves both the original image and its corresponding caption file with the exact same filename (e.g., photo.jpg + photo.txt).

Ideal for creating training datasets for LoRAs, character fine-tuning, or any project that requires consistent captions.

Features:

Batch processing directly from folder
Saves image + caption with the same name
High detail and accuracy thanks to Qwen-VL
Maintains the same pose, camera angle, lighting, and location from the original image

Required Custom Nodes:

ComfyUI Custom Nodes
Qwen-VL-Mod (or Qwen3-VL-8B-Instruct-Abliterated)
Resize Image v2
Load Image Dataset from Folder
Save Image and Text Dataset to Folder

Created by: bobgus39 Original profile: https://civarchive.com/user/bobgus39

Usage: Simply select your image folder and run the workflow. The captions will respect the original pose, camera angle, lighting, and background/location of each image, making them perfect for training consistent characters or scenes.

Description

FAQ

Comments (1)

Details

Files

folderImageCaptioner_v10.zip

Mirrors