✨ TOOL — Image to text — Simple Workflow
A clean, all-in-one Image to text workflow built entirely with the UmeAiRT Toolkit for ComfyUI.
Only 3 nodes. No spaghetti wires. Just load your model, write your prompt, and hit generate.
⚠️ IMPORTANT — Nodes 2.0 Required
This workflow is built for the Nodes 2.0 (Vue) interface of ComfyUI. If you don't enable it, the workflow may have display problems.
How to activate Nodes 2.0:
Open ComfyUI
Go to Settings (⚙️ icon, bottom-left)
Find "Use Nodes V2 (Vue)" and toggle it ON
Refresh the page
Load the workflow
If you prefer the classic interface, check out my Legacy version of this workflow instead (link).
🎯 Features
Image-to-text generation
Automatic download of models
📦 Custom Node Required
Only one custom node to install:
Install via ComfyUI Manager (search "UmeAiRT") or use the UmeAiRT Auto-Installer.
The Toolkit packages everything internally — upscaler, face detailer, metadata saver. No other custom nodes needed.
Description
Base version
FAQ
Comments (3)
Hi, Thank you for this. Its interesting to try on older images. Great tool!
Thank you very useful. Is it possible to create a video to txt version of this, for example for a 5 second video?

