CivArchive
    Ovis-Image - Text Encoder
    Preview 112323013

    HF | GH
    Update your ComfyUI to the latest (Github version) => cd to ComfyUI directory -> terminal -> git pull -> restart ComfyUI

    Ovis-Image is this cool 7B text-to-image AI model from Alibaba's team. Basically, it takes your text prompts and whips up images, but it's super good at nailing text inside those images, like making sure words are clear, spelled right, and styled in different fonts without looking messy.

    It's awesome for stuff like designing posters, logos, app mockups, or infographics where text needs to pop and be readable, even in long chunks or weird aspect ratios. Handles English and Chinese like a champ, beating bigger models on benchmarks for accuracy and clarity.

    Plus, it's efficient, runs on just one beefy GPU with low lag, so great for real-world apps without needing monster hardware. It's solid for general pics too, but text rendering is its killer feature.

    These models are redistributed here for the sake of convenience.

    Description

    Ovis is not listed in the Base Model list, it will be updated as soon as it is added

    Checkpoint
    Other

    Details

    Downloads
    29
    Platform
    CivitAI
    Platform Status
    Available
    Created
    12/7/2025
    Updated
    12/10/2025
    Deleted
    -

    Files

    ovisImage_textEncoder.safetensors

    Mirrors