CivArchive
    ComfyUI Flash Head Workflow: Ultrafast Head Lip-Sync - v1.0
    NSFW

    Click here to try workflow online:

    (Notice: Some nodes are built by RunningHub. If you download the workflows and run them offline, they may not work!)

     

    Open Source Address: https://github.com/Soul-AILab/SoulX-FlashHead

     

    (Workflows can be downloaded via the links below: open a link and use the download button in the top right corner. Due to limited VRAM on my local machine, I haven't been able to test them locally, so if you're not familiar with running ComfyUI locally, it's best to run them online. The FlashHead node is built on RunningHub (RH).)
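
Before running a downloaded workflow offline, it can help to list the node types it uses and compare them with what is installed locally. The following is a minimal sketch, assuming the workflow file is JSON in one of the two shapes ComfyUI commonly exports (a UI export with a `nodes` list whose entries carry a `type`, or an API export mapping node ids to objects with a `class_type`). The helper names and the `FlashHeadSampler` node name are hypothetical placeholders, not taken from the actual workflow files.

```python
import json


def list_node_types(workflow: dict) -> set[str]:
    """Collect the node type names used by a ComfyUI workflow dict."""
    # UI export shape: {"nodes": [{"type": "..."}, ...]}
    if isinstance(workflow.get("nodes"), list):
        return {
            node["type"]
            for node in workflow["nodes"]
            if isinstance(node, dict) and "type" in node
        }
    # API export shape: {"1": {"class_type": "...", "inputs": {...}}, ...}
    return {
        node["class_type"]
        for node in workflow.values()
        if isinstance(node, dict) and "class_type" in node
    }


def missing_nodes(workflow: dict, installed: set[str]) -> set[str]:
    """Return node types the workflow needs that are not in `installed`."""
    return list_node_types(workflow) - installed


# Made-up example data; "FlashHeadSampler" is a placeholder node name.
example = json.loads(
    '{"1": {"class_type": "LoadImage", "inputs": {}},'
    ' "2": {"class_type": "FlashHeadSampler", "inputs": {}}}'
)
print(sorted(missing_nodes(example, installed={"LoadImage"})))
```

Any node type reported as missing is a likely candidate for a RunningHub-only node, in which case running the workflow online is the safer route.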

     

    Workflow: AA--Ultra-Fast Digital Human FlashHead

    Experience Link: https://www.runninghub.ai/post/2030340894288781313/?inviteCode=rh-v1401

     

    Workflow: AA--Emotion Control Digital Human - Ultra-Fast FlashHead + Index Voice Cloning (8 Emotion Controls)

    Experience Link: https://www.runninghub.ai/post/2030585203055398914/?inviteCode=rh-v1401

     

    Workflow: AA--Preset Voice Ultra-Fast Digital Human - FlashHead + QwenTTS - One Image, 9 Voices

    Experience Link: https://www.runninghub.ai/post/2030658043398070273/?inviteCode=rh-v1401

     

    Workflow: AA--Fully Automatic Ultra-Fast Digital Human - FlashHead + Qwen Sound Design - Auto-Prompt from One Image - Digital Human Card Pull!

    Experience Link: https://www.runninghub.ai/post/2030589859588476930/?inviteCode=rh-v1401

     

    ### Introduction to Flash Head Digital Human Workflows

     

    Flash Head is a digital human generation project running on ComfyUI, focused on speed. It achieves extreme video generation speed by only driving the head region for lip-sync, sacrificing dynamics in other parts of the body.

     

    #### Core Features:

     

    *   Ultimate Speed: At 512p resolution, generating a 5-second video takes only about 30 seconds.

    *   Two Models: Offers Pro and Light versions. The Light version is three times faster than Pro but compromises on quality, suitable for quick validation.

    *   Image Requirement: Must use a facial close-up image; otherwise, the model cannot recognize the head and lips.
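
As a rough planning aid, the speed figures above can be turned into a back-of-the-envelope estimate. This sketch assumes that the "about 30 seconds for a 5-second video" figure refers to the Pro model at 512p, that generation time scales linearly with clip length, and that "three times faster" means one third of the Pro time. None of these assumptions are confirmed by the description, and the function name is made up for illustration.

```python
def estimate_generation_seconds(clip_seconds: float, model: str = "pro") -> float:
    """Rough generation-time estimate at 512p, under the stated assumptions."""
    # From the description: about 30 s of compute for a 5 s clip.
    # ASSUMPTION: this figure is for the Pro model and scales linearly.
    pro_seconds_per_clip_second = 30.0 / 5.0
    # From the description: Light is three times faster than Pro.
    light_speedup = 3.0

    if clip_seconds < 0:
        raise ValueError("clip_seconds must be non-negative")
    pro_time = clip_seconds * pro_seconds_per_clip_second
    if model == "pro":
        return pro_time
    if model == "light":
        return pro_time / light_speedup
    raise ValueError(f"unknown model: {model!r}")


for name in ("pro", "light"):
    print(name, estimate_generation_seconds(20, name))
```

Actual times depend on hardware and on whether the workflow runs locally or on RunningHub, so treat the output as an order-of-magnitude guide only.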

     

    #### Main Workflows:

     

    The following workflows are introduced to meet different application scenarios:

     

    1.  Basic Workflow

        *   The simplest version, containing only 6 core nodes.

     

    2.  Voice Cloning Digital Human

        *   Allows you to upload an image and reference audio to clone the voice and drive the digital human.

     

    3.  Voice Preset Digital Human

        *   Similar to cloning, but uses pre-set voices within the workflow, eliminating the need for user uploads.

     

    4.  Sound Design Digital Human

        *   Fully Automatic Workflow: You only need to upload an image. The model analyzes the image via a VQA prompting node, automatically generates a voice prompt, and then a TTS model designs and generates the sound based on that prompt.
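
The data flow of the fully automatic workflow can be sketched as three chained stages. This is only an illustration of the order of operations described above, not the actual node graph: the three callables stand in for the VQA node, the TTS voice-design model, and the FlashHead node, and all names and signatures here are hypothetical. How the spoken text itself is chosen is not described in the source, so it is left out.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class DigitalHumanResult:
    voice_prompt: str
    audio_path: str
    video_path: str


def run_sound_design_pipeline(
    image_path: str,
    describe_image: Callable[[str], str],     # stand-in for the VQA prompting node
    design_voice: Callable[[str], str],       # stand-in for the TTS voice-design model
    animate_head: Callable[[str, str], str],  # stand-in for the FlashHead node
) -> DigitalHumanResult:
    """Chain the stages: image -> voice prompt -> audio -> lip-synced video."""
    voice_prompt = describe_image(image_path)
    audio_path = design_voice(voice_prompt)
    video_path = animate_head(image_path, audio_path)
    return DigitalHumanResult(voice_prompt, audio_path, video_path)


# Usage with dummy stages that only return strings (no models are run).
result = run_sound_design_pipeline(
    "portrait.png",
    describe_image=lambda img: f"voice matching {img}",
    design_voice=lambda prompt: "voice.wav",
    animate_head=lambda img, audio: "talking_head.mp4",
)
print(result)
```

Passing the stages in as callables keeps the sketch runnable without any model and makes the single user input, the image, explicit.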

     

    #### Summary:

     

    Overall, the Flash Head workflows perform well in scenarios that demand maximum speed (such as real-time interaction and rapid prototyping) and are worth trying out. However, there is still a gap in generation quality and stability compared to more mature solutions like Infinite Talk, so they are not yet recommended for production use.

    Description

    Requires FlashHead!

    Workflows
    Qwen

    Details

    Downloads: 62
    Platform: CivitAI
    Platform Status: Available
    Created: 3/10/2026
    Updated: 5/27/2026
    Deleted: -

    Files

    comfyuiFlashHeadWorkflow_v10.zip

    Mirrors