HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
✨ Key Features
HuMo is a unified, human-centric video generation framework designed to produce high-quality, fine-grained, and controllable human videos from multimodal inputs—including text, images, and audio. It supports strong text prompt following, consistent subject preservation, synchronized audio-driven motion.
VideoGen from Text-Image - Customize character appearance, clothing, makeup, props, and scenes using text prompts combined with reference images.
VideoGen from Text-Audio - Generate audio-synchronized videos solely from text and audio inputs, removing the need for image references and enabling greater creative freedom.
VideoGen from Text-Image-Audio - Achieve the higher level of customization and control by combining text, image, and audio guidance.
Examples and models from the following sources reuploaded for your convenience here:
https://huggingface.co/bytedance-research/HuMo
https://github.com/Phantom-video/HuMo
Compatible with both 480P and 720P resolutions. 720P inference will achieve much better quality.
Description
FAQ
Comments (6)
Great. Workflow?
Did you find any workflow?
Pretty cool! Any workflow for your samples?
Did you find any workflow?
@honryindian Here are two working processes. Only it is very demanding on video memory. My video card 3080 10 GB starts these processes, but you need to wait for the end of generation for an eternity. https://civitai.com/models/1957082/humo-dual-image-reference-digital-human?modelVersionId=2215110 , and https://civitai.com/models/1957156/humo-single-person-reference-digital-human?modelVersionId=2215207
Video Quality is Great; but, lip-sync is very Bad.
Details
Files
humoForWan_humo14BFp16.safetensors
Mirrors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Wan2_1-HuMo-14B_fp16.safetensors
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.