This workflow takes an Image and an audio track as input to generate a video.
Important Notice
Update ComfyUI, KJ Nodes and ComfyUI-GGUF. A lot of the code has been updated in the last few days.
V2 update
Changed to use the native comfyui loaders. The KJ loaders seem to be giving noise for some generations. We are using the official LTX-2 release for the VAE and Kijai's release for diffusion model GGUF. Changed to allow loading of an audio file for input.
Models to download
Place in models/diffusion_models
https://huggingface.co/Lightricks/LTX-2/resolve/main/ltx-2-19b-dev-fp8.safetensors
Place in models/vae
Place in models/text_encoders
(not needed in v2 of the workflow) https://huggingface.co/Kijai/LTXV2_comfy/resolve/main/text_encoders/ltx-2-19b-embeddings_connector_distill_bf16.safetensors?download=true
Place in models/loras
Description
Initial Release