V2.6 LTX-2.3 DEV & Distilled Video with Audio + LTX Director workflow
Minor update to LTX2.3 Image to Video with Ollama workflow:
sampling Preview implemented
NAG to allow negative prompt for Distilled model (CFG=1)
Included a Workflow supporting LTX Director node, which is awesome, it supports:
First, Mid, Last or whatever frame, basically any frame is a keyframe
Prompt Relay which allows to generate frames prior to Input Image
more precise editing: What shall happen when in the clip?
Audio Import to create lipsynced clips
Image or Text to Video, even both in one process
more Info: https://github.com/WhatDreamsCost/WhatDreamsCost-ComfyUI
recommend to watch the creatorΒ΄s youtube video in above link for how to.
=> added a longer example video at the end.
V2.5 LTX-2.3 DEV & Distilled Video with Audio
Image to Video and a Text to Video workflow, both can use own Prompts or Ollama generated/enhanced prompts.
works with latest LTX 2.3 Distilled model (8 steps, CFG=1) or Dev model (20 steps, CFG=3)
Updated the processing for DISTILLED and DEV model, select the DIST or DEV model in loader node and switch to dedicated DIST or DEV processing pipeline, so each model has its own processing.
DIST model pipeline: Standard Guider and Basic Scheduler, follows the manual sigmas issued by Lightricks
DEV model pipeline: MultiModal Guider and LTX Scheduler + Distilled Lora on latent upscaler
Included a workflow version with "RTX Video Super Resolution" node, which upscales videos in highspeed.
Tip: With latest Comfy and LTX updates, the processing got faster for me, so I can increase the scale_by in sampler node from 0.5 to 0.6 or higher to have crisper videos with minor impact on render time.
V2.3 LTX-2.3 DEV & Distilled Video with Audio
Downloads for LTX 2.3:
update : April 14th 2026 : Lightricks has updated their LTX 2.3 distilled model to 1.1 (and Lora):
Model (1.1 fp8 _scaled by Kijai): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/diffusion_models
dist. Lora 1.1 : https://huggingface.co/Lightricks/LTX-2.3/tree/main
LTX-2.3 Distilled & Dev Models (fp8_scaled): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/diffusion_models
Textencoder1: (fp8_e4m3fn, same as LTX-2): https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn/tree/main
Textencoder2: (projection_bf16): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/text_encoders
Video & Audio Vae: https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/vae
Preview VAE (taeltx2_3): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/vae
Loras:
Spartial upscaler (x2-1.1): https://huggingface.co/Lightricks/LTX-2.3/tree/main
Distilled Lora for upscaler (lora.384): https://huggingface.co/Lightricks/LTX-2.3/tree/main
Smaller, alternative Desitilled Lora by Kijai: https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/loras
Detailer Lora (same as LTX-2): https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/tree/main
Ollama Model (prompt only, fast): https://ollama.com/mirage335/Llama-3-NeuralDaredevil-8B-abliterated-virtuoso
alternative model with Vision (reads input image+prompt, slower): https://ollama.com/huihui_ai/qwen3-vl-abliterated
other model with Vision (great for I2V): https://ollama.com/huihui_ai/qwen3.5-abliterated
smaller LTX 2.3 GGUF Dev or Dist. models work as well. (replace Checkpoint loader node with Unet loader node from this custom node: https://github.com/city96/ComfyUI-GGUF ):
models: https://huggingface.co/unsloth/LTX-2.3-GGUF/tree/main
save to models/unet/
V1.5 LTX-2 DEV Video with Audio including latest π π £π § Multimodal Guider
Image to Video and a Text to Video workflow, both can use own Prompts or Ollama generated/enhanced prompts.
Replaced the Guider node with latest Multimodal Guider node, see more details in WF notes or here: https://ltx.io/model/model-blog/ltx-2-better-control-for-real-workflows Before we had 1 CFG parameter for audio and video. With multimodal guider, we now can tweak audio and video seperately with even more parameters...
added a Power Lora Loader node to inject further Loras
use Image to Video Adapter Lora to improve motion for I2V: https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/tree/main
replaced a node to no longer require comfymath custom nodes
V1.0 LTX-2 DEV Video with Audio:
Image to Video and a Text to Video workflow with own Prompts or Ollama generated/enhanced prompts.
setup for the LTX2 Dev model.
uses Detailer Lora for better quality and LTX tiled VAE to avoid OOM and visual grids
2 pass rendering (motion+upscale). Upscale process uses distilled and spatial upscale Lora
setup with latest LTXVNormalizingSampler to increase video & audio quality.
Text to Video can use dynamic prompts with wildcards.
Download LTX-2 Files: (Workflow V1.0 and V1.5 only)
Find Model/Lora Loader nodes within Sampler Subgraph node.
- LTX2 Dev Model (dev_Fp8): https://huggingface.co/Lightricks/LTX-2/tree/main
- Detailer Lora: https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/tree/main
- Distilled (lora-384) & Spatial upscaler Lora: https://huggingface.co/Lightricks/LTX-2/tree/main
- VAE (already included in above dev_FP8 model, but needed if you go for GGUF models): https://huggingface.co/Lightricks/LTX-2/tree/main/vae
- Textencoder (fp8_e4m3fn): https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn/tree/main
- Image to Video Adapter Lora (more motion with I2V): https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/tree/main
Save Location:
π ComfyUI/
βββ π models/
β βββ π checkpoints/
β β βββ ltx-2-19b-dev-fp8.safetensors
β βββ π text_encoders/
β β βββ gemma_3_12B_it_fp8_e4m3fn.safetensors
β βββ π loras/
β β βββ ltx-2-19b-distilled-lora-384.safetensors
β βββ π latent_upscale_models/
β βββ ltx-2-spatial-upscaler-x2-1.0.safetensors
β βββ π Clip/
β βββ ltx-2.3_text_projection_bf16.safetensors
Custom Nodes used:
https://github.com/Comfy-Org/Nvidia_RTX_Nodes_ComfyUI (RTX VSR Version)
Text 2 Video only: https://github.com/KoinnAI/ComfyUI-DynPromptSimplified
LTX Director only: https://github.com/WhatDreamsCost/WhatDreamsCost-ComfyUI
Ollama help:
Install Ollama from https://ollama.com/
download a model: Go to a model page, chose a model , then hit the copy button, i.e. https://ollama.com/huihui_ai/qwen3-vl-abliterated
open terminal and paste the model name, i.e.: ollama run huihui_ai/qwen3-vl-abliterated
model will be downloaded and can be selected in green comfy node "Ollama Connectivity". Hit "Reconnect" to refresh.
Example longer Video
Description
LTX2 DEV Image to Video and Text to Video
(including Multimodal guider and LTXV Normalizing Sampler)
FAQ
Comments (18)
plz add custom audio input workflow also π’
Hi, will check, but cant promise soon...
Haven't tried LTX 2 because trying new video models are always such a hassle. Wonder if it's any better than Wan 2. Regardless, thank you for the work! :)
Hey, I would not say it is better or worse than Wan, depends on the use case. End of the day both are just fun tools yet. I like LTX for longer consistent clips, T2V and audio, tho its kind of bad in many cases and for memes. Wan is still great and better in other use cases.
Thank you by the way for the buzz :)
Took me less time to get ltx-2 up and running than wan. The video2video is kinda amazing, take 2-4 seconds of your favorite movie and extend it by 20+ seconds, convincingly if you get the settings right
love this workflow! thanks for adding the power lora loader!
This workflow is amazing. I do have one question, I see in the DynPrompt to Ollama there is a path D:\SD_webui_Forge_latest\stable-diffusion-webui-forge\extensions\sd-dynamic-prompts\wildcards. I found and did a git clone on sd-dynamic-prompts but the wildcards folder does not exist?
Hi, that is the location on my local pc. You can create a wildcard folder anywhere and place the path in the node.
@tremolo28Β cool! so what should go in the wildcards folder?
@plepkitty721Β Text files(.txt) that contain wildcards, you then can trigger in you prompt. The repo supposed to give more info on how to create and trigger those wildcards.
https://github.com/KoinnAI/ComfyUI-DynPromptSimplified
You can search civitAi for wildcards, here an example: https://civitai.com/models/242487/wildcards-face-expressions
you can set your wildcards folder to be any location in the node, here is a better article explaining how the node works https://civitai.com/articles/25924
Hey. WHy i have error LTXVAudioVAELoader
Model in folder 'checkpoints' with filename 'C' not found. and cant choose my audio vae ? is grey color.
https://ibb.co/xqd5cLMf
Hi , reload browser to fix the C. Vae is taken from the model, as it is included in the fp8 model
@tremolo28 I did it this way: I selected the model, but it doesnβt help at all. I ran UPDATE ALL. Nothing helps. I donβt know whatβs going on.
https://ibb.co/Ld9Gt0FNΒ
https://ibb.co/9HWfPGP9
@CyberAImaniaΒ looks alright. check if LTXV Empty Latent Audio node in Latent group is connected with audio_vae, frame_number and frame_rate.
Great flow build but holy shit. I have a 5090 and this uses 30 more GB of RAM than the 5090 has. Are you running this on an h100? or am i fucking up
nevermind, i see the instruct to reserve vram (something i never do because it usually doesn't need to be done). it does work. excellent results.
@bonyponyΒ Is it this instruction that worked for you?:
--lowvram --cache-none --reserve-vram 6 --preview-method none
