LTX-2.3 DEV/DIST - IMAGE to Video and TEXT to Video with Ollama/RTX VSR/LTX-Director

LTX-2.3 DEV/DIST - IMAGE to Video and TEXT to Video with Ollama/RTX VSR/LTX-Director - v1.5 LTX-2

NSFW

V2.6 LTX-2.3 DEV & Distilled Video with Audio + LTX Director workflow

Minor update to the LTX 2.3 Image to Video with Ollama workflow:

Sampling preview implemented
NAG added to allow a negative prompt with the Distilled model (CFG=1)

Included a workflow supporting the LTX Director node, which is awesome. It supports:

First, mid, last or any other frame: basically any frame can be a keyframe
Prompt Relay, which allows generating frames prior to the input image
More precise editing: what shall happen when in the clip?
Audio import to create lip-synced clips
Image or Text to Video, even both in one process

More info: https://github.com/WhatDreamsCost/WhatDreamsCost-ComfyUI

I recommend watching the creator's YouTube video at the link above for a how-to.

=> Added a longer example video at the end.

V2.5 LTX-2.3 DEV & Distilled Video with Audio

Image to Video and Text to Video workflows; both can use your own prompts or Ollama-generated/enhanced prompts.

Works with the latest LTX 2.3 Distilled model (8 steps, CFG=1) or Dev model (20 steps, CFG=3)
Updated the processing for the DISTILLED and DEV models: select the DIST or DEV model in the loader node and switch to the dedicated DIST or DEV processing pipeline, so each model has its own processing.
- DIST model pipeline: Standard Guider and Basic Scheduler, follows the manual sigmas issued by Lightricks
- DEV model pipeline: MultiModal Guider and LTX Scheduler + Distilled Lora on latent upscaler
Included a workflow version with the "RTX Video Super Resolution" node, which upscales videos at high speed.
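
To keep the two pipelines apart at a glance, the settings listed above can be written down as a small lookup. This is only an illustrative sketch, not part of the workflow: the `PipelineSettings` class, the `settings_for` helper and the idea of detecting the model type from the word "distilled" in the file name are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PipelineSettings:
    steps: int
    cfg: float
    guider: str
    scheduler: str


# Values taken from the notes above; the names are illustrative only.
PIPELINES = {
    "DIST": PipelineSettings(
        steps=8, cfg=1.0,
        guider="Standard Guider",
        scheduler="Basic Scheduler (manual sigmas)",
    ),
    "DEV": PipelineSettings(
        steps=20, cfg=3.0,
        guider="MultiModal Guider",
        scheduler="LTX Scheduler",
    ),
}


def settings_for(model_filename: str) -> PipelineSettings:
    """Pick the pipeline by file name (assumption: distilled files contain 'distilled')."""
    key = "DIST" if "distilled" in model_filename.lower() else "DEV"
    return PIPELINES[key]
```

In the workflow itself the switch is done by hand in the loader node; the snippet only summarizes which values belong together.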

Tip: With the latest Comfy and LTX updates, processing got faster for me, so I can increase scale_by in the sampler node from 0.5 to 0.6 or higher for crisper videos with only a minor impact on render time.

V2.3 LTX-2.3 DEV & Distilled Video with Audio

Downloads for LTX 2.3:

Update, April 14th 2026: Lightricks has updated their LTX 2.3 distilled model (and Lora) to 1.1:

Model (1.1 fp8_scaled by Kijai): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/diffusion_models
Dist. Lora 1.1: https://huggingface.co/Lightricks/LTX-2.3/tree/main

LTX-2.3 Distilled & Dev Models (fp8_scaled): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/diffusion_models
Textencoder1: (fp8_e4m3fn, same as LTX-2): https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn/tree/main
Textencoder2: (projection_bf16): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/text_encoders
Video & Audio Vae: https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/vae
Preview VAE (taeltx2_3): https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/vae
Loras:
- Spatial upscaler (x2-1.1): https://huggingface.co/Lightricks/LTX-2.3/tree/main
- Distilled Lora for upscaler (lora.384): https://huggingface.co/Lightricks/LTX-2.3/tree/main
  - Smaller, alternative Distilled Lora by Kijai: https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/loras
- Detailer Lora (same as LTX-2): https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/tree/main
Ollama Model (prompt only, fast): https://ollama.com/mirage335/Llama-3-NeuralDaredevil-8B-abliterated-virtuoso
- alternative model with Vision (reads input image+prompt, slower): https://ollama.com/huihui_ai/qwen3-vl-abliterated
- other model with Vision (great for I2V): https://ollama.com/huihui_ai/qwen3.5-abliterated

Smaller LTX 2.3 GGUF Dev or Dist. models work as well (replace the Checkpoint loader node with the Unet loader node from this custom node: https://github.com/city96/ComfyUI-GGUF):

Models: https://huggingface.co/unsloth/LTX-2.3-GGUF/tree/main
Save to models/unet/

V1.5 LTX-2 DEV Video with Audio including latest 🅛🅣🅧 Multimodal Guider

Image to Video and Text to Video workflows; both can use your own prompts or Ollama-generated/enhanced prompts.

Replaced the Guider node with the latest Multimodal Guider node; see more details in the WF notes or here: https://ltx.io/model/model-blog/ltx-2-better-control-for-real-workflows
Before, we had one CFG parameter for both audio and video. With the Multimodal Guider, we can now tweak audio and video separately, with even more parameters...

added a Power Lora Loader node to inject further Loras
use Image to Video Adapter Lora to improve motion for I2V: https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/tree/main
replaced a node to no longer require comfymath custom nodes

V1.0 LTX-2 DEV Video with Audio:

Image to Video and a Text to Video workflow with own Prompts or Ollama generated/enhanced prompts.

setup for the LTX2 Dev model.
uses Detailer Lora for better quality and LTX tiled VAE to avoid OOM and visual grids
2 pass rendering (motion+upscale). Upscale process uses distilled and spatial upscale Lora
setup with latest LTXVNormalizingSampler to increase video & audio quality.
Text to Video can use dynamic prompts with wildcards.
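
For readers new to wildcards, here is a rough sketch of what wildcard expansion does. It assumes the `__name__` token syntax and `#` comment lines from the sd-dynamic-prompts convention, where each token is replaced by a random line from `name.txt` in the wildcard folder. The node used in this workflow may behave differently, and `expand_wildcards` is a hypothetical helper written for this example, not the node's code.

```python
import random
import re
from pathlib import Path

# Tokens look like __name__ (assumed syntax, see lead-in).
WILDCARD = re.compile(r"__([A-Za-z0-9_-]+?)__")


def expand_wildcards(prompt, wildcard_dir, rng=None):
    """Replace each __name__ token with a random line from <wildcard_dir>/name.txt."""
    rng = rng or random.Random()
    folder = Path(wildcard_dir)

    def pick(match):
        path = folder / f"{match.group(1)}.txt"
        if not path.is_file():
            return match.group(0)  # unknown wildcard: leave the token untouched
        options = [
            line.strip()
            for line in path.read_text(encoding="utf-8").splitlines()
            if line.strip() and not line.lstrip().startswith("#")
        ]
        return rng.choice(options) if options else match.group(0)

    return WILDCARD.sub(pick, prompt)
```

With a file `mood.txt` containing the lines `happy` and `sad`, the prompt `a __mood__ woman` would come out as either `a happy woman` or `a sad woman`.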

Download LTX-2 Files: (Workflow V1.0 and V1.5 only)

The Model/Lora Loader nodes are inside the Sampler Subgraph node.

- LTX2 Dev Model (dev_Fp8): https://huggingface.co/Lightricks/LTX-2/tree/main

- Detailer Lora: https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Detailer/tree/main

- Distilled (lora-384) & Spatial upscaler Lora: https://huggingface.co/Lightricks/LTX-2/tree/main

- VAE (already included in above dev_FP8 model, but needed if you go for GGUF models): https://huggingface.co/Lightricks/LTX-2/tree/main/vae

- Textencoder (fp8_e4m3fn): https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn/tree/main

- Image to Video Adapter Lora (more motion with I2V): https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/tree/main

Save Location:

📂 ComfyUI/
└── 📂 models/
    ├── 📂 checkpoints/
    │   └── ltx-2-19b-dev-fp8.safetensors
    ├── 📂 text_encoders/
    │   └── gemma_3_12B_it_fp8_e4m3fn.safetensors
    ├── 📂 loras/
    │   └── ltx-2-19b-distilled-lora-384.safetensors
    ├── 📂 latent_upscale_models/
    │   └── ltx-2-spatial-upscaler-x2-1.0.safetensors
    └── 📂 Clip/
        └── ltx-2.3_text_projection_bf16.safetensors
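
A quick way to check that everything landed in the right place is a small script like the one below. It is a sketch under assumptions: `missing_files` is a hypothetical helper, the file list simply mirrors the tree above, and placing the `Clip` folder directly under `models/` is a guess based on that tree.

```python
from pathlib import Path

# Relative paths copied from the save-location tree above.
# Assumption: the Clip folder sits directly under models/.
EXPECTED_FILES = [
    "models/checkpoints/ltx-2-19b-dev-fp8.safetensors",
    "models/text_encoders/gemma_3_12B_it_fp8_e4m3fn.safetensors",
    "models/loras/ltx-2-19b-distilled-lora-384.safetensors",
    "models/latent_upscale_models/ltx-2-spatial-upscaler-x2-1.0.safetensors",
    "models/Clip/ltx-2.3_text_projection_bf16.safetensors",
]


def missing_files(comfy_root):
    """Return the expected files that are not present under the ComfyUI folder."""
    root = Path(comfy_root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]
```

Calling `missing_files("ComfyUI")` from the folder that contains your install returns an empty list when all files are in place. Adjust the list if you use the GGUF or LTX 2.3 files instead.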

Custom Nodes used:

Ollama help:

Install Ollama from https://ollama.com/
Download a model: go to a model page, choose a model, then hit the copy button, e.g. https://ollama.com/huihui_ai/qwen3-vl-abliterated
Open a terminal and paste the copied command, e.g.: ollama run huihui_ai/qwen3-vl-abliterated
The model will be downloaded and can then be selected in the green Comfy node "Ollama Connectivity". Hit "Reconnect" to refresh.
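
If a model does not show up in the node, it can help to check what the local Ollama server actually has installed. The sketch below queries Ollama's `/api/tags` endpoint on its default address `http://localhost:11434`. The helper names `model_names` and `list_local_models` are made up for this example, and it assumes Ollama is running on the same machine with the default port.

```python
import json
from urllib.request import urlopen

OLLAMA_URL = "http://localhost:11434"  # Ollama's default local address


def model_names(tags_payload):
    """Extract sorted model names from an /api/tags response body."""
    return sorted(m["name"] for m in tags_payload.get("models", []))


def list_local_models(base_url=OLLAMA_URL, timeout=5.0):
    """Ask the local Ollama server which models are installed."""
    with urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
        return model_names(json.load(resp))
```

After the `ollama run` step above, `list_local_models()` should include an entry for the model you downloaded. If the call fails with a connection error, Ollama is probably not running.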

Example longer Video

Description

LTX2 DEV Image to Video and Text to Video

(including Multimodal guider and LTXV Normalizing Sampler)

Comments (18)

markhassain3712 · Jan 30, 2026 · 1 reaction

plz add custom audio input workflow also 😢

tremolo28

Author

Jan 30, 2026· 2 reactions

Hi, will check, but cant promise soon...

LONGERSTAR · Jan 30, 2026 · 1 reaction

Haven't tried LTX 2 because trying new video models are always such a hassle. Wonder if it's any better than Wan 2. Regardless, thank you for the work! :)

tremolo28

Author

Jan 30, 2026· 2 reactions

Hey, I would not say it is better or worse than Wan, depends on the use case. End of the day both are just fun tools yet. I like LTX for longer consistent clips, T2V and audio, tho its kind of bad in many cases and for memes. Wan is still great and better in other use cases.

Thank you by the way for the buzz :)

firecrocs · Feb 3, 2026 · 2 reactions

Took me less time to get ltx-2 up and running than wan. The video2video is kinda amazing, take 2-4 seconds of your favorite movie and extend it by 20+ seconds, convincingly if you get the settings right

iimacgyverii221 · Jan 30, 2026 · 2 reactions

love this workflow! thanks for adding the power lora loader!

plepkitty721 · Jan 31, 2026 · 1 reaction

This workflow is amazing. I do have one question, I see in the DynPrompt to Ollama there is a path D:\SD_webui_Forge_latest\stable-diffusion-webui-forge\extensions\sd-dynamic-prompts\wildcards. I found and did a git clone on sd-dynamic-prompts but the wildcards folder does not exist?

tremolo28

Author

Jan 31, 2026· 1 reaction

Hi, that is the location on my local pc. You can create a wildcard folder anywhere and place the path in the node.

plepkitty721 · Jan 31, 2026 · 1 reaction

@tremolo28 cool! so what should go in the wildcards folder?

tremolo28

Author

Jan 31, 2026· 1 reaction

@plepkitty721 Text files (.txt) that contain wildcards, which you can then trigger in your prompt. The repo is supposed to give more info on how to create and trigger those wildcards.

https://github.com/KoinnAI/ComfyUI-DynPromptSimplified

You can search civitAi for wildcards, here an example: https://civitai.com/models/242487/wildcards-face-expressions

KoinnAI · Feb 8, 2026 · 1 reaction

you can set your wildcards folder to be any location in the node, here is a better article explaining how the node works https://civitai.com/articles/25924

CyberAImania · Feb 2, 2026 · 1 reaction

Hey. WHy i have error LTXVAudioVAELoader

Model in folder 'checkpoints' with filename 'C' not found. and cant choose my audio vae ? is grey color.

https://ibb.co/xqd5cLMf

tremolo28

Author

Feb 2, 2026· 1 reaction

Hi, reload the browser to fix the "C". The VAE is taken from the model, as it is included in the fp8 model.

CyberAImania · Feb 2, 2026 · 1 reaction

@tremolo28 I did it this way: I selected the model, but it doesn’t help at all. I ran UPDATE ALL. Nothing helps. I don’t know what’s going on.

https://ibb.co/Ld9Gt0FN

https://ibb.co/9HWfPGP9

tremolo28

Author

Feb 2, 2026· 1 reaction

@CyberAImania looks alright. check if LTXV Empty Latent Audio node in Latent group is connected with audio_vae, frame_number and frame_rate.

bonypony · Feb 4, 2026 · 1 reaction

Great flow build but holy shit. I have a 5090 and this uses 30 more GB of RAM than the 5090 has. Are you running this on an h100? or am i fucking up

bonypony · Feb 4, 2026 · 2 reactions

nevermind, i see the instruct to reserve vram (something i never do because it usually doesn't need to be done). it does work. excellent results.

Dasbestos · Mar 12, 2026

@bonypony Is it this instruction that worked for you?:
--lowvram --cache-none --reserve-vram 6 --preview-method none
