Director v1.0 - I have introduced the "Ltx Director v2.0" node and removed nodes with redundant functionality. I also updated the input area to allow adjustment of the Basic Scheduler step count and the target resolution following the final upscaling, and Distilled LoRA weights. Additionally, I added nodes for configuring Ic LoRA settings.
FMLF v1.2 - The model area configuration has been updated. You can now easily adjust the Basic Scheduler step value, the target resolution after upscaling, and Distilled LoRA weights directly within the input area.
FMLF v1.1 - Moved the settings for frame rate and length (seconds) loser to the Prompt-Relay node to improve usability.Made it easy to apply frame rate changes to the Ltx Sequencer.Consolidated the Power LoRA Loader nodes into a single node, as the downsides outweighed the benefits.Made it easier to adjust the volume of the generated audio.Switched the VAE Decode in the upscaling path to the Tiled version.
FMLF V1.0 - This workflow incorporates two nodes by WhatDreamCost, "LTX Sequencer" and "Multi Image Loader," as well as Kijai's "Prompt-Relay" node.
Please refer to WhatDreamsCost's GitHub for information on how to use the "Ltx Director v2.0" node.
【WhatDreamsCost-ComfyUI】
https://github.com/WhatDreamsCost/WhatDreamsCost-ComfyUI
【Prompt-Relay】
https://github.com/kijai/ComfyUI-PromptRelay

This workflow is not suitable for beginners.
"Ltx Director," provided by WhatDreamCost, allows you to place multiple "anchor images" at specific points along the video timeline and assign corresponding prompts to each. This capability makes it easier to control the video's content by applying the necessary images and prompts exactly when desired. Furthermore, when featuring the same character, using their image as an anchor helps maintain facial consistency.
However, the key difference from standard workflows is the requirement to prepare multiple images in advance; consequently, this workflow may not be suitable for beginners. Personally, I primarily use "Qwen-Image-Edit 2511" to edit image content and orientation, while using image processing software to handle cropping and the preparation of multiple images. Although it requires some effort, once the necessary images are ready, you can create videos that faithfully reflect your creative vision.
Additionally, the "FMLF" version is a workflow that isolates the "First," "Middle," and "Last Frame" functions of Ltx Director into three separate nodes. If you prefer a more streamlined feature set to achieve greater flexibility in video control, you might find this version easier to work with.
I only recently began using this workflow in earnest, but I decided to share it now in the hope that it contributes to the further expansion of the local generation environment for Ltx2.3.
●The safetensors model required for this workflow:
【Text Encoder】
ComfyUI/models/text_encoders
gemma_3_12B_it_fp8_scaled.safetensors (Some people also use fp4.)
https://huggingface.co/Comfy-Org/ltx-2/tree/main/split_files/text_encoders
and
ltx_text_projection_bf16.safetensors
https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/text_encoders
【VAE】
ComfyUI/models/vae
LTX23_audio_vae_bf16.safetensors
LTX23_video_vae_bf16.safetensors
(If using checkpoints that include VAE, this is unnecessary.)
taeltx2_3.safetensors
https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/vae
【Upscaler】(Be sure to select "1.1". "1.0" has a bug.)
Ltx-2.3-spatial-upscaler-x2-1.1.safetensors
https://huggingface.co/Lightricks/LTX-2.3/tree/main
●Recommended safetensors models for this workflow:
【Checkpoint】
ComfyUI/models/diffusion models
ltx2310eros_v12.safetensors
https://civarchive.com/models/2447875/ltx23-10eros?modelVersionId=3043807
or
ComfyUI/models/checkpoint
10Eros_v1.2-fp8mixed_learned.safetensors (include VAE)
https://huggingface.co/TenStrip/LTX2.3-10Eros/tree/main
【Distilled Lora】
Ltx-2.3-22b-distilled-lora-1.1_fro90_ceil72_condsafe.safetensors
https://huggingface.co/TenStrip/LTX2.3_Distilled_Lora_1.1_Experiments/tree/main
【OmniNFT Lora】
LTX-2.3-OmniNFT-RL-Lora_bf16.safetensors
https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/loras
●As we have received many inquiries, I am sharing the models and workflows used in Qwen Image Edit 2511 for your reference.
【Recommended Checkpoint】
Qwen-Rapid-AIO-NSFW-v19.safetensors
https://huggingface.co/Phr00t/Qwen-Image-Edit-Rapid-AIO/tree/main/v19
【Recommended Workflow】
https://civarchive.com/models/2167203/nsfw-qwen-image-edit-rapid-aio?modelVersionId=2440501
【Multi-angle LoRA model】
(It is already included in 2511, but using LoRA is expected to further enhance the effect.)
qwen-image-edit-2511-multiple-angles-lora.safetensors
https://huggingface.co/fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA/tree/main
【Multi-angle Workflow】
image_qwen_image_edit_2511_multiangle_camera.json
https://github.com/jtydhr88/ComfyUI-qwenmultiangle/tree/master/workflow
● Points to Note:
・ Even when using fp8 models, the total size of the required files exceeds 40GB. Consequently, unless your system has at least 64GB of RAM, you are highly likely to encounter Out-of-Memory (OOM) errors. Since this workflow supports GGUF models, you might want to consider using them instead.
・ When inputting multiple images, you need to adjust the "Guide Strength" value at the bottom right of the node for each segment. Although this setting is small and easily overlooked, it is crucial; please take the time to specify it for every segment. Note that a lower Strength value for an applied image allows for more movement during the video, though this comes at the cost of consistency regarding the anchor image. Therefore, it is best to fine-tune this setting based on the specific video you wish to create.
・ Ltx2.3 Distilled LoRAs are notoriously difficult to work with—even more so than those for Wan2.2. While many models have been released, it is hard to determine which one is best. You will need to experiment with various combinations to find what works with your chosen base model; the same applies to weight settings.
・ Although IC-LoRA functionality is included, I have not used it extensively. I did run generations using the models "ltx23_edit_anything_global_rank128_v1_9000steps_adamw.safetensors" and "ltx-2.3-22b-ic-lora-union-control-ref0.5.safetensors" to verify operation; while I cannot speak to the output quality, I confirmed that the IC-LoRA function was indeed applied. Please understand that I cannot answer questions unrelated to IC-LoRA workflows. The tool is highly feature-rich, so please refer to the creator's GitHub page for full details.
・ The videos I have generated so far resemble those created with Wan; however, if you master "Ltx Director," you should be able to produce videos that are completely different from what you have created in the past. So, let’s break free from preconceptions and use our imaginations to create new videos—myself included!
・ Just to clarify, I am not an expert in AI generation. Everything I’ve shared here is the result of trial and error, based on actual testing and research into the vast amount of information available in the community. Consequently, I cannot answer highly technical questions; my knowledge is strictly limited to my own personal experience. If any issues arise, I would greatly appreciate it if you could handle the troubleshooting yourself.
Description
V1.0 FMLF2V - This workflow incorporates two nodes by WhatDreamCost, "LTX Sequencer" and "Multi Image Loader," as well as Kijai's "Prompt-Relay" node.
While it lacks centralized operation and requires somewhat complex steps, we believe it offers sufficient functionality for a first, middle, and last frame workflow.
FAQ
Comments (5)
Thanks for the workflow. But the upscale output is giving 9:16. I have changed the main width and height to 1080p. The first pass is good, but only in upscale this issue is happening.
Im having a slight problem with the workflow. The output creates a image, a video with no sound and a video with sound. The video with no sound for a clip of 14 secs outputs for 4 minutes and 40 secs and the size usually goes for 240 to 280 mb.
In this workflow, those three outputs are normal.