LTX-2.3 All-In-One workflow for RTX 3060 with 12 GB VRAM + 32 GB RAM

LTX-2.3 All-In-One workflow for RTX 3060 with 12 GB VRAM + 32 GB RAM - v4.0

NSFW

[edit:

13.05.2026: Update version 4.4 (see version description).

Small fixes to get back fast generations.

Attention:

If you struggle with node conflicts or you get errors while running the workflow, please have a look at my short Trouble Shooting Guide note in the wokflow first. Most importent is to update all components sucsessfully! ]

Special thanks to:

@ArcleinSK for investigation and solving the FLF issue, as well as forcing the First-Mid-Last Frame option and last but not least for charing fantastic knowlage.

@boinobin730 for initialising, forcing and supporting this project in all kinds of matter, like providing links, running tests, sharing knowlage and inspiring diskussions.

@Urabewe for publishing the original, perfectly running 12 GB VRAM LTX-2.3 workflows mainly used here in this workflow.

Features:

Simple to use all-In-One LTX-2 workflow with options for:

Text to Video
Image to Video
First/Last Frame to Video
Fisrt/Mid/Last Frame to Video
Video to Video
Text + Audio to Video
Image + Audio to Video
First/Last Frame + Audio to Video
First/Mid/Last Frame + Audio to Video

easy switching between all options,
all steps highly automated: no manual frame or width/hight calculations necessary,
easy to set inputs by predefined sliders and aspeckt ratio inputs (no risk to set wrong frame counts or wrong width/hight values),
completely automated resizing and cropping (if necessary) of your input images/videos.
brilliant audio generation (speech/sound) with LTX-2.3.

LTX-2.3 specifications:

Workflow version v4.3 consistently follows the LTX-2.3 specifications for 16:9/9:16 aspect ratios, including automatic width/hight calculations, as well as automatic input image/video resizing/cropping.

In addition you can simply choose now any other aspect ratios according to your needs while still getting the right values calculated for width/hight and automatic image/video resize/crop.

Requirements:

GPU with 12 GB VRAM (some users reported they got it running with 8 GB too),
32 GB VRAM,
Swap file size: 64 - 128 GB.

Speed and video length:

Runs very fast: 5 second (1280 x 864) Video: < 10 minutes.

Generation of long high quality videos in one run possible: 10 - 20 seconds without any issues,

Testrun: 30 second video (1024 x 704) tooks around 40 minutes without any OOM errors. Longer videos might be possible, but not tested yet.

Important:

This workflow is intended for advanced comfyui users who know how to install and operate the system and are able to resolve basic system errors themselves, like as node conflicts, or general system issues.

About this workflow:

This workflow is mainly based on the fantastic LTX-2.3 workflows of @Urabewe.

As far as I know, those were the first workflows running LTX-2 with 12 GB VRAM. All credits goes to the original creator.

My job was only to combine and organise the different workflows in a simple to use all-in-one design.

Description

Added First-Mid-Last Frame to Video option.
Fixed recent Fisrt/Last Frame issues.
Completely re-designed input "interface" to easy choose the best LTX-2.3 aspect ratios or other standard aspect ratios as well as any custom aspect ratios or automatic aspect ratio calculations according to your input images or videos.
Completely re-designed Widh/Hight calculation under the hood. All values are strictly devisible by 32 for any aspect ratios now.

I did a lot of pre-tests. All options should work as intended.

As allways: please let me know if you find any "bugs" - and of course, let me know if you have any ideas to improve this workflow.

FAQ

Comments (164)

ArcleinSKApr 7, 2026· 3 reactions

CivitAI

@arkinson 4.0 Workflow looks clean. A couple of things I noticed though:
1) Inside the aspect ratio subgraph, you may want to promote the "aspect_ratio" widget on the LayerUtility node. It's fixed to 16:9 and will crop anything to fit that ratio. Promoting it will allow the user to select the aspect ratio they prefer on the outside of the subgraph in the main workflow page.

2) Having the second AddGuideMulti for the [[P:Mid Frame input]] seems to get ignored in the processing subgraph. It will just use the AddGuideMulti for the [[P: Last Frame input]] which ignores the middle frame input. I think it would need some sort of conditional mute to disable the [[P:Last Frame input]] only if the Last AND middle options are enabled.

I don't know if there's a lot of use for a middle frame in the community right now and it does tend to increase generation times and RAM requirements the more keyframes are added. I've tried up to 4 so far and my generation times have gone up from 175s/it up to 700+s/it on the second pass with 4 keyframes.

Might be worth considering disabling that middle frame option for now until there's a better way to implement it. But it definitely seems to be fixed in terms of that weird image flip at the end and the addition of an adjustable aspect ratio seems to address most people's concerns with the last one.

ArcleinSKApr 7, 2026

Just browsing some documentation it seems the orchestrator node appears to have some settings:
ComfyUI_Custom_Switch/README.md at main · tritant/ComfyUI_Custom_Switch · GitHub

Might need to restructure the groups to take advantage of it, but you can enable "Exclusive" node mode in the muter which only allows one toggle at a time to be active.
Looks possible to create a new switch group with two options. One for "first and last" and one for "first last and middle" That way if one is enabled, the other will be muted accordingly.

13.05.2026: Update version 4.4 (see version description).

Attention:

Features:

LTX-2.3 specifications:

Requirements:

Speed and video length:

Important:

About this workflow:

Description

FAQ

What is LTX-2.3 All-In-One workflow for RTX 3060 with 12 GB VRAM + 32 GB RAM?

What files are available and where can I download them?

Comments (164)

Details

Files

ltx23AllInOneWorkflowForRTX_v40.zip

Mirrors