LTX-2 Image Audio to Video - CivArchive (CivitAI Archive)

I'm was getting this error at the SamplerCustomAdvanced node.

The size of tensor a (63240) must match the size of tensor b (8126848) at non-singleton dimension 2

Remove "comfyui_smznodes" from custom nodes to fix it.

HorridWitchMar 6, 2026

CivitAI

will this run with 4090 , 32 ram?

PixelMuseAI

Author

Mar 6, 2026· 2 reactions

it should run. you can try with a lower frame count, like a 1 sec audio sample. then watch your ram usage on task manager.

for reference, i am on a 4060Ti with 64GB DDR4 and it took 12mins to generate a 7sec video @ 24fps, 1920 x 1088 resolution.

Rokit8Mar 6, 2026· 1 reaction

CivitAI

Works great thank you! Averaging 560 seconds per 20 second vid on 3090/64.

luigibarb173Mar 10, 2026

CivitAI

"I add the image and the audio, but when I generate the video there is no lip sync. The video is generated and the voice plays in the background, but the character is not speaking."

PixelMuseAI

Author

Mar 13, 2026

try changing the audio file to stereo, as suggested by thomasdimitri563

this happened for me when the voice starts right at the start of the clip. try to give about 0.2s of silence before the speech.

does your audio file have a lot of background noise? if yes, you can try to isolate the voice by using https://github.com/kijai/ComfyUI-MelBandRoFormer

NovellusMar 11, 2026

CivitAI

There is nothing in the output video. It's just black. I can however hear the audio. I have all the correct files downloaded and I'm running this on a 4090.

PixelMuseAI

Author

Mar 13, 2026· 1 reaction

please ensure that comfyui and your custom nodes are updated.

thomasdimitri563Mar 12, 2026· 2 reactions

CivitAI

I also had no lip sync and I fixed it by changing my audio file from mono to stereo (2 channel). I used ffmpeg to make change my mono audio into stereo audio.

zexeorMar 20, 2026

CivitAI

I can't seem to make the lipsync work. I even changed the audio to stereo, prompted the exact words the character should say, changed the video length to match the audio length, etc. Any help, please?

zexeorMar 20, 2026

Managed to solve it. Elevenlabs audio comes too clean, for whatever reason, adding some background noise makes lip sync work.

PixelMuseAI

Author

Mar 21, 2026

@zexeor thanks for your suggestions. I know people are having trouble with getting the lip sync to work. But no one is telling me what their source sound files are. I've been using audio from videos so I haven't experienced the issues users are experiencing. Let me test with some local TTS.

Ponder_StibbonsMar 28, 2026

CivitAI

Beautiful, right out of the box. Hardly had to change a thing. Well done.

Ponder_StibbonsMar 29, 2026

CivitAI

This is blazing fast. And it made me realize I don't need an upscale stage with LTX. 10 seconds of 24fps 720 is nothing, absolutely nothing...done in 1:30 and with resources to spare. Paired with TTS suite and/or Ace-Step, possibilities are endless. I really need to finish a comp to post. I keep getting distracted discovering everything this model can do.

scotttybreadApr 12, 2026· 1 reaction

CivitAI

yes lip sync is working fine. Just don't try to upload mp3, wav is good for example 40k Hz. Yes mono not working properly, try stereo. Vertical video seems fine. Using on rental 5090, my 5070ti would die. But thanks for sharing it, amazing job

Workflows

LTXV2

by PixelMuseAI

Download (Beta) View on CivitAI

tool

ltxv-2

Details

Downloads

2,808

Platform

CivitAI

Platform Status

Available

Created

3/6/2026

Updated

6/24/2026

Deleted

Files

ltx2ImageAudioTo_ltx23.zip

Size:

5.19 KB

SHA256:

8b58507d1d9a90ca0d6a34cd91678c549f590b77102395974a8bf2e101293e65

Mirrors

HuggingFace (1 mirrors)

ltx2ImageAudioTo_ltx23.zip

CivitAI (1 mirrors)

ltx2ImageAudioTo_ltx23.zip

Description

FAQ

What is LTX-2 Image Audio to Video?

What files are available and where can I download them?

Comments (15)

Details

Files

ltx2ImageAudioTo_ltx23.zip

Mirrors