CivArchive
    Text-to-Speech with Voice Clone in LTX 2.3 - v1.0
    Preview 125572341
    Preview 125572377

    Use LTX 2.3 as a Text-to-Speech (TTS) model.

    Note: As it was pointed out in the comments below, there is an outstanding bug in ComfyUI that may prevent you from using the new voice cloning node in this workflow. See the discussion on github for more information.

    This workflow is designed to generate audio only output from a prompt and a speech sample. The generated audio will clone the sample voice and apply it to the specified dialogue and prompt. If you want to make more than one video with a consistent character voice, then you can't rely on LTX's random voice assignment, so this workflow will give you a consistent voice that you can use to create whatever spoken script you want.

    Voice cloning is made possible with the ID-LoRA models and new ComfyUI node, "LTXV Reference Audio". In my experience, the video generated with this LoRA isn't very reliable or high-quality, so I had better results from creating an audio file and applying that pre-made audio track to a new LTX 2.3 video based on a starting image. Those results haven't been 100% perfect either, but the success rate was higher than any other method I tried.

    IMPORTANT: If you haven't updated your ComfyUI installation after about March 25, 2026, you will need to run an update to get the new ComfyUI native node for ID-LoRA.

    Please see the ID-LoRA github page for important guidance on prompt formatting and usage. The default values in this workflow have worked well for me, but all the nodes are clearly exposed and labeled so you can tweak and experiment to get your own favorite results.

    Github: https://github.com/ID-LoRA/ID-LoRA

    One more piece of advice for generating audio tracks with LTX 2.3: The length of the generated audio clip is very important, probably more important than any other setting in the workflow. If the time is too short, LTX will rush through some of the script with almost no pause in between sentences and the result doesn't sound as natural as LTX is capable of doing. If the time is too long, LTX will stretch out pauses and sometimes repeat sections of the script. My recommendation is to say your script out loud to yourself, in a normal conversation pace, and time yourself doing it. Use that time as the duration for your clip and then adjust it longer or shorter as needed.

    Finally, don't be afraid to generate multiple audio clips if you have a long script with several breaks or pauses and LTX can't seem to get the pauses right. It's much easier to combine audio files into one track than it is to combine video and there are lots of online tools to help with that. When you assemble your own final audio track, you can insert pauses as long or short as you want.

    Description

    FAQ

    Comments (19)

    gambikules858Mar 28, 2026
    CivitAI

    LTX is insane.

    darkroast175696
    Author
    Mar 28, 2026

    I agree, I've tried a couple other text to speech models and LTX produces by far the most natural sounding speech in its videos. I figured it was worth it just for the audio in this case, so the workflow just makes a tiny throwaway video and saves off the audio file.

    PTGMar 28, 2026· 1 reaction
    CivitAI

    OMG, i think this is the first time i can clone a voice with emotion for french speaker ! Thank your for the workflow !

    darkroast175696
    Author
    Mar 28, 2026

    You're welcome!

    InsidiousOneMar 29, 2026· 1 reaction
    CivitAI

    Apparently LTXV Reference Audio is broken in the latest ComfyUI 18.1, 18.2. Damm, I really wanted to try this. But I'm not about to go copying over files that work for everything else. Hope they fix it soon. https://github.com/Comfy-Org/ComfyUI/issues/13194

    darkroast175696
    Author
    Mar 29, 2026· 1 reaction

    Ugh, that's really annoying and unfortunate. I have an older installation of comfyui portable on windows and I've just been running the update_comfyui.bat update script from time to time. Running that script earlier this week gave me the updated comfy with the node that I needed for id-lora.

    I don't know exactly what went haywire with their distribution, but maybe you could try a fresh install of comfyui portable from a few versions ago in a different directly and then run the update script, and see what happens?

    Sorry, I hope it gets fixed soon, too. Partly because now I'm afraid to run another update until it's solid.

    darkroast175696
    Author
    Mar 29, 2026· 1 reaction

    I checked my comfyui installation and I'm running version 0.18.1 according to "comfyui_version.py". That's the version I used to build and test this workspace in the first place, so maybe it would work for you?

    InsidiousOneMar 29, 2026

    @darkroast175696 I'm running 18.2. For me, it's usually a 50/50 chance that I break something trying to fix something else. Since I'm not the only one experiencing it, I'm willing to wait a little bit to see if it gets fixed on their end. I thank you for your effort all the same.

    lithiumhireMar 30, 2026· 1 reaction

    According to their GH, Comfy is standing down on all feature work in April and focusing solely on bugfixes and stability, which they clearly need.

    undertheblacksunApr 6, 2026

    Did this get fixed yet?

    darkroast175696
    Author
    Apr 6, 2026· 2 reactions

    @undertheblacksun The github issue thread I was following doesn't have any "we fixed it" updates yet, so I'm assuming that it's not fixed. I haven't tried installing a new comfyui just to find out. My installation is working right now, so I'm afraid to mess with it.

    darkroast175696
    Author
    Apr 12, 2026· 1 reaction

    @archimo2021app928 In the github thread from InsidiousOne's original comment, there are new posts indicating the nightly branch of comfyui has introduced a fix for this issue. Even if you aren't able to switch to the nightly branch and get the update, this at least means the fix is in the pipeline and should be part of a release update soon.

    darkroast175696
    Author
    Apr 15, 2026

    The github thread has a new message indicating the bug may now be fixed, in case anyone wants to give it a try. You'll need to update your comfyui to the newest version to test it out.
    https://github.com/Comfy-Org/ComfyUI/issues/13194#issuecomment-4249039663

    learnrijoMar 29, 2026
    CivitAI

    hey how to download the lora from github ? any direct link ? Sorry, new to github and huggingface

    corraline101642Mar 29, 2026

    Same Question ;)

    learnrijoMar 29, 2026· 1 reaction
    darkroast175696
    Author
    Mar 29, 2026

    @learnrijo There are links to the ID-LoRA files in the workflow. If you look at the "Load Models" area, there is a note above that section with links to all the important files.

    learnrijoMar 29, 2026

    @darkroast175696 yes thanks, the links to huggingface page, but not a simple download button I can find on that page.

    Workflows
    LTXV 2.3

    Details

    Downloads
    445
    Platform
    CivitAI
    Platform Status
    Available
    Created
    3/27/2026
    Updated
    5/13/2026
    Deleted
    -

    Files

    textToSpeechWithVoice_v10.zip

    Mirrors

    HuggingFace (1 mirrors)