ALL QUESTIONS WILL BE ANSWERED IN THE COMMENTS :)
Because there is a lot to say about how the model and workflows work...
I will try to answer and help you if you run into problems :)
(Upload Still in progress...)
COMING SOON:
LTX-2 Q6_K.GGUF
LTX2 Audio VAE BF16
TEXT ENCODER GEMMA 3 12B
LTX-2-1B-Embeddings_connector_distill_BF16
Special GGUF EDITED WORKFLOW IMG2IMG / TXT2IMG
LORA DISTILLED VERSION
SPATIAL UPSCALER
TEMPORAL UPSCALER
CAMERA CONTROL LORAS
CONTROLNET AIO LTX2
Workflows: I2V / V2V / T2V / VDETAILER
(Keep in mind I'm not the owner of this model.)
I made this upload for Civitai from the official public LTX Hugging Face repo.
https://huggingface.co/Lightricks
I just want to centralize the most useful models on this Civitai page,
so nobody gets lost among the many models available on the original repo.
I’m not asking for any “buzz” or anything like that; it’s just to help the community have a solid LTX base on Civitai (for people not familiar with Hugging Face).
I won’t upload the big models. I leave that to people who actually want to use them, even though the benefits of using them are really minimal.
I will also collect my own releases and unofficial LTX base-model variations here.
⚡ LTX-2 FP8 — Distilled (Fast & Lightweight)
What is LTX-2 FP8 Distilled?
The FP8 Distilled version is a compressed and accelerated variant of LTX-2, trained to replicate the behavior of the full model while being faster and lighter.
Distillation reduces model complexity, making it more efficient — at the cost of some fine-grained detail.
✅ Key Characteristics
Faster generation speed
Lower VRAM requirements
Quicker prompt response
Slightly reduced fine detail compared to full FP8
Excellent quality-to-performance ratio
🎯 Best Use Cases
Rapid iteration & testing
Prompt exploration
Draft videos and previews
Creators with limited hardware
Recommended if:
You want speed and accessibility, and are willing to trade a small amount of detail for faster results.
🔹 LTX-2 FP8 — Standard (Full Quality)
What is LTX-2 FP8 (Standard)?
The FP8 Standard version is a full-quality LTX-2 model quantized to FP8 precision.
It preserves the complete architecture and capabilities of the original model while reducing memory usage.
This is NOT a simplified model.
Only the numerical precision is reduced — the model’s intelligence, structure, and behavior remain intact.
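As a rough illustration of what "only the numerical precision is reduced" means for memory: weight storage scales with the number of bits per parameter, so FP8 weights take about half the space of FP16 weights. The sketch below is back-of-the-envelope arithmetic only. The parameter count is a placeholder chosen for illustration, not a figure from this page, and real VRAM use also includes activations, the text encoder, and the VAE.

```python
# Bits used to store one weight at each precision.
BITS_PER_WEIGHT = {"bf16": 16, "fp16": 16, "fp8": 8, "fp4": 4}


def weight_size_gb(num_params: float, precision: str) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    bits = BITS_PER_WEIGHT[precision]
    return num_params * bits / 8 / 1e9


# Placeholder parameter count, used only to show the scaling.
EXAMPLE_PARAMS = 10e9

for precision in ("fp16", "fp8", "fp4"):
    size = weight_size_gb(EXAMPLE_PARAMS, precision)
    print(f"{precision}: ~{size:.1f} GB of weights")
```

Swap `EXAMPLE_PARAMS` for the real parameter count of the checkpoint you download to get an estimate for your own setup.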
✅ Key Characteristics
High visual fidelity and detail
Strong temporal consistency
Full audio-video synchronization
Lower VRAM usage than FP16
Stable and reliable for long generations
🎯 Best Use Cases
Cinematic video generation
Final renders and high-quality outputs
Creators who want maximum quality with lower hardware requirements
Recommended if:
You want the best possible quality in FP8, with no compromise on features or flexibility.
🧠 Which One Should You Choose?
🎬 Go with FP8 Standard if quality and consistency matter most
⚡ Go with FP8 Distilled if speed and efficiency are your priority
Both versions are fully compatible with ComfyUI workflows and part of the same LTX-2 creative ecosystem.
📌 What is LTX-2?
LTX-2 is a multimodal AI model that transforms text prompts, images, or other media into fully synchronized audiovisual videos, with motion, dialogue, music, and ambient sound generated in one unified pass. It’s built on a hybrid Diffusion-Transformer (DiT) architecture designed specifically for efficient spatiotemporal generation and audio-video alignment.
This approach lets creators go from idea to cinematic result without stitching separate audio tracks manually, a major step beyond typical text-to-video systems.
✨ Key Features & Capabilities
🎥 Cinematic Quality Output
Native 4K resolution support with playback up to 50 FPS, delivering smooth, high-detail video clips ideal for cinematic, commercial, or creative use.
🎵 Unified Audio & Visual Generation
Generates synchronized audio (dialogue, ambience, and music) alongside the video in a single generation pass, removing the need for external audio sync tools.
🔄 Flexible Input & Output Modes
Works with text prompts, image references, multi-keyframe conditioning, and more to animate concepts or stills into motion.
⚙️ Performance Modes
Multiple performance configurations (Fast, Pro, Ultra) allow creators to balance speed and quality according to project needs, from quick drafts to production-ready renders.
🧠 Efficient & Accessible
Highly optimized for consumer-grade GPUs: efficient enough to run on ~16 GB VRAM hardware with FP8/FP4 quantization options, making AI video production more accessible.
🛠️ Open & Extensible
Fully open weights, codebase, and workflows, enabling fine-tuning, custom LoRAs, and integration into tools like ComfyUI.
📈 Improvements Over Earlier Versions
Compared to the original LTX family and other open video models, LTX-2 raises the bar in several key areas:
✅ Audio Integration Built-In
Instead of generating silent videos and requiring post-processing, LTX-2 outputs audio and visual streams together with temporal coherence.
✅ Higher Resolution & Frame Rates
Supports native 4K at up to 50 frames per second, reaching cinema-grade quality, unlike many earlier community models that cap at lower resolutions or frame rates.
✅ Longer Clips
Offers extended duration generation (up to ~20 s clips) with continuous quality and audio coherence, exceeding many alternatives.
✅ Expanded Workflows
Native support in ComfyUI plus custom workflows gives users text-to-video, image-to-video, multi-keyframe conditioning, and creative control nodes.
🧠 Typical Use Cases
🔹 Cinematic storyboarding & concept visuals
🔹 Social media & marketing video content
🔹 Animated storytelling & motion design
🔹 Game cutscenes & immersive narratives
🔹 Product visualizations & dynamic ads
Whether for rapid prototyping or production output, LTX-2 gives creators professional-grade generative video.
🧩 Included Files & Variants
Depending on the checkpoint uploaded, this collection may include:
Full Model Checkpoints (bf16 / fp8 / fp4): maximum quality with quantization options
Distilled Variants: faster iteration with lighter compute cost
Spatial & Temporal Upscalers: improve resolution or frame rate via multiscale pipelines
LoRA & Fine-Tuning Packs: custom stylistic or control extension modules
🔧 ComfyUI Integration & Workflows
Included workflow templates help you use LTX-2 in ComfyUI with nodes for:
📌 Text-to-Video — generate animated clips from prompts
📌 Image-to-Video — animate still images with camera motion and style
📌 Video Conditioning — extend clips forward/backward or refine motions
📌 Keyframe Controls — precise guidance over scene transitions
These workflows are designed for ease of use and creative flexibility while demonstrating best practices for prompt structure and smooth temporal motion.
🧠 Foundation Model Philosophy
LTX-2 goes beyond a single task: it’s a foundation model for audiovisual creative AI. Open access to its weights, code, and tools encourages developers, artists, researchers, and hobbyists alike to customize, extend, and innovate on a common platform.
📌 Summary
LTX-2 is not just another video model: it is a production-ready, synchronized audio-video foundation model that pushes the boundaries of what open-source video generation can achieve. With cinematic output quality, flexible workflows, and a fully open ecosystem, LTX-2 stands as one of the most capable generative video tools available today.
Description
This is the Uncensored version of the GEMMA 3 12B Text Encoder.
Comments (7)
Thanks for the upload, but why do you rename every model?
This will add new problems with versions...
With the LTX2 model I've had the most problems with the gemma3 models. After they download, your models have names that don't tell you what kind of model each one is, and there is no discussion of how to use the gemma3 models: for example, how to rename them, what folder to place them in, etc. I placed them in .../model/test_encoders and this didn't work.
Bro... my models tell you what they are: they use the official, unchanged original names.
FP8-dev -> Non-distilled version, so you can run it at 20 steps.
FP8-DISTILLED -> 8-step version (because the distilled LoRA is already included)
Distilled LoRA -> Can be used with DEV models to turn them into 8-step models :)
For Gemma it's a pain... I know. But use the Gemma 3 12B-iT ABLITERATED.
(That's the uncensored text encoder, and I uploaded it.)
Sometimes you will need to use the text encoder embeddings connector.
But that depends on which CLIP loader you're using ;)
I will upload my perfect AIO workflow, built from scratch for it.
A beautiful one, you will enjoy it ;)
But you can still use Kijai's or anyone else's workflow here.
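The step counts in the reply above can be summarized as a small lookup. This is only a sketch of that rule of thumb: `recommended_steps` and the `"dev"` / `"distilled"` labels are hypothetical names for illustration, not part of ComfyUI or the LTX nodes, and the numbers are the ones quoted in the reply.

```python
# Step counts quoted in the reply above; the keys are illustrative labels.
STEPS_BY_CHECKPOINT = {"dev": 20, "distilled": 8}
DISTILLED_STEPS = 8


def recommended_steps(checkpoint: str, distilled_lora: bool = False) -> int:
    """Return a starting step count for a checkpoint type.

    A dev checkpoint with the distilled LoRA applied is treated as an
    8-step model. The distilled checkpoint already includes that LoRA,
    so the flag changes nothing for it.
    """
    if checkpoint not in STEPS_BY_CHECKPOINT:
        raise ValueError(f"unknown checkpoint type: {checkpoint!r}")
    if checkpoint == "dev" and distilled_lora:
        return DISTILLED_STEPS
    return STEPS_BY_CHECKPOINT[checkpoint]


print(recommended_steps("dev"))
print(recommended_steps("dev", distilled_lora=True))
print(recommended_steps("distilled"))
```

Treat the returned value as a starting point for the sampler's step setting and adjust from there.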
Why doesn't Civitai put _2 models in the ltxv2 section?
Is it a temporary problem on their side, or...?
Yeah, I guess it's a bug; it happens to SDXL models too.
Anybody else having this problem? No matter what I do, I keep getting this:
No such file or directory: C:\ComfyUI\ComfyUI\models\clip\proj_linear.safetensors
It's not related to the models. I think it comes from the "enhancer node", just disable it :)
Basically, that model is already included in the latest ComfyUI update and LTX nodes...
Before disabling it, try "Update All" in your ComfyUI Manager installation :)
Edit: I see "clip" in the path, @charles0813rayburn297, so it may come from your text encoder node,
which may be incompatible or outdated. Check that you have the right model selected in it.
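One way to narrow down a "No such file or directory" error like the one above is to check whether the file exists anywhere under the ComfyUI models folder, in case it sits in a different subfolder than the node expects. A minimal sketch, assuming the models directory shown in the error message; `find_model_file` is a hypothetical helper, not part of ComfyUI.

```python
from pathlib import Path


def find_model_file(models_dir: str, filename: str) -> list[Path]:
    """Return every path under models_dir whose file name matches filename."""
    root = Path(models_dir)
    if not root.is_dir():
        # The models folder itself is missing or the path is wrong.
        return []
    return sorted(p for p in root.rglob(filename) if p.is_file())


if __name__ == "__main__":
    # Path and file name taken from the error message above.
    models_dir = r"C:\ComfyUI\ComfyUI\models"
    hits = find_model_file(models_dir, "proj_linear.safetensors")
    if hits:
        for hit in hits:
            print("found:", hit)
    else:
        print("not found under", models_dir)
```

If the file turns up in another subfolder, the loader node is probably pointed at the wrong folder. If it is not found at all, updating ComfyUI and the LTX nodes, as suggested above, is the first thing to try.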
Details
Files
ltx219BAllYouNeedIs_gemma12BABLITERATED.safetensors
Mirrors
gemma-3-12b-abliterated-text-encoder.safetensors
LTX2_TEXT_ENC_UNCENSORED.safetensors