v1.2 - Minor Update 2025-Oct-13
Sort the input directory list.
v1.1 - Minor Update 2025-Oct-11
Preserve input framerate in workflow VACE outputs. Previously, all output was forced to 16 fps. Note: you must still manually set the framerate in the Join & Save output.
Changed default model/sampler to Wan 2.2 Fun VACE fp8/KSampler. GGUF, MoE, 2.1 are still available in the bypassed subgraphs.
This workflow uses Wan VACE (Wan 2.2 Fun VACE or Wan 2.1 VACE, your choice!) to smooth out awkward motion transitions between video clips. If you have noisy frames at the start or end of your clips, this technique can also get rid of those.
I've used this workflow to join first-last frame videos for some time and I thought others might find it useful.
The workflow iterates over any number of video clips in a directory, generating smooth transitions between them by replacing a configurable number of frames at the transition. The frames found just before and just after the transition are used as context for generating the replacement frames. The number of context frames is also configurable. Optionally, the workflow can also join the smoothed clips together. Or you can accomplish this in your favorite video editor.
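To make the frame bookkeeping concrete, here is a small sketch of the replacement scheme described above. This is purely illustrative, not part of the workflow JSON: the function name, the default counts of 8 context and 8 replace frames, and the integer "frames" are all hypothetical.

```python
# Illustrative sketch of the transition-replacement scheme: at each joint,
# `replace` frames on each side of the cut are discarded and regenerated,
# with `context` frames just outside that window guiding the generation.
# Names and default counts are hypothetical, not taken from the workflow.

def plan_transition(clip_a, clip_b, replace=8, context=8):
    """Return (context_before, num_new, context_after) for one joint."""
    keep_a = clip_a[:-replace]          # tail of clip A is dropped
    keep_b = clip_b[replace:]           # head of clip B is dropped
    context_before = keep_a[-context:]  # last kept frames of A
    context_after = keep_b[:context]    # first kept frames of B
    num_new = 2 * replace               # frames VACE must synthesize
    return context_before, num_new, context_after

# Toy example with frames represented as integers:
a = list(range(0, 81))      # an 81-frame clip
b = list(range(100, 181))   # the next clip
before, n_new, after = plan_transition(a, b)
print(len(before), n_new, len(after))  # 8 16 8
```

The regenerated span then replaces the dropped frames at the joint, so the smoothed clips overlap cleanly when joined.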
Detailed usage instructions can be found in the workflow. Please read them carefully. This is not a ready-to-run workflow; you need to modify it to fit your system. Directions about what to modify are in the instructions.
VACE can sometimes shift the brightness or saturation of the clips it generates. I don't know how to avoid this tendency; I suspect it's baked into the model, unfortunately. The best mitigations I've found are frame interpolation to 60 fps and color correction in post-processing.
I've used native nodes and tried to keep the custom node dependencies to a minimum. The following packages are required. All of them are installable through the Manager.
ComfyUI-KJNodes
ComfyUI-VideoHelperSuite
ComfyUI-mxToolkit
Basic data handling
ComfyUI-GGUF - only needed if you'll be loading GGUF models. If not, you can delete the sampler subgraph that uses GGUF to remove the requirement.
KSampler for Wan 2.2 MoE for ComfyUI - only needed if you plan to use the MoE KSampler. If not, you can delete the MoE sampler subgraph to remove the requirement.
The workflow uses subgraphs, so your ComfyUI needs to be relatively up-to-date.
Model loading and inference are isolated in a subgraph, so it should be easy to adapt this workflow to your preferred setup. Just replace the provided sampler subgraph with one that implements your setup, then plug it into the workflow.
I am happy to answer questions about the workflow. I am less happy to instruct you on the basics of ComfyUI usage.
External Link: https://files.catbox.moe/erx0k4.json
Regarding Framerate
If your input videos are at some higher framerate, you'll probably get sub-optimal results. The Wan models are trained at 16 fps, so you'll likely see diminishing returns from VACE at higher framerates. At minimum, you'll need to increase the number of context and replace frames by the factor your framerate exceeds 16 fps.
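The scaling rule above amounts to a quick calculation. This is a sketch of my reading of it, assuming 16 fps as the baseline; the example base counts of 8 are hypothetical, not values from the workflow.

```python
# Rough scaling of context/replace frame counts with input framerate,
# assuming the Wan 16 fps training rate as the baseline. The base counts
# used in the examples are hypothetical.
import math

def scale_counts(base_context, base_replace, fps, base_fps=16):
    factor = fps / base_fps  # how much faster than the training rate
    return (math.ceil(base_context * factor),
            math.ceil(base_replace * factor))

print(scale_counts(8, 8, 24))  # (12, 12): 24 fps is 1.5x the 16 fps base
print(scale_counts(8, 8, 30))  # (15, 15)
```

At 16 fps input the counts are unchanged, which is why downsampling first (as suggested below) keeps the windows small.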
I suggest forcing your inputs down to 16 fps for processing with this workflow, then re-interpolating back up to your desired framerate.
Models
Since this is kind of an intermediate-level workflow, I initially didn't provide any information about what models are required. Anybody who needs a workflow to smooth transitions between a bunch of already-generated video clips probably knows their way around a Wan workflow.
But it has occurred to me that not everybody may know where to get the VACE models or what exactly to do with them. And it may not be common knowledge that VACE is derived from the T2V models, not I2V.
So, here are download links for VACE models. Choose what’s right for your system and use case. You already know that you only need one set of VACE files from this list, so I won’t insult your intelligence by mentioning that.
Wan 2.2 Fun VACE
Wan 2.1 VACE
Kijai’s extracted Fun VACE 2.2 modules, for loading along with standard T2V models. Native use examples here.
And then of course you’ll need the usual VAE and text encoder models, and maybe a lightning lora. Use a T2V lora because VACE is trained from the Wan T2V models.