DON'T USE THIS
It was a fun run. I2V proper is released, so download that. I'll leave this up only as a record of a weird old way of janking together an I2V process before it was released.
Final version! (probably)
V4 introduces the Refinement speed hack (works great with a guiding video which depthflow uses)
Flux re-enabled
More electrolytes!
This I think is where I will stop. I have had a lot of frustrating fun playing with this and my other backend workflow for the speed hack, but I think this is finally at a place I am fairly okay with. I hope you enjoy it and post your results down below. If there are problems (always problems), post in the comments also. I or others will try to help out.
Alright Hunyuan, ball's in your court. How about the official release to make this irrelevant? We're all doing these janky workarounds, so just pop it out already. btw, if you use this for your official workflow, cut me a check, I like eating.
btw, check out the other workflow on here, the leapfusion thing. It actually works pretty well. Less control over what you're going for, but closer to the original picture. Both are cool to have.
Final update: (HA!)
Added Hunyuan Refiner step for awesomeness
Streamlined
Minor update:
V3.1 is more about refining.
Removed Reactor (pulled from GitHub)
Removed Flux (broken)
Removed Florence (huge memory issue)
Denoodled
Added a few new options to depthflow.
V3: IT'S THE FINAL COUNTDOWN!
Alright, this is probably enough. someone else get creative and go from here, but I think I am done messing around with this overall and am happy with it...(until I am not. Come on Hunyuan...release the actual image 2 video)
Anyhow, tweaks and thangs:
Added in Florence for recommendation prompt (not attached, just giving you suggestions if you have it on for the hunyuan bit)
Added switches for turning things on and off
More logical flow (slight overhead save)
Shrink image after Depthflow for better preservation of picture elements
Made more stroking colors (Follow the black) and organization for important settings areas
Various tweaks and nudges that I didn't note.
V2:
More optimized, a few more settings added, some pointless nodes removed, and overall a better workflow. Also added in optional Flux group if you want to use that instead of XL
Also added some help with TeaCache (play around with that for speed, but don't go crazy with the threshold; raise it in small increments)
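Why small increments matter: here is a toy sketch of the general idea behind this kind of step caching, not the actual TeaCache node code. It assumes the sampler tracks how much things change from step to step, reuses the previous result while the accumulated change stays under the threshold, and recomputes (and resets) once it crosses it. The `run_with_cache` function and the per-step change values are made up for illustration.

```python
def run_with_cache(changes, thresh):
    """Return one flag per step: True = model actually ran, False = cached result reused.

    `changes` is a hypothetical per-step relative-change estimate;
    a real caching node would derive its own estimate internally.
    """
    ran = []
    accumulated = 0.0
    for i, change in enumerate(changes):
        if i == 0:
            ran.append(True)       # the first step always has to compute
            continue
        accumulated += change
        if accumulated < thresh:
            ran.append(False)      # little drift so far: reuse the cached result
        else:
            ran.append(True)       # too much drift: recompute and reset the counter
            accumulated = 0.0
    return ran


# Ten steps with a made-up, constant change of 0.04 per step.
changes = [0.0] + [0.04] * 9
for thresh in (0.10, 0.15, 0.30):
    ran = run_with_cache(changes, thresh)
    print(f"thresh={thresh:.2f}: {sum(ran)} of {len(ran)} steps computed")
```

In this toy run, a higher threshold means fewer steps are actually computed, which is where the speed comes from, and also why quality falls apart if you push it too far.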
Anyhow, give this a shot, it's actually pretty impressive. I am not expecting much difference between this vs whenever they come out with I2V natively...(hopefully theirs will be faster though, the depthflow step is a hangup)
Thanks to the person who tipped me 1k buzz btw. I am not 100% sure what to do with it, but that was cool!
Anyhow
(NOTE: I genuinely don't know what I am doing regarding the HunyuanFast vs Regular and Lora. I wrote don't use it, and that remains true if you leave it on the fast model, but use it if using the full model. Ask others, don't take my word as gospel. Consider me GPT2.0 making stuff up. All I know is that this process works great for a hacky image2video knockoff)
XL HunYuan Janky I2V DepthFlow: A Slightly Polished Janky Workflow
This is real Image-to-Video. It’s also a bit of sorcery. It’s DepthFlow warlock rituals combined with HunYuan magic to create something that looks like real motion (well, it is real motion..sort of). Whether it’s practical or just wildly entertaining, you decide.
Key Notes Before You Start
Denoising freedom. Crank that denoising up if you want sweeping motion and dynamic changes. It won’t slow things down, but it will alter the original image significantly at higher settings (0.80+). Keep that in mind. Even at 0.80+, it'll still be similar to the pic, though.
Resolution matters. Keep the resolution (post XL generation) to 512 or lower in the descale step before it shoots over to DepthFlow for faster processing. Bigger resolutions = slower speeds = why did you do this to yourself?
Melty faces aren’t the problem. Higher denoising changes the face and other details. If you want to keep the exact face, turn on Reactor for face-swapping. Otherwise, turn it off, save some time, and embrace the chaos.
DepthFlow is the magic wand. The more steps you give DepthFlow, the longer the video becomes. Play with it—this is the key to unlocking wild, expressive movements.
Lora setup tips.
Don’t use the FastLoRA: it won’t work with the fast Hunyuan model, which is on by default. Use it if you switch to the full model, though.
Load any other LoRA, even if you’re not directly calling it. The models use the LoRA’s smoothness for better results.
For HunYuan, I recommend Edge_Of_Reality LoRA or similar for realism.
XL LoRAs behave normally. If you’re working in the XL phase, treat it like any other workflow. Once it moves into HunYuan, it uses the LoRA as a secondary helper. Experiment here—use realism or stylistic LoRAs depending on your vision.
WARNING: REACTOR IS TURNED OFF IN WORKFLOW!
(turn on to lose sanity or leave off and save tons of time if you're not partial to the starting face)
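The resolution and DepthFlow notes above boil down to simple arithmetic, sketched below. Assumptions: the helper names are hypothetical, rounding to a multiple of 16 is my own guess at a safe size rather than anything the workflow requires, and DepthFlow "steps" are treated as rendered frames with a frame rate you choose.

```python
def descale_size(width, height, max_side=512, multiple=16):
    """Shrink (width, height) so the longer side is at most max_side.

    Keeps the aspect ratio, never upscales, and rounds each side down
    to a multiple of `multiple` (assumption: 16).
    """
    longest = max(width, height)
    if longest > max_side:
        # Integer math avoids floating-point rounding surprises.
        width = width * max_side // longest
        height = height * max_side // longest
    new_w = max(multiple, width // multiple * multiple)
    new_h = max(multiple, height // multiple * multiple)
    return new_w, new_h


def clip_seconds(frames, fps):
    """Length of the resulting clip: more frames at the same fps = longer video."""
    return frames / fps


print(descale_size(1024, 1024))  # square XL image
print(descale_size(1216, 832))   # landscape XL image
print(clip_seconds(48, 24))      # 48 frames at 24 fps
```

Doubling the frame count doubles the clip length, and it also roughly doubles what HunYuan has to chew through afterwards, so keep both the size and the frame count modest while experimenting.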
How It Works
Generate your starting image.
Be detailed with your prompt in the XL phase, or use an image2image process to refine an existing image.
Want Flux enhancements? Go for it, but it’s optional. The denoising from the Hunyuan bit will probably alter most of the Flux magic anyhow, so I went with XL speed over Flux's clarity, but sure, give it a shot. Enable the group, alter things, and it's ready to go. Really just a flip of a switch.
DepthFlow creates movement.
Add exaggerated zooms, pans, and tilts in DepthFlow. This movement makes HunYuan interpret dynamic gestures, walking, and other actions.
Don’t make it too spazzy unless chaos is your goal.
HunYuan processes it.
This is where the magic happens. Noise, denoising, and movement interpretation turn DepthFlow output into a smooth, moving video.
Subtle denoising (0.50 or lower) keeps things close to the original image. Higher denoising (0.80+) creates pronounced motion but deviates more from the original.
Reactor (optional).
If you care about keeping the exact original face, Reactor will swap it back in, frame by frame.
If you’re okay with slight face variations, turn Reactor off and save some time.
Upscale the final result.
The final step upscales your video to 1024x1024 (or double your original resolution).
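The steps above can be sketched as a plain stage list. This is not how ComfyUI represents the workflow internally; the `Switches` class, the `plan` and `denoise_hint` functions, and the 0.65 default are all hypothetical, and only the stage order, the on/off options, and the 0.50 / 0.80 denoise guidance come from the description.

```python
from dataclasses import dataclass


@dataclass
class Switches:
    use_flux: bool = False       # XL is the default image generator
    use_depthflow: bool = True   # produces the motion video HunYuan works from
    use_reactor: bool = False    # off by default in the workflow
    denoise: float = 0.65        # hypothetical default, not taken from the workflow


def plan(s):
    """Return the ordered list of stages for the given switches."""
    if not s.use_depthflow:
        # The HunYuan stage is video-to-video, so it needs DepthFlow's output.
        raise ValueError("DepthFlow must stay on: HunYuan needs a video as input")
    if not 0.0 < s.denoise <= 1.0:
        raise ValueError("denoise must be in (0, 1]")
    stages = [
        "Flux image" if s.use_flux else "XL image",
        "DepthFlow motion video",
        f"HunYuan video-to-video (denoise={s.denoise:.2f})",
    ]
    if s.use_reactor:
        stages.append("Reactor face swap")
    stages.append("upscale")
    return stages


def denoise_hint(denoise):
    """Rough guidance from the description above."""
    if denoise <= 0.50:
        return "stays close to the original image"
    if denoise >= 0.80:
        return "pronounced motion, drifts further from the original"
    return "somewhere in between"


for stage in plan(Switches()):
    print(stage)
print(denoise_hint(0.85))
```

The check on `use_depthflow` is the important design point: the end-to-end result is image in, video out, but the HunYuan stage itself only ever sees a video.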
Why This Exists
Because waiting for HunYuan’s true image-to-video feature was taking too long, and I needed something to tinker with. This (less) janky process works, and it’s a blast to experiment with.
Second warning:
You're probably gonna be asked to download a bunch of nodes you don't have installed yet (DepthFlow, Reactor, and possibly some others). Just a heads up.
Final Thoughts
This workflow is far from perfect, but it gets the job done. If you have improvements, go wild—credit is appreciated but not required. I just want to inspire people to experiment with LoRAs and workflows.
And remember, this isn’t Hollywood-grade video generation. It’s creative sorcery for those of us stuck in the "almost but not quite" phase of technology. Have fun!
Description
Added in Refiner Speed Hack
Enabled Flux
Rearranging for easier altering / less seeking out
Electrolytes
Comments (28)
Hello everyone! I can't install TeaCacheHunyuanVideoSampler. I've already completely removed ComfyUI, all extensions, and tried installing it both manually and automatically, but I keep getting this error. Is there a solution to this?
Hey. install ComfyUI TTP Toolset
@saturngfx Why has ComfyUI started working so poorly and displaying all these nodes incorrectly? It's a complete mess! Thank you very much, I'll go test it now.
I love your notes. This is probably just a hair overcomplicated for me. I think people just want a better AnimateDiff. I've never really gotten great results with it either, even after all that time trying to figure out the ControlNets. Ugh lol
You eat an elephant one bite at a time :)
It's been made. Very interesting idea. But the video that comes out can only be controlled as a fixed action in a few directions. So I'm thinking: just for V2V, use LTX to quickly generate videos and then go to Hunyuan V2V. It should be more interesting, right? I don't know English, this is translated with Google, sorry for the grammatical errors.
There are workflows that do that, I believe. You can try them. I suspect they are extremely GPU intensive. As far as how many directions it can go, there are a multitude of things that can be done with the various tools of depthflow. Experimentation is key, but it's okay if it's not for you.
I got missing nodes for depthflow. Changing builds or reinstalling still gives the error. How to fix?
Something is probably clashing with depthflow. Test that theory by removing all nodes, then installing only the manager (of course) and depthflow. See if that works, then build up from there. That would be my method to troubleshoot.
same issue, did you figure it out?
@Always_Blue One thing that I find often borks me is not having updated comfyui and nodes. just a suggestion, pop into manager, update comfy, update all, and see if that sorts anything out (good practice anyhow)
Nice try, but a complete mess. The Hunyuan part does not work, there are error messages coming from other de-validated groups, no resolution set. We need a new 'final' V5.
Not sure why it's not working for you, but it works for thousands of others.
what seems to be the issue? maybe we can troubleshoot a bit to figure out what is off in your install?
the teacache nodes didn't load automatically for me, had to manually replace them. Also I couldn't find the UNETLoaderNF4 anywhere.
@Numerous If you're talking about the unet loader for flux, I believe it's from the comfyui-gguf stuff. You can of course use whatever loader you want, but that one worked well enough for me.
Teacache is a pain. Make sure you've updated both comfyui and the extensions...it's such a pain to work with, but it does help speed things up a bit. if its being too much of a bother just rip it out and toss in. yeah.
This was working excellently for me for about a week. Starting today my comfyui is disconnecting every time it gets to the "Load Diffusion Model" node. I just used it yesterday and haven't updated anything. Oh well. It was fun while it lasted. Anyway, this is nothing against your workflow. Just wanted to say thank you for making this. Now time to wait for the official hunyuan i2v model.
GPU peaking out or something? do me a favor and test a bit, maybe put it on a lower resolution. are you using Flux (Flux gives me a very janky time...disconnects by just eyeing it a hair above like 768)
@saturngfx I wasn't using flux, but previously I was using the hunyuan_t2v_bf16 model. I noticed my terminal had a "clip missing" error before comfyui was disconnecting. I switched to the hunyuan_t2v_comfy native_fp8 model that came out last week and it is working now. Thanks again
@zerox369 Glad you sorted it out. Yeah, these things are beasts for memory...but this time next year we'll probably be complaining about a 20 second HD video taking 45 seconds or something crazy. Wild times.
Here's what happened. I'm not interested in depthflow or SDXL, just Hunyuan, so I turned off those options. Then when queuing, it throws me errors from depthflow (missing motion, missing depthmap) that I don't want. Error in VHS: required input image. Then I bypassed those nodes, and finally I got a last message in VaeEncode Tiled (this time in the Hunyuan group): Required Input is Missing in Pixels. I don't know what that is about.
ahh. well here is the issue then.
put in a picture. alright, you got 1 still frame. this then moves to depthflow which moves the picture around in various ways depending on how you choose. this creates a video of motion. it then takes that video and sends it to hunyuan for a video 2 video processing.
the secret sauce is video 2 video as image 2 video isn't a thing yet officially (though there is some interesting stuff going on with that one Lora in the other workflow)
depthflow is the engine to the car. if you remove the engine, you can't really complain that the car no longer moves :)
the image to video bit is you start with a image...be it XL, Flux, or your own image, and the end result is a video, but the inner workings is actually video to video.
@saturngfx well in that case you should remove the depthflow switcher because it is confusing. Thanks.
@vennettillieric762 would be nice, but I don't know how, as I didn't create it. It shows any and all groups. People hate having too many nodes not grouped up.
Can you please link the vaes that are used in the workflow?
https://civitai.com/models/1167575?modelVersionId=1314474
https://huggingface.co/stabilityai/sdxl-vae
Nothing secret. whatever vae you use for normal XL gens, or flux if you're using that, and then the vae for hunyuan video.
ugggh
Quick question: closeupface v1.1 seems nowhere to be found. Is there an alternative?
Sure, it's just a LoRA I used. You can swap it out for anything, or just disable it. It's not important. All LoRAs I use are somewhere here on CivitAI, so if you wanted that exact model, it'll be on here somewhere. Any of the close up face things would work...all about experimentation :)


