This Lora is made to enhance the ability / quality to create anthro and furry
but it works also for all kind of other fantasy creatures and animals too
the range that it can handle for example the Wan 2.2 V2 Lora is from comic and drawing towards real photos .
i made the lora for Hunyuan Video , WAN 2.1 and 2.2 (SFW and NSFW)
Furry Enhancer Hunyuan V1
The Hunyuan Video version is trained for Text to Video (T2V) but i also tested it on Image to Video its also working with it very good but not made for that.
All example images for the Hunyuan Video are made T2V
Its also compatible with Ratatoskr Hunyuan
Wan 2.1 I2v and T2V V1
The Wan 2.1 has 2 different models
I2V and T2V
the image to video is in most cases the model of choice for Wan
for the Text to Video Variant please read the description.
Its also compatible Ratatoskr WAN 2.1 (ITV) and will for sure increase the quality especially in the NSFW Area.
I also tested the I2V Lora with Wan 2.2 it seems to enhance the result too but very experimental
Wan 2.2 I2V and T2V V1
This enhancer version is for Wan 2.2 its splitted in basically 2 Loras: low and high noise
both loras for I2V are trained on the same dataset. I its my own dataset before i trained with the wan 2.1 Lora but add more pictures and videos
it's completely synthetic so no real persons or art is used for the dataset.
whats the pros of this lora compared to the wan 2.1?
more consistent
more specimen
better in sfw and nsfw
i recommend using both loras and use the workflow in the training folder Note: your resolution needed to set between 0.4 (480P) and 0.92 (720p) overwise the lora generate backscreens or other failures.
for strength i prefer the full 1.0 strength its tested
its tested with the original wan 2.2 model
also in a short test with my Ratatoskr WAN 2.2 Hybrid v1.0 WIP model (use only the high noise version with that model)
for technical data about Wan 2.2 visit there official site here:
V2 Furry Enhancer High Noise
Changes compared to V1
more consistent
more specimen
better overall movements
more stable overall
way better with comic and anime than V1
better in sfw and nsfw
i recommend using the V1 Version of the WAN 2.2 I2V V1.0Low Noise
Note: I used for example the old workflow for this model to compare V1 with V1 if you like the newest workflow download the training folder.
Important for this workflow is to keep the resolution between 0.4 and 0.92 (480p - 720p)
V3 High Noise and Low Noise
This enhancer version is for Wan 2.2
its trained my own dataset
it's completely synthetic so no real persons or art is used for the dataset.
the biggest change in this dataset is its now completely trained on videos
Changes compared to V2
trained now on over 650 Videos
better overall movements
more emotions
way better with comic and anime than V1 and V2
better in sfw and nsfw
for this Version there is also an newtrained Low Noise Model
LTX2.3 Version (WIP Preview V0.1)
this is an preview version of the first attempt to train the ltx 2.3 model with a very limited dataset
so it's more like an technical demonstration / trial than a fully trained lora.
So don't expect best results yet it's highly experimental.
for workflow i have also added an experimental workflow.
LTX2.3 Version V1 Lora Pack
This is the First full Release Version of the LTX Version of Furry Enhancer Video.
Better to say it's an model pack because
V1.1 is made based on ltx 2.0 training setup so best for the LTX 2.0 model
V1.215 its the main model trained specifically for the ltx 2.3 Version so start usually with that lora
V1.22 More advanced in training but has sound issues.
V1.3 experimental (wip) model trained in different training modes (heavy sound issues)
you find the used workflows and the different model versions in the "training folder"
the loras are tested on the dev version and the distilled variant.
what has changed to preview?
Trained on the full video database (V1.2 and up) at around 700 videos instead of only 40 videos
Also trained on my photo database of ratatoskr overall over 10k pictures
Trained for i2v and t2v
better overall knowledge and movement .
Note: this is still in development because most of the trained videos are based on my wan2.2 database so its planned to train an v2 this year with video output of that model.
also i don't have my own workflow done yet so it depends on other workflow don by afroman4peace and akinson
LTX2.3 Version V2.4
I have decided to retrain the dataset on a far advanced system now its trained on a server with an rtx6000 pro blackwell for more than a week 24/7 (and yes this was expensive)
so what has changed from V1 to V2.4?
the trained video resolution is now at 720P constant
image training did improved from 1024p to 1536p
added 120 new sound videos for training especially for speech and jaw movements.
seperated videos with sound and not sound and trained in different databases
trained for T2V and I2V in stages so its nearly equally trained for both now
added new pictures to database too but more important overworked the captions fitting better for video training
For settings and workflow i have added my example videos in the training folder the images and some videos has the metadata just put them into comfy
on my workflows was the loral at best between 0.8 and 1.0
the lora is trained on the dev model but also tested for the distilled version
for now still don't have my own workflow done yet so it depends on other workflow don by afroman4peace and akinson
please support my work with an like and look on my other loras and models too
the newest workflow download the training folder.
in all videos the metadata is also included just download the video and put it into Comfy
Important for this workflow is to keep the resolution between 0.4 and 0.92 (480p - 720p)
if you like to support me please give a like and Check out my other models.
Making such loras take a huge amount of cost and time please support my work with a like.
and please check out my other models and loras too ;-)
made by freek22
Description
High Noise Lora:
This enhancer version is for Wan 2.2 its splitted in basically 2 Loras: low and high noise
both loras for I2V are trained on the same dataset. I its my own dataset before i trained with the wan 2.1 Lora but add more pictures and videos
it's completely synthetic so no real persons or art is used for the dataset.
whats the pros of this lora compared to the wan 2.1?
more consistent
more specimen
better in sfw and nsfw
i recommend using both loras in the video metadata there is a comfy workflow for using it.
if you like to support me please give a like and Check out my other models.
FAQ
Comments (27)
For Online Generation of the 2.2 Wan variant i build this template: https://tensor.art/template/899736312733543135
@Akkairosu whats the problem with the tool?
It seems tensor.art has some issues they try to fix asap
The tensor.art problem is solved working now correctly
I must be missing something. Would love to use the new Wan 2.2 I2V enhancer LoRAs but even with the provided stock workflows (which use lightning loras, 10 steps, CFG 1.0) - the videos many times have 1 second of video then immediately fades to black.
I've tried removing the Lightning LoRAs and cranking both KSamplers to be 20 steps, 3.5 CFG, Simple Euler without much success. Videos gen fine if I disable the Enhancers. :(
this is very strange never had that issue in all my testings. i used the workflow that is in the video metadata do you use sage attention i may have some influence because i never use it.
@freek22 I even tried a fresh comfyui installation and still had the same issue. (Not using sage either). I've put together a post with the workflows (should be identical to yours in your demo pics) so you can see what happens with/without the LoRA. No errors in the console either.
FOLLOW UP: I just tested another image (square aspect ratio) and it worked perfectly with the Enhancer LoRAs - I'm starting to wonder if that german shepherd pic (which I got from the video on the demo page with a resolution of 480x720) from the demo is just problematic?
@melongrab could it also a vram / ram problem? It's really strange, you can test that by turn down the render resolution.
@freek22 I'm on an RTX 4090 24GB with 128GB system ram so I don't think its memory related - been using Wan 2.1 and 2.2 for a while without any issues. Your workflow already has a node that resizes down to 720 (width/height) so resolution is already lowered I think.
I've done a bunch more tests in case it is helpful. The Furry Enhancer LoRAs work perfectly with square aspect ratio images (e.g. 720x720), but portrait aspect (2:3 like 480x720 or 9:16) almost always produce "fade to black" animations, or other visual artifacts.
Final Update: Even tried using Furry Enhancer Hi/Lo Noise LoRAs with fresh install of Wan2GP which downloads/manages all models/python installation/etc. I couldn't get portrait aspect ratio images to generate properly - they would fade out, or show gray.
Found out it may it's indeed chosen picture ratio but more than only square is working will provide a workflow for that tomorrow
Having same issues, except its an actual black fabric curtain falling down then cover the screen, thats so odd lol.
@Insistent it is I know the issue it's partially wan2.2 related keep it in standard resolutions for now then it works but I have found already a solution will provide a workflow soon
I tried with a 640x640 image and 81 frames and I get the same issue here, it creates a brown curtain after 1 second of the video has passed.
@Sioran generated n own system? if made on tensor its another problem that they have on site overall
Looking forward to the updated workflow!
@freek22 I managed to get a "fix" on my local workflow with the rtx 5060 16gb I have. I lowered the strength of the lora to .6 and added negative tags with "fade out, glow, fade to black" and 3 out of 5 gens appear good. Same square resolution of 640 and 320 for testing.
@Sioran I also fixed the thing turn down the weight may help a bit but it's not the issue at all will upload the workflow soon . On my computer the bug never occured at all so I tested on my buddy's system
New workflow is added: For testing i also attached the original image. you can test your system with that.
@freek22 I tried using 480x720. First attempt there was a naked black guy fallling from the sky landing directly on the anthro wolf before it fades to dark, second attempt a seclusion room materialized and locked the camera/viewer inside. My prompt is nothing even close to any of it lol.
@Insistent did you try the new workflow? Including the example image/ test image
@Insistent the model itself runs fine on my system I tested More than 170 images never had any issue also the only thing was at some systems but it's fixed when using the Megapixel setting I implemented because some need an even divided by 64 value
@freek22 Essentially, the Lora is sensitive to image resizing nodes? If so i could try to incorporate them to the workflow i use. The workflow you linked work buts its very slow, i only have a 5070ti, its not a powerhouse.
@Insistent you can build it in other workflows or reduce the Megapixel value further down till 0.5 it's tested and yes seems it's sensitive about the image ratio also its possible to reduce the step count to min 6 steps overall
so i found the problem its wan 2.2 related: first use the provided workflow or copy the notes that set the image resolution: important never go under 0.41 resolution (480p) its the reason for the black screens also exceeding 0.92 (720p can lead to problems)
updated the High Noise model in my tests the model can handle now lower resolutions better
This Lora is now in v3 with many improvements
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.