This workflow is designed for ComfyUI and leverages the Wan 2.2 Enhanced NSFW I2V model (in GGUF and Safetensors formats) to generate high-quality, dynamic image-to-video (I2V) animations, with a strong focus on NSFW content. It supports advanced features like model switching (high/low quality), audio generation via MMAudioSampler, video upscaling, color matching, and final video compilation at up to 50 FPS. The workflow includes built-in LoRA triggers for specific NSFW scenarios (e.g., cowgirl, deepthroat, cunnilingus, full nelson), making it ideal for creating sensual, explicit animations with realistic motion, lighting, and details.
Key Features:
Image-to-Video Generation: Converts a single input image into a video sequence using the WanImageToVideo node. Supports frame lengths up to 81, batch sizes, and resolutions like 480x720 (configurable via nodes like WIDTH, HEIGHT, LENGTH).
Model Variants: Switch between high-fidelity (Q8H/FP8H) and lightweight (Q8L/FP8L) versions of the Wan 2.2 model for performance optimization. Includes SD3 sampling shifts for better motion coherence.
Prompting System: Dual CLIP text encoders for positive/negative prompts. Built-in notes provide example triggers and prompts for NSFW acts (see original for examples).
Audio Integration: Generates ambient audio (e.g., moans, music) using MMAudioSampler with customizable duration, steps, CFG, and prompts. Negative audio prompts avoid low-quality noise or speech.
Post-Processing: VAE decoding for clean frames; image resizing and upscaling; color matching and restoration; video combining with VHS_VideoCombine (supports H264/H265 MP4, ping-pong looping, CRF quality control, and metadata saving). Preview options: Animation preview at 16 FPS and audio playback.
Optimization: VRAM cleanup nodes, CPU/GPU device switching, and batch processing for efficiency. Supports random seeds for variation.
Output: Saves videos/images in folders like "LongVid/%date:yyyy-MM-dd%/%date:hhmmss%" with prefixes (e.g., V for video, I for image, A for audio). Final videos can be upscaled to 50 FPS.
Requirements:
ComfyUI Version: Latest stable (tested on 2024–2026 builds).
Models (place in the appropriate ComfyUI folders: models/unet, models/vae, models/clip_vision, models/text_encoders, etc.):
Main Diffusion Models (Wan 2.2 Enhanced NSFW SVI Camera variants) — from nolightning's Lightning Edition pack:
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8H.gguf →
https://civarchive.com/api/download/models/2540892?type=Model&format=GGUF&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2Q8L.gguf →
https://civarchive.com/api/download/models/2540896?type=Model&format=GGUF&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2FP8H.safetensors → https://civarchive.com/api/download/models/2477539?type=Model&format=SafeTensor&size=full&fp=fp8
wan22EnhancedNSFWSVICamera_nsfwFASTMOVEV2FP8L.safetensors → https://civarchive.com/api/download/models/2477548?type=Model&format=SafeTensor&size=full&fp=fp8
VAE: Wan2.1_VAE.pth →
CLIP Vision: clip_vision_h.safetensors →
CLIP Text Encoder: umt5_xxl_fp8_e4m3fn_scaled.safetensors →
Audio: MMAudio model (via comfyui-mmaudio extension) — install the extension; models are usually auto-downloaded or available in the repo.
Upscale model: 4x_NMKD-Siax_200k →
https://civarchive.com/api/download/models/2052724?type=Model&format=PickleTensor
Custom Nodes/Extensions (install via ComfyUI Manager):
comfyui-gguf (for GGUF model loading).
ComfyUI_Comfyroll_CustomNodes (math/utils).
comfyui-easy-use (cleanGpuUsed, mathFloat).
comfyui-kjnodes (INTConstant, ImageResizeKJv2, LoadVideosFromFolder, PreviewAnimation).
comfyui-videohelpersuite (VHS_VideoCombine).
comfyui-mmaudio (MMAudioSampler, audio preview).
comfyui-image-saver (Sampler/Scheduler selectors).
controlaltai-nodes (TwoWay/ThreeWaySwitch).
ComfyLiterals (Float node).
comfyui_memory_cleanup (VRAMCleanup).
Hardware: GPU with at least 12GB VRAM recommended for high-quality runs (e.g., 81-frame videos). CPU fallback available for some nodes.
How to Use:
Load the Workflow: Import the JSON into ComfyUI.
Input Image: Connect an image to the "IMAGE" node (e.g., via Load Image). Resize settings are in the "LOAD IMAGE & RESIZE" group.
Prompts: Edit the POSITIVE/NEGATIVE nodes with your description. Use the built-in trigger words for best NSFW results.
Settings: Adjust in "VIDEO SETTINGS" group:
Resolution: WIDTH/HEIGHT (default 480x720).
Frames: LENGTH (default 81), STEPS (default 8), CFG (default 1).
Seed: Randomize for variations.
Sampler/Scheduler: Euler Ancestral + Simple (defaults).
Batch Size: 1 (increase for multiples).
Run: Queue the prompt. Monitor VRAM with cleanup nodes.
Outputs: Videos save to ComfyUI/output/LongVid (customizable). Preview animation and audio in the workflow.
Advanced: Toggle high/low model switches for quality vs. speed. Add audio prompts in MMAudioSampler. Upscale in the "UPSCALE" group for smoother 50 FPS output.
Tips for Best Results:
NSFW Focus: Start with the example prompts in the notes for fluid motion (e.g., thrusting, jiggling). Avoid overlong prompts to prevent artifacts.
Audio Sync: Match audio duration to video length (default 10s). Use positive prompts like "moans, sensual sounds" and negatives to avoid distortion.
Performance: For low VRAM, use GGUF low models and disable audio/upscaling. Force offload in MMAudioSampler if needed.
Customization: Experiment with LoRAs (loaded in "LOAD LORA'S" group) for specific styles. Negative prompts handle artifacts like blur, distortion, or bad anatomy.
This workflow is optimized for explicit, high-detail NSFW I2V—perfect for creators exploring sensual animations.
Disclaimer:
This workflow is provided for entertainment, artistic, and creative purposes only.
It may not be used for any illegal, harmful, non-consensual, or malicious activities.
Please use it responsibly and respect all applicable laws and ethical guidelines.
Description
PainterLongVideo: Advanced node with expanded inputs: positive/negative, VAE, width/height/length/batch_size + new ones like previous_video (full prior segment for continuity), motion_frames (3–7 trailing frames for motion ref, default 10), motion_amplitude (1.0–2.0 multiplier, recommended 1.15–1.3 for slow-mo fix), initial_reference_image (optional first frame for global coherence), and optional clip_vision_output. If no previous_video, acts as a dynamic keyframe enhancer.
Sage Attention + Torch: added
GGUF OR DIFFUSION switch: added
VIDEO EDITOR: Improved the generation speed.
FAQ
Comments (7)
Great workflow, but getting "can't access property "output", res is undefined" when generating a 7 second video, non 50fps, with audio. Any ideas?
Uhm Do you have values filled in in the video Settings for Width and Height? It should be 480 width and 832 Height or 720 width and 1280 height but it takes longer to generate.
@MrXin yeah, I haven't changed the resolution. I'm getting nothing in the terminal signaling what the problem could be, only a red notification in comfy that says the error I posted above. But I am also now getting "can't access property "workflow", res is undefined". So yeah, not sure what is going on.
@MrXin Figured it out. Was a conflict with RES4LYF custom node. Disabled it and the WF is working fine. Thanks!
Great Workflow ! Thanks mate
Hi there! Do you know how to, or is it possible to, take a wan2.2 gen that has no audio, and run it through a ltx2.3 WF to give it sound without having to prompt it?
Hi, this is a perfect workflow and it's fascinating that it maintains the consistency of the face and eyes. However, I need your help because I want to save the video in the output folder, but since the workflow is predefined this way, it's difficult for me to use. I tried removing the nodes that save the video in that folder, but it just makes everything stop working.