Check out our quickstart Guide! https://education.civitai.com/quickstart-guide-to-stable-video-diffusion/
The base img2vid model was trained to generate 14 frames at 1024x576, uses less VRAM than the...
img2vid-xt model, trained to generate 25 frames at 1024x576.
img2vid-xt-1.1, the latest version, is finetuned to provide enhanced outputs for the following settings;
Width: 1024
Height: 576
Frames: 25
Motion Bucket ID: 127
FPS: 6
Augmentation Level: 0.00
Stable Video Diffusion (SVD) Image-to-Video is a diffusion model that takes in a still image as a conditioning frame, and generates a video from it. Developed by: Stability AI
Description
This model was trained to generate 14 frames at resolution 576x1024
Details
Downloads
7,945
Platform
CivitAI
Platform Status
Available
Created
11/21/2023
Updated
9/27/2025
Deleted
-
Files
stableVideoDiffusion_img2vid.safetensors
Mirrors
Huggingface (1 mirrors)
CivitAI (2 mirrors)