Originally posted: https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B
https://github.com/ace-step/ACE-Step
Model Description
ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holistic architectural design. It integrates diffusion-based generation with Sana's Deep Compression AutoEncoder (DCAE) and a lightweight linear transformer, achieving state-of-the-art performance in generation speed, musical coherence, and controllability.
Key Features:
15× faster than LLM-based baselines (20s for 4-minute music on A100)
Superior musical coherence across melody, harmony, and rhythm
full-song generation, duration control and accepts natural language descriptions
Uses
Direct Use
ACE-Step can be used for:
Generating original music from text descriptions
Music remixing and style transfer
edit song lyrics
Downstream Use
The model serves as a foundation for:
Voice cloning applications
Specialized music generation (rap, jazz, etc.)
Music production tools
Creative AI assistants
Description
FAQ
Comments (1)
music ai model on an image ai model website before GTA6?!?!??!??!?!??!?!
Details
Files
aceStepAudioGen_v15Turbo.safetensors
Mirrors
model.safetensors
acestep_v1.5_turbo.safetensors
acestep_v1.5_turbo.safetensors
model.safetensors
ace_step_v1_5_transformer_bf16.safetensors
acestep_v1.5_turbo.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
model.safetensors
acestep_v1.5_turbo.safetensors
model.safetensors
