RunComfy

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

Z-Image Turbo I2I for Characters | Ultimate Photorealism

Turns portraits into lifelike, perfectly detailed realistic faces fast.

Wan 2.2 Video Restyle | First Frame Restyle for Consistent and Cinematic Video Generation

Change the first frame, folks, your style makes the whole video look amazing. Pure magic.

FLUX LoRA (RealismLoRA) | Photorealistic Images

Blend FLUX-1 model with FLUX-RealismLoRA for photorealistic AI images

ComfyUI > Nodes > ComfyUI-AceStep_SFT

ComfyUI Extension: ComfyUI-AceStep_SFT

Repo Name

ComfyUI-AceStep_SFT

Author
jeankassio (Account age: 3296 days) Nodes
View all nodes(4) Latest Updated
2026-04-01 Github Stars
0.03K

Github Ask jeankassio Current Questions Past Questions

Table of Content

Description
ComfyUI-AceStep_SFT Introduction
How ComfyUI-AceStep_SFT Works
ComfyUI-AceStep_SFT Features
ComfyUI-AceStep_SFT Models
What's New with ComfyUI-AceStep_SFT
Troubleshooting ComfyUI-AceStep_SFT
Learn More about ComfyUI-AceStep_SFT
Related Nodes

How to Install ComfyUI-AceStep_SFT

Install this extension via the ComfyUI Manager by searching for ComfyUI-AceStep_SFT

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-AceStep_SFT in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI-AceStep_SFT Description

ComfyUI-AceStep_SFT is an all-in-one node for ComfyUI, implementing AceStep 1.5 SFT for high-quality music generation. It replicates the Gradio pipeline, providing fine control over audio synthesis parameters.

ComfyUI-AceStep_SFT Introduction

ComfyUI-AceStep_SFT is an innovative extension designed for ComfyUI, a user-friendly interface for AI-based music generation. This extension leverages the AceStep 1.5 SFT (Supervised Fine-Tuning) model, which is a cutting-edge tool for creating high-quality audio. It enhances the official AceStep workflow by providing stronger conditioning control and practical quality options tailored for ComfyUI users. This extension is particularly beneficial for AI artists looking to generate superior audio content with ease and precision.

How ComfyUI-AceStep_SFT Works

At its core, ComfyUI-AceStep_SFT simplifies the complex process of music generation into a series of manageable steps. It starts by creating or loading initial audio latents, which are essentially the building blocks of your music. These latents are then processed through text encoding, where captions, lyrics, and metadata are analyzed using multiple CLIP encoders. The diffusion sampling step follows, where the model applies advanced guidance to refine the audio. Finally, the audio decoding step converts these refined latents into high-quality audio outputs. This process ensures that the generated music is both high in quality and aligned with the user's creative vision.

ComfyUI-AceStep_SFT Features

Advanced Guidance

APG (Adaptive Projected Guidance): Offers dynamic adaptation and noise reduction for the best quality and stability.
ADG (Angle-based Dynamic Guidance): Provides aggressive style distortion, ideal for unique audio effects.
Standard CFG: A traditional guidance method for predictable results.

Intelligent Metadata Processing

Automatically estimates music duration and processes metadata like BPM, time signature, and key/scale.
Supports over 23 languages, making it versatile for global users.

AI Music Analyzer

Extracts audio tags, BPM, and key/scale from input audio, providing structured JSON outputs for easy analysis.

Allows for img2img-style editing, enabling users to refine existing audio with precision.

Extended Conditioning Control

Offers split text/lyric guidance and other advanced controls for nuanced audio generation.

AceStep LoRA Workflow

Supports stacking multiple LoRAs for customized audio effects, with automatic conversion for compatibility.

ComfyUI-AceStep_SFT Models

The extension utilizes the ACE-Step-Transcriber model, which is specifically designed for audio-to-text transcription. This model is ideal for extracting lyrics, vocal tags, and song structure, providing a comprehensive analysis of the audio content.

What's New with ComfyUI-AceStep_SFT

The latest updates include enhanced guidance modes like APG and ADG, which improve the quality and stability of the generated audio. The extension also introduces intelligent metadata processing and a robust AI music analyzer, making it easier for users to create and analyze music. These updates are designed to enhance the user experience and provide more control over the music generation process.

Troubleshooting ComfyUI-AceStep_SFT

Common Issues and Solutions

Audio Distortion/Clipping: Adjust the latent_shift parameter to reduce amplitude before decoding.
High Variance Results: Increase the apg_norm_threshold for better gradient clipping.
Lower Than Expected Quality: Use the recommended settings for guidance mode and steps to improve output quality.
LoRA Issues: Adjust strength_model and strength_clip settings for better integration with LoRAs.

Learn More about ComfyUI-AceStep_SFT

For further learning and support, explore the following resources:

ComfyUI GitHub Repository
AceStep 1.5 SFT Model on HuggingFace
Community forums and tutorials available through the ComfyUI community for peer support and shared experiences. This comprehensive guide aims to make ComfyUI-AceStep_SFT accessible and beneficial for AI artists, providing the tools and knowledge needed to create exceptional audio content.

ComfyUI-AceStep_SFT Related Nodes

AceStep 1.5 SFT Generate

AceStep 1.5 SFT Lora Loader

AceStep 1.5 SFT Get Music Infos

AceStep 1.5 SFT Turbo Tag Adapter

Table of Content

Description
ComfyUI-AceStep_SFT Introduction
How ComfyUI-AceStep_SFT Works
ComfyUI-AceStep_SFT Features
ComfyUI-AceStep_SFT Models
What's New with ComfyUI-AceStep_SFT
Troubleshooting ComfyUI-AceStep_SFT
Learn More about ComfyUI-AceStep_SFT
Related Nodes

Face Detailer | Fix Faces

Use Face Detailer first for facial restoration, followed by the 4x UltraSharp Model for superior upscaling.

SCAIL Model | Pose-Guided Animation Maker

Pose-driven animation with identity stability and motion precision.

Wan 2.1 Ditto | Cinematic Video Restyle Generator

Transform videos into stunning artistic styles with perfect motion flow.

MultiTalk | Photo to Talking Video

Millisecond lip sync + Wan2.1 = 15s ultra-detailed talking videos!

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Extension: ComfyUI-AceStep_SFT

ComfyUI-AceStep_SFT

How to Install ComfyUI-AceStep_SFT

ComfyUI-AceStep_SFT Description

ComfyUI-AceStep_SFT Introduction

How ComfyUI-AceStep_SFT Works

ComfyUI-AceStep_SFT Features

Advanced Guidance

Intelligent Metadata Processing

AI Music Analyzer

Audio Refinement

Extended Conditioning Control

AceStep LoRA Workflow

ComfyUI-AceStep_SFT Models

What's New with ComfyUI-AceStep_SFT

Troubleshooting ComfyUI-AceStep_SFT

Common Issues and Solutions

Learn More about ComfyUI-AceStep_SFT

ComfyUI-AceStep_SFT Related Nodes