ComfyUI > Nodes > ComfyUI-AceStep_SFT

ComfyUI Extension: ComfyUI-AceStep_SFT

Repo Name

ComfyUI-AceStep_SFT

Author
jeankassio (Account age: 3296 days)
Nodes
View all nodes(4)
Latest Updated
2026-04-01
Github Stars
0.03K

How to Install ComfyUI-AceStep_SFT

Install this extension via the ComfyUI Manager by searching for ComfyUI-AceStep_SFT
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-AceStep_SFT in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI-AceStep_SFT Description

ComfyUI-AceStep_SFT is an all-in-one node for ComfyUI, implementing AceStep 1.5 SFT for high-quality music generation. It replicates the Gradio pipeline, providing fine control over audio synthesis parameters.

ComfyUI-AceStep_SFT Introduction

ComfyUI-AceStep_SFT is an innovative extension designed for ComfyUI, a user-friendly interface for AI-based music generation. This extension leverages the AceStep 1.5 SFT (Supervised Fine-Tuning) model, which is a cutting-edge tool for creating high-quality audio. It enhances the official AceStep workflow by providing stronger conditioning control and practical quality options tailored for ComfyUI users. This extension is particularly beneficial for AI artists looking to generate superior audio content with ease and precision.

How ComfyUI-AceStep_SFT Works

At its core, ComfyUI-AceStep_SFT simplifies the complex process of music generation into a series of manageable steps. It starts by creating or loading initial audio latents, which are essentially the building blocks of your music. These latents are then processed through text encoding, where captions, lyrics, and metadata are analyzed using multiple CLIP encoders. The diffusion sampling step follows, where the model applies advanced guidance to refine the audio. Finally, the audio decoding step converts these refined latents into high-quality audio outputs. This process ensures that the generated music is both high in quality and aligned with the user's creative vision.

ComfyUI-AceStep_SFT Features

Advanced Guidance

  • APG (Adaptive Projected Guidance): Offers dynamic adaptation and noise reduction for the best quality and stability.
  • ADG (Angle-based Dynamic Guidance): Provides aggressive style distortion, ideal for unique audio effects.
  • Standard CFG: A traditional guidance method for predictable results.

Intelligent Metadata Processing

  • Automatically estimates music duration and processes metadata like BPM, time signature, and key/scale.
  • Supports over 23 languages, making it versatile for global users.

AI Music Analyzer

  • Extracts audio tags, BPM, and key/scale from input audio, providing structured JSON outputs for easy analysis.

Audio Refinement

  • Allows for img2img-style editing, enabling users to refine existing audio with precision.

Extended Conditioning Control

  • Offers split text/lyric guidance and other advanced controls for nuanced audio generation.

AceStep LoRA Workflow

  • Supports stacking multiple LoRAs for customized audio effects, with automatic conversion for compatibility.

ComfyUI-AceStep_SFT Models

The extension utilizes the ACE-Step-Transcriber model, which is specifically designed for audio-to-text transcription. This model is ideal for extracting lyrics, vocal tags, and song structure, providing a comprehensive analysis of the audio content.

What's New with ComfyUI-AceStep_SFT

The latest updates include enhanced guidance modes like APG and ADG, which improve the quality and stability of the generated audio. The extension also introduces intelligent metadata processing and a robust AI music analyzer, making it easier for users to create and analyze music. These updates are designed to enhance the user experience and provide more control over the music generation process.

Troubleshooting ComfyUI-AceStep_SFT

Common Issues and Solutions

  • Audio Distortion/Clipping: Adjust the latent_shift parameter to reduce amplitude before decoding.
  • High Variance Results: Increase the apg_norm_threshold for better gradient clipping.
  • Lower Than Expected Quality: Use the recommended settings for guidance mode and steps to improve output quality.
  • LoRA Issues: Adjust strength_model and strength_clip settings for better integration with LoRAs.

Learn More about ComfyUI-AceStep_SFT

For further learning and support, explore the following resources:

  • ComfyUI GitHub Repository
  • AceStep 1.5 SFT Model on HuggingFace
  • Community forums and tutorials available through the ComfyUI community for peer support and shared experiences. This comprehensive guide aims to make ComfyUI-AceStep_SFT accessible and beneficial for AI artists, providing the tools and knowledge needed to create exceptional audio content.

ComfyUI-AceStep_SFT Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

ComfyUI-AceStep_SFT detailed guide | ComfyUI