ComfyUI > Nodes > ComfyUI-AceStep_SFT > AceStep 1.5 SFT Get Music Infos

ComfyUI Node: AceStep 1.5 SFT Get Music Infos

Class Name

AceStepSFTMusicAnalyzer

Category
audio/AceStep SFT
Author
jeankassio (Account age: 3296days)
Extension
ComfyUI-AceStep_SFT
Latest Updated
2026-04-01
Github Stars
0.03K

How to Install ComfyUI-AceStep_SFT

Install this extension via the ComfyUI Manager by searching for ComfyUI-AceStep_SFT
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-AceStep_SFT in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

AceStep 1.5 SFT Get Music Infos Description

Analyzes audio to extract lyrics, vocals, song structure, tempo, key, and scale for music insights.

AceStep 1.5 SFT Get Music Infos:

The AceStepSFTMusicAnalyzer is a sophisticated node designed to analyze audio files and extract valuable musical information. It leverages the native ACE-Step Transcriber to identify and extract descriptive tags related to lyrics, vocals, and song structure. Additionally, it utilizes the librosa library to detect the tempo (BPM) and the musical key and scale of the audio. This node is particularly beneficial for AI artists and music producers who wish to gain deeper insights into their audio content, enabling them to make informed decisions in their creative processes. By providing detailed analysis, the AceStepSFTMusicAnalyzer helps users understand the underlying musical elements, enhancing their ability to manipulate and generate music with precision.

AceStep 1.5 SFT Get Music Infos Input Parameters:

audio

This parameter represents the audio file that you wish to analyze. The audio input is crucial as it serves as the primary source from which the node extracts musical information such as style, BPM, and key/scale. The audio should be provided in a compatible format to ensure accurate analysis.

get_tags

This boolean parameter determines whether the node should extract descriptive tags from the audio using the native ACE-Step transcriber. By default, it is set to True, enabling the extraction of tags related to lyrics, vocals, and song structure. These tags provide valuable insights into the audio's content and can be used for further processing or display.

get_bpm

This boolean parameter specifies whether the node should detect the BPM (beats per minute) of the audio using librosa. It is set to True by default, allowing the node to analyze the tempo of the music. Understanding the BPM is essential for tasks such as remixing, synchronization, and tempo matching.

get_keyscale

This boolean parameter indicates whether the node should detect the key and scale of the audio using librosa. By default, it is set to True, enabling the identification of the musical key and scale, which are critical for tasks like harmonization and key matching.

instrumental

This boolean parameter forces the node to operate in instrumental mode, overriding any lyrics with the placeholder [Instrumental]. It is enabled by default, as the baseline quality profile starts from instrumental generation. This setting is useful when analyzing instrumental tracks or when lyrics are not available.

seed

This integer parameter sets the random seed for the analysis process. It has a default value of 0 and can range from 0 to 0xffffffffffffffff. The seed ensures reproducibility of the analysis results, allowing you to obtain consistent outputs across multiple runs.

steps

This integer parameter defines the number of diffusion inference steps, with a default value of 50. It can range from 1 to 200, with a step size of 1. The ACE-Step 1.5 SFT is tuned for 50 steps by default, but increasing the number of steps can lead to higher quality and slower generation.

cfg

This float parameter represents the classifier-free guidance scale, with a default value of 7.0. It can range from 1.0 to 20.0, with a step size of 0.1. The official ACE-Step 1.5 documentation recommends a range of 7.0 to 9.0 for non-turbo quality generation. Adjusting this parameter can influence the balance between creativity and adherence to the input audio.

AceStep 1.5 SFT Get Music Infos Output Parameters:

tags

This output parameter provides a list of descriptive tags extracted from the audio. These tags offer insights into the lyrical content, vocal characteristics, and song structure, aiding in the understanding and categorization of the audio.

bpm

This output parameter indicates the detected BPM (beats per minute) of the audio. The BPM is a crucial metric for understanding the tempo of the music, which is essential for tasks such as remixing, synchronization, and tempo matching.

keyscale

This output parameter reveals the detected key and scale of the audio. Knowing the key and scale is vital for tasks like harmonization, key matching, and musical composition, as it provides a framework for understanding the tonal structure of the music.

music_infos

This output parameter is a JSON-formatted string that consolidates the extracted tags, BPM, and key/scale information. It provides a comprehensive overview of the analyzed audio, making it easy to access and utilize the extracted data for further processing or display.

AceStep 1.5 SFT Get Music Infos Usage Tips:

  • Ensure that the audio input is in a compatible format to achieve accurate analysis results.
  • Utilize the get_tags, get_bpm, and get_keyscale parameters to customize the analysis based on your specific needs, enabling or disabling features as required.
  • Adjust the steps and cfg parameters to fine-tune the quality and speed of the analysis, balancing between precision and performance.

AceStep 1.5 SFT Get Music Infos Common Errors and Solutions:

Tag extraction failed

  • Explanation: This error occurs when the node is unable to extract descriptive tags from the audio due to an issue with the ACE-Step transcriber.
  • Solution: Ensure that the audio input is clear and in a compatible format. Check for any issues with the ACE-Step transcriber model and try re-running the analysis.

librosa detection failed

  • Explanation: This error indicates a failure in detecting the BPM or key/scale using librosa, possibly due to an incompatible audio format or a problem with the librosa library.
  • Solution: Verify that the audio input is in a supported format and that the librosa library is correctly installed and configured. Consider updating librosa to the latest version if issues persist.

AceStep 1.5 SFT Get Music Infos Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-AceStep_SFT
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

AceStep 1.5 SFT Get Music Infos