AceStep 1.5 SFT Get Music Infos:
The AceStepSFTMusicAnalyzer is a sophisticated node designed to analyze audio files and extract valuable musical information. It leverages the native ACE-Step Transcriber to identify and extract descriptive tags related to lyrics, vocals, and song structure. Additionally, it utilizes the librosa library to detect the tempo (BPM) and the musical key and scale of the audio. This node is particularly beneficial for AI artists and music producers who wish to gain deeper insights into their audio content, enabling them to make informed decisions in their creative processes. By providing detailed analysis, the AceStepSFTMusicAnalyzer helps users understand the underlying musical elements, enhancing their ability to manipulate and generate music with precision.
AceStep 1.5 SFT Get Music Infos Input Parameters:
audio
This parameter represents the audio file that you wish to analyze. The audio input is crucial as it serves as the primary source from which the node extracts musical information such as style, BPM, and key/scale. The audio should be provided in a compatible format to ensure accurate analysis.
get_tags
This boolean parameter determines whether the node should extract descriptive tags from the audio using the native ACE-Step transcriber. By default, it is set to True, enabling the extraction of tags related to lyrics, vocals, and song structure. These tags provide valuable insights into the audio's content and can be used for further processing or display.
get_bpm
This boolean parameter specifies whether the node should detect the BPM (beats per minute) of the audio using librosa. It is set to True by default, allowing the node to analyze the tempo of the music. Understanding the BPM is essential for tasks such as remixing, synchronization, and tempo matching.
get_keyscale
This boolean parameter indicates whether the node should detect the key and scale of the audio using librosa. By default, it is set to True, enabling the identification of the musical key and scale, which are critical for tasks like harmonization and key matching.
instrumental
This boolean parameter forces the node to operate in instrumental mode, overriding any lyrics with the placeholder [Instrumental]. It is enabled by default, as the baseline quality profile starts from instrumental generation. This setting is useful when analyzing instrumental tracks or when lyrics are not available.
seed
This integer parameter sets the random seed for the analysis process. It has a default value of 0 and can range from 0 to 0xffffffffffffffff. The seed ensures reproducibility of the analysis results, allowing you to obtain consistent outputs across multiple runs.
steps
This integer parameter defines the number of diffusion inference steps, with a default value of 50. It can range from 1 to 200, with a step size of 1. The ACE-Step 1.5 SFT is tuned for 50 steps by default, but increasing the number of steps can lead to higher quality and slower generation.
cfg
This float parameter represents the classifier-free guidance scale, with a default value of 7.0. It can range from 1.0 to 20.0, with a step size of 0.1. The official ACE-Step 1.5 documentation recommends a range of 7.0 to 9.0 for non-turbo quality generation. Adjusting this parameter can influence the balance between creativity and adherence to the input audio.
AceStep 1.5 SFT Get Music Infos Output Parameters:
tags
This output parameter provides a list of descriptive tags extracted from the audio. These tags offer insights into the lyrical content, vocal characteristics, and song structure, aiding in the understanding and categorization of the audio.
bpm
This output parameter indicates the detected BPM (beats per minute) of the audio. The BPM is a crucial metric for understanding the tempo of the music, which is essential for tasks such as remixing, synchronization, and tempo matching.
keyscale
This output parameter reveals the detected key and scale of the audio. Knowing the key and scale is vital for tasks like harmonization, key matching, and musical composition, as it provides a framework for understanding the tonal structure of the music.
music_infos
This output parameter is a JSON-formatted string that consolidates the extracted tags, BPM, and key/scale information. It provides a comprehensive overview of the analyzed audio, making it easy to access and utilize the extracted data for further processing or display.
AceStep 1.5 SFT Get Music Infos Usage Tips:
- Ensure that the audio input is in a compatible format to achieve accurate analysis results.
- Utilize the
get_tags,get_bpm, andget_keyscaleparameters to customize the analysis based on your specific needs, enabling or disabling features as required. - Adjust the
stepsandcfgparameters to fine-tune the quality and speed of the analysis, balancing between precision and performance.
AceStep 1.5 SFT Get Music Infos Common Errors and Solutions:
Tag extraction failed
- Explanation: This error occurs when the node is unable to extract descriptive tags from the audio due to an issue with the ACE-Step transcriber.
- Solution: Ensure that the audio input is clear and in a compatible format. Check for any issues with the ACE-Step transcriber model and try re-running the analysis.
librosa detection failed
- Explanation: This error indicates a failure in detecting the BPM or key/scale using
librosa, possibly due to an incompatible audio format or a problem with thelibrosalibrary. - Solution: Verify that the audio input is in a supported format and that the
librosalibrary is correctly installed and configured. Consider updatinglibrosato the latest version if issues persist.
