ComfyUI > Nodes > ComfyUI-FL-AceStep-Training > FL AceStep Auto-Label Samples

ComfyUI Node: FL AceStep Auto-Label Samples

Class Name

FL_AceStep_LabelSamples

Category
FL AceStep/Dataset
Author
filliptm (Account age: 0days)
Extension
ComfyUI-FL-AceStep-Training
Latest Updated
2026-03-19
Github Stars
0.1K

How to Install ComfyUI-FL-AceStep-Training

Install this extension via the ComfyUI Manager by searching for ComfyUI-FL-AceStep-Training
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-FL-AceStep-Training in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

FL AceStep Auto-Label Samples Description

Automatically labels audio samples with metadata using a language model in the ACE-Step framework.

FL AceStep Auto-Label Samples:

The FL_AceStep_LabelSamples node is designed to automatically label audio samples by generating metadata using a language model. This node is part of the ACE-Step framework and leverages the capabilities of a large language model (LLM) to produce detailed metadata for audio files. The metadata includes captions or descriptions, genre tags, BPM (tempo), key or scale, time signature, language, and lyrics. By automating the labeling process, this node significantly reduces the manual effort required to annotate audio datasets, making it an invaluable tool for AI artists and developers working with large audio collections. The node uses the ACE-Step model for audio tokenization and encoding, ensuring that the metadata generated is accurate and relevant to the audio content.

FL AceStep Auto-Label Samples Input Parameters:

dataset

The dataset parameter refers to the collection of audio samples that need to be labeled. It is typically obtained from the Scan Directory node, which scans a directory for audio files and prepares them for processing. This parameter is crucial as it provides the raw audio data that the node will process to generate metadata.

model

The model parameter is the ACE-Step MODEL, which is used for audio tokenization. This model is essential for converting audio signals into a format that can be processed by the node. It ensures that the audio data is accurately represented, which is critical for generating precise metadata.

vae

The vae parameter stands for Variational Autoencoder, which is used for audio encoding. This component is responsible for converting audio into discrete codes that can be understood by the language model. The VAE plays a vital role in ensuring that the audio is encoded efficiently, allowing for accurate metadata generation.

llm

The llm parameter is the Large Language Model used for metadata generation. This model analyzes the encoded audio data and generates descriptive metadata, including captions, genre tags, and other relevant information. The LLM is a key component in the node's ability to produce detailed and accurate metadata for each audio sample.

FL AceStep Auto-Label Samples Output Parameters:

labeled_samples

The labeled_samples output parameter contains the audio samples along with their generated metadata. This output is the result of the node's processing and includes all the metadata fields such as captions, genre tags, BPM, key, time signature, language, and lyrics. This comprehensive metadata can be used for further analysis, training, or categorization of audio datasets.

FL AceStep Auto-Label Samples Usage Tips:

  • Ensure that the dataset parameter is correctly set up with audio files that are ready for labeling. This will help the node process the samples efficiently and generate accurate metadata.
  • Use a well-trained ACE-Step MODEL and VAE to ensure that the audio tokenization and encoding are performed accurately, which will improve the quality of the metadata generated by the LLM.

FL AceStep Auto-Label Samples Common Errors and Solutions:

Audio encoding failed for sample <idx>: <error_message>

  • Explanation: This error occurs when the node fails to encode an audio sample into discrete codes using the VAE and tokenizer.
  • Solution: Check the audio file format and ensure it is supported. Verify that the VAE and tokenizer are correctly configured and compatible with the audio data.

No audio codes for sample <idx>, skipping LLM labeling

  • Explanation: This warning indicates that the node could not generate audio codes for a sample, preventing the LLM from generating metadata.
  • Solution: Ensure that the audio file is not corrupted and that the VAE and tokenizer are functioning correctly. Consider increasing the maximum duration parameter if the audio file is longer than expected.

FL AceStep Auto-Label Samples Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-FL-AceStep-Training
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.