ComfyUI > Nodes > ComfyUI-MelBandRoFormer > Mel-Band RoFormer Sampler

ComfyUI Node: Mel-Band RoFormer Sampler

Class Name

MelBandRoFormerSampler

Category
Mel-Band RoFormer
Author
kijai (Account age: 2722days)
Extension
ComfyUI-MelBandRoFormer
Latest Updated
2025-08-26
Github Stars
0.08K

How to Install ComfyUI-MelBandRoFormer

Install this extension via the ComfyUI Manager by searching for ComfyUI-MelBandRoFormer
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-MelBandRoFormer in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Mel-Band RoFormer Sampler Description

Sophisticated audio processing node for separating audio signals using machine learning and STFT.

Mel-Band RoFormer Sampler:

The MelBandRoFormerSampler is a sophisticated audio processing node designed to separate audio signals into distinct components, such as vocals and instruments, using advanced machine learning techniques. It leverages a combination of Short-Time Fourier Transform (STFT) and mel filter banks to analyze and process audio data, making it particularly effective for tasks like music source separation. The node utilizes a transformer-based architecture, which allows it to handle complex audio patterns and deliver high-quality separation results. By employing multi-resolution STFT loss and mask estimation, it ensures that the separated audio components maintain their fidelity and clarity. This node is ideal for AI artists and audio engineers looking to enhance their audio processing capabilities, offering a powerful tool for creative audio manipulation and analysis.

Mel-Band RoFormer Sampler Input Parameters:

model

The model parameter refers to the pre-trained model used for processing the audio input. This model is responsible for performing the audio separation task, utilizing its learned parameters to distinguish between different audio components. The choice of model can significantly impact the quality of the output, as different models may be trained for specific types of audio or separation tasks.

audio

The audio parameter is the input audio data that the node will process. It typically includes the waveform and sample rate of the audio. The waveform is a tensor representing the audio signal, while the sample rate indicates the number of samples per second. The audio input must be properly formatted and may need to be converted to stereo or resampled to match the expected sample rate for optimal processing.

Mel-Band RoFormer Sampler Output Parameters:

vocals_out

The vocals_out parameter provides the separated vocal component of the input audio. It includes the waveform of the vocals and the sample rate, allowing you to easily access and utilize the isolated vocal track for further processing or creative projects. This output is crucial for tasks like karaoke creation or vocal analysis.

instruments_out

The instruments_out parameter delivers the separated instrumental component of the input audio. Similar to vocals_out, it includes the waveform and sample rate, enabling you to work with the isolated instrumental track. This output is valuable for remixing, instrumental analysis, or creating backing tracks.

Mel-Band RoFormer Sampler Usage Tips:

  • Ensure that your input audio is in stereo format and matches the expected sample rate to achieve the best separation results.
  • Experiment with different pre-trained models to find the one that best suits your specific audio separation needs, as different models may excel in different scenarios.
  • Utilize the separated vocal and instrumental outputs for creative projects, such as remixing, karaoke, or audio analysis, to fully leverage the capabilities of the node.

Mel-Band RoFormer Sampler Common Errors and Solutions:

"stereo needs to be set to True if passing in audio signal that is stereo"

  • Explanation: This error occurs when the input audio is stereo, but the node is not configured to process stereo audio.
  • Solution: Ensure that the stereo parameter is set to True when processing stereo audio inputs.

"all frequencies need to be covered by all bands for now"

  • Explanation: This error indicates that the mel filter bank does not cover all necessary frequency bands.
  • Solution: Verify that the mel filter bank is correctly configured and covers the full range of frequencies required for processing.

"Resampling input {sample_rate} to {sr}"

  • Explanation: This message indicates that the input audio sample rate does not match the expected sample rate, prompting a resampling operation.
  • Solution: Ensure that your input audio is already at the expected sample rate to avoid unnecessary resampling, which can affect audio quality.

Mel-Band RoFormer Sampler Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-MelBandRoFormer
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.