RunComfy

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

LatentSync| Lip Sync Model

Advanced audio-driven lip sync technology.

LTX-2 ComfyUI | Real-Time Video Generator

Create real-time videos instantly, faster than any other generator.

FLUX Inpainting | Seamless Image Editing

Effortlessly fill, remove, and refine images, seamlessly integrating new content.

ComfyUI > Nodes > ComfyUI-MelBandRoFormer > Mel-Band RoFormer Sampler

ComfyUI Node: Mel-Band RoFormer Sampler

Class Name

MelBandRoFormerSampler

Category
Mel-Band RoFormer

Author
kijai (Account age: 2722days) Extension
ComfyUI-MelBandRoFormer Latest Updated
2025-08-26 Github Stars
0.08K

Github Ask kijai Current Questions Past Questions

Table of Content

Description
MelBandRoFormerSampler:
MelBandRoFormerSampler Input Parameters:
MelBandRoFormerSampler Output Parameters:
MelBandRoFormerSampler Usage Tips:
MelBandRoFormerSampler Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-MelBandRoFormer

Install this extension via the ComfyUI Manager by searching for ComfyUI-MelBandRoFormer

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-MelBandRoFormer in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Mel-Band RoFormer Sampler Description

Sophisticated audio processing node for separating audio signals using machine learning and STFT.

Mel-Band RoFormer Sampler:

The MelBandRoFormerSampler is a sophisticated audio processing node designed to separate audio signals into distinct components, such as vocals and instruments, using advanced machine learning techniques. It leverages a combination of Short-Time Fourier Transform (STFT) and mel filter banks to analyze and process audio data, making it particularly effective for tasks like music source separation. The node utilizes a transformer-based architecture, which allows it to handle complex audio patterns and deliver high-quality separation results. By employing multi-resolution STFT loss and mask estimation, it ensures that the separated audio components maintain their fidelity and clarity. This node is ideal for AI artists and audio engineers looking to enhance their audio processing capabilities, offering a powerful tool for creative audio manipulation and analysis.

Mel-Band RoFormer Sampler Input Parameters:

model

The model parameter refers to the pre-trained model used for processing the audio input. This model is responsible for performing the audio separation task, utilizing its learned parameters to distinguish between different audio components. The choice of model can significantly impact the quality of the output, as different models may be trained for specific types of audio or separation tasks.

audio

The audio parameter is the input audio data that the node will process. It typically includes the waveform and sample rate of the audio. The waveform is a tensor representing the audio signal, while the sample rate indicates the number of samples per second. The audio input must be properly formatted and may need to be converted to stereo or resampled to match the expected sample rate for optimal processing.

Mel-Band RoFormer Sampler Output Parameters:

vocals_out

The vocals_out parameter provides the separated vocal component of the input audio. It includes the waveform of the vocals and the sample rate, allowing you to easily access and utilize the isolated vocal track for further processing or creative projects. This output is crucial for tasks like karaoke creation or vocal analysis.

instruments_out

The instruments_out parameter delivers the separated instrumental component of the input audio. Similar to vocals_out, it includes the waveform and sample rate, enabling you to work with the isolated instrumental track. This output is valuable for remixing, instrumental analysis, or creating backing tracks.

Mel-Band RoFormer Sampler Usage Tips:

Ensure that your input audio is in stereo format and matches the expected sample rate to achieve the best separation results.
Experiment with different pre-trained models to find the one that best suits your specific audio separation needs, as different models may excel in different scenarios.
Utilize the separated vocal and instrumental outputs for creative projects, such as remixing, karaoke, or audio analysis, to fully leverage the capabilities of the node.

Mel-Band RoFormer Sampler Common Errors and Solutions:

"stereo needs to be set to True if passing in audio signal that is stereo"

Explanation: This error occurs when the input audio is stereo, but the node is not configured to process stereo audio.
Solution: Ensure that the stereo parameter is set to True when processing stereo audio inputs.

"all frequencies need to be covered by all bands for now"

Explanation: This error indicates that the mel filter bank does not cover all necessary frequency bands.
Solution: Verify that the mel filter bank is correctly configured and covers the full range of frequencies required for processing.

"Resampling input `{sample_rate}` to `{sr}`"

Explanation: This message indicates that the input audio sample rate does not match the expected sample rate, prompting a resampling operation.
Solution: Ensure that your input audio is already at the expected sample rate to avoid unnecessary resampling, which can affect audio quality.

Mel-Band RoFormer Sampler Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-MelBandRoFormer

Table of Content

Description
MelBandRoFormerSampler:
MelBandRoFormerSampler Input Parameters:
MelBandRoFormerSampler Output Parameters:
MelBandRoFormerSampler Usage Tips:
MelBandRoFormerSampler Common Errors and Solutions:
Related Nodes

Z Image Turbo | Ultra-Fast Photorealistic Generator

Generate ultra-clear visuals fast with unmatched real-time detail.

Fantasy Portrait | Expressive Photo Animation

Photo → expressive cinematic face animation, fast and identity-accurate.

InstantCharacter

One photo, endless characters. Perfect identity preservation.

Flux UltraRealistic LoRA V2

Create stunningly lifelike image with Flux UltraRealistic LoRA V2

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: Mel-Band RoFormer Sampler

MelBandRoFormerSampler

How to Install ComfyUI-MelBandRoFormer

Mel-Band RoFormer Sampler Description

Mel-Band RoFormer Sampler:

Mel-Band RoFormer Sampler Input Parameters:

model

audio

Mel-Band RoFormer Sampler Output Parameters:

vocals_out

instruments_out

Mel-Band RoFormer Sampler Usage Tips:

Mel-Band RoFormer Sampler Common Errors and Solutions:

"stereo needs to be set to True if passing in audio signal that is stereo"

"all frequencies need to be covered by all bands for now"

"Resampling input {sample_rate} to {sr}"

Mel-Band RoFormer Sampler Related Nodes

"Resampling input `{sample_rate}` to `{sr}`"