RunComfy

ComfyUI Extension: ComfyUI-AudioX

Repo Name: ComfyUI-StableAudioX
Author: lum3on (Account age: 314 days)
Nodes: 15
Latest Updated: 2025-06-24
Github Stars: 0.04K

Table of Contents

Description
ComfyUI-StableAudioX Introduction
How ComfyUI-StableAudioX Works
ComfyUI-StableAudioX Features
ComfyUI-StableAudioX Models
Troubleshooting ComfyUI-StableAudioX
Learn More about ComfyUI-StableAudioX
Related Nodes

How to Install ComfyUI-AudioX

Install this extension via the ComfyUI Manager by searching for ComfyUI-AudioX:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI-AudioX in the search bar.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

ComfyUI-AudioX Description

ComfyUI-AudioX is an advanced extension for ComfyUI, enabling high-quality audio synthesis by integrating AudioX models. It transforms text and video inputs into rich audio outputs.

ComfyUI-StableAudioX Introduction

ComfyUI-StableAudioX is a ComfyUI extension for audio generation. It integrates the AudioX models, which are fine-tuned versions of stable audio tools, to synthesize high-quality audio from both text and video inputs. You can generate audio from a simple text description or create a musical composition with a specific style and mood. It is particularly useful for AI artists who want to explore audio generation without a complex technical setup. The extension is intended for systems with at least 16GB of VRAM.

How ComfyUI-StableAudioX Works

At its core, ComfyUI-StableAudioX uses machine learning models to transform text and video inputs into audio outputs. Think of it as a translator that converts written or visual ideas into sound. The extension relies on a process called "conditioning": your input guides the generation, and adjustable parameters control how closely the generated audio matches your description. For instance, if you enter a text description of a serene forest, the extension generates audio that captures that environment, with ambient sounds like rustling leaves and chirping birds. By breaking audio generation into manageable steps, ComfyUI-StableAudioX lets users create professional-quality audio without extensive technical knowledge.
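To make the idea of conditioning strength more concrete, here is a minimal sketch of classifier-free guidance (CFG), a technique commonly used in diffusion models to control how strongly a prompt steers the output. This is an illustration under assumptions, not the extension's actual code: the function name `cfg_combine` and the plain-list inputs are hypothetical, and the page does not document how AudioX applies guidance internally.

```python
def cfg_combine(cond, uncond, cfg_scale):
    """Blend two model predictions with classifier-free guidance.

    cond:      prediction made with the prompt (hypothetical plain list)
    uncond:    prediction made without the prompt, or with a negative prompt
    cfg_scale: 0 ignores the prompt, 1 uses the conditioned prediction as-is,
               larger values push further away from the unconditioned one
    """
    return [u + cfg_scale * (c - u) for c, u in zip(cond, uncond)]


# Toy numbers standing in for a model's predictions.
cond = [1.0, 2.0]
uncond = [0.0, 1.0]

print(cfg_combine(cond, uncond, 1.0))  # same as cond
print(cfg_combine(cond, uncond, 3.0))  # extrapolated past cond
```

In this formulation a higher scale follows the prompt more closely, and passing a negative-prompt prediction as `uncond` steers the result away from it.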

ComfyUI-StableAudioX Features

ComfyUI-StableAudioX offers a range of features designed to cater to different audio generation needs:

Text to Audio: Convert text descriptions into high-quality audio, with options to enhance the conditioning for more accurate results.
Text to Music: Create music by specifying style, tempo, and mood, allowing for a personalized musical experience.
Video to Audio: Extract and generate audio from video content, providing a seamless way to add soundtracks to visual media.
Enhanced Conditioning: Customize the audio output with separate CFG scales, conditioning weights, and negative prompting to avoid unwanted audio characteristics.
Professional Audio Processing: Utilize volume control with LUFS normalization, limiting, and precise gain staging to ensure your audio meets professional standards.
Video Processing: Mute videos and combine them with generated audio to create cohesive multimedia projects.
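The gain staging and limiting mentioned in the list above can be sketched in a few lines. This is a simplified illustration, not the extension's implementation: all function names are hypothetical, RMS level in dBFS is used as a stand-in for loudness (true LUFS measurement requires the K-weighting and gating defined in ITU-R BS.1770), and the limiter is a plain hard clip rather than a look-ahead design.

```python
import math


def db_to_gain(db):
    """Convert a level change in decibels to a linear amplitude factor."""
    return 10.0 ** (db / 20.0)


def rms_dbfs(samples):
    """RMS level in dBFS for samples in the range [-1.0, 1.0]."""
    mean_sq = sum(s * s for s in samples) / len(samples)
    if mean_sq == 0:
        return float("-inf")  # digital silence
    return 20.0 * math.log10(math.sqrt(mean_sq))


def normalize_rms(samples, target_dbfs):
    """Apply one gain so the RMS level lands on target_dbfs.

    Assumption: RMS is only a rough proxy for LUFS.
    """
    current = rms_dbfs(samples)
    if current == float("-inf"):
        return list(samples)  # nothing to scale
    gain = db_to_gain(target_dbfs - current)
    return [s * gain for s in samples]


def hard_limit(samples, ceiling_dbfs=-1.0):
    """Clamp peaks to a ceiling. This is hard clipping, the crudest limiter."""
    ceiling = db_to_gain(ceiling_dbfs)
    return [max(-ceiling, min(ceiling, s)) for s in samples]


# Normalize a toy signal, then cap any peaks that exceed the ceiling.
signal = [0.5, -0.5, 0.25, -0.25]
processed = hard_limit(normalize_rms(signal, target_dbfs=-14.0))
print(round(rms_dbfs(processed), 2))
```

Normalizing first and limiting second is the usual order: the gain stage sets the overall level, and the limiter only catches peaks that the gain pushed above the ceiling.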

ComfyUI-StableAudioX Models

The extension uses the AudioX models, which are specifically designed for high-quality audio synthesis. These models are fine-tuned to handle various audio generation tasks, from simple text-to-audio conversions to complex video-to-audio transformations. By using these models, you can expect consistent and reliable audio outputs that align with your creative vision.

Troubleshooting ComfyUI-StableAudioX

Here are some common issues you might encounter while using ComfyUI-StableAudioX and how to resolve them:

Installation Problems: Ensure all system dependencies, like ffmpeg and Microsoft Visual C++ Build Tools, are installed. If you encounter package conflicts, consider using a fresh virtual environment.
Model Not Found: Verify that the model files are correctly placed in the ComfyUI/models/diffusion_models/ directory and that both the model file and config.json are present.
Frontend Errors: If you experience errors like "beforeQueued," try refreshing your browser, clearing the cache, or restarting ComfyUI.
Memory Issues: For VRAM or RAM-related problems, reduce batch sizes, use CPU mode for large models, or lower CFG scales.
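The first two checks in the list above can be automated with a short script. This is a sketch under assumptions rather than an official tool: the function names are hypothetical, the weight file extensions (`.ckpt`, `.safetensors`) are guesses because the page does not name the model file, and the search is recursive because the page does not say whether the files sit directly in the directory or in a subfolder.

```python
import shutil
from pathlib import Path

# Assumption: typical checkpoint extensions; the page does not name the file.
WEIGHT_SUFFIXES = {".ckpt", ".safetensors"}


def check_ffmpeg():
    """Return a list with one problem if ffmpeg is not on PATH, else empty."""
    return [] if shutil.which("ffmpeg") else ["ffmpeg not found on PATH"]


def check_model_files(comfy_root):
    """Look for a weights file and config.json under models/diffusion_models."""
    model_dir = Path(comfy_root) / "models" / "diffusion_models"
    if not model_dir.is_dir():
        return [f"missing directory: {model_dir}"]

    problems = []
    files = [p for p in model_dir.rglob("*") if p.is_file()]
    if not any(p.suffix.lower() in WEIGHT_SUFFIXES for p in files):
        problems.append(f"no model weights found under {model_dir}")
    if not any(p.name == "config.json" for p in files):
        problems.append(f"no config.json found under {model_dir}")
    return problems


if __name__ == "__main__":
    # "ComfyUI" is a placeholder; point this at your own install directory.
    for problem in check_ffmpeg() + check_model_files("ComfyUI"):
        print("WARNING:", problem)
```

Run it from the directory that contains your ComfyUI folder. No output means both checks passed.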

Learn More about ComfyUI-StableAudioX

To further explore the capabilities of ComfyUI-StableAudioX, you can visit the GitHub repository for additional resources, including example workflows and community support. Engaging with the community can provide valuable insights and tips to enhance your audio generation projects.

ComfyUI-AudioX Related Nodes

AudioX Advanced Volume Control

AudioX Audio Processor

AudioX Enhanced Text to Audio

AudioX Enhanced Text to Music

AudioX Enhanced Video to Audio

AudioX Model Loader

AudioX Multi-Modal Generation

AudioX Prompt Helper

AudioX Text to Audio

AudioX Text to Music

AudioX Video Audio Combiner

AudioX Video Muter

AudioX Video to Audio

AudioX Video to Music

AudioX Volume Control
