RunComfy

Flux Klein Face Swap | Realistic AI Face Editor

Swap faces perfectly. Natural, lifelike, and fast AI-powered editing.

IPAdapter Plus (V2) | Change Clothes

Use IPAdapter Plus for your fashion model creation, easily changing outfits and styles

Flux Kontext Character Turnaround Sheet LoRA

Generate 5-pose character turnaround sheets from single image

Wan 2.2 | Open-Source Video Gen Leader

Available now! Better precision + smoother motion.

ComfyUI > Nodes > ComfyUI-AudioSR

ComfyUI Extension: ComfyUI-AudioSR

Repo Name

ComfyUI-AudioSR

Author
Saganaki22 (Account age: 0 days) Nodes
View all nodes(1) Latest Updated
2026-03-21 Github Stars
0.07K

Github Ask Saganaki22 Current Questions Past Questions

Table of Content

Description
ComfyUI-AudioSR Introduction
How ComfyUI-AudioSR Works
ComfyUI-AudioSR Features
ComfyUI-AudioSR Models
What's New with ComfyUI-AudioSR
Troubleshooting ComfyUI-AudioSR
Learn More about ComfyUI-AudioSR
Related Nodes

How to Install ComfyUI-AudioSR

Install this extension via the ComfyUI Manager by searching for ComfyUI-AudioSR

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-AudioSR in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI-AudioSR Description

ComfyUI-AudioSR enhances audio quality by integrating advanced super-resolution techniques into the ComfyUI framework. It efficiently upscales low-resolution audio, improving clarity and detail.

ComfyUI-AudioSR Introduction

ComfyUI-AudioSR is an innovative extension designed to enhance the quality of audio files by upscaling them to a high-fidelity 48kHz output. This tool is particularly useful for AI artists and audio enthusiasts who wish to improve the clarity and richness of their audio content, whether it be music, speech, or sound effects. By leveraging state-of-the-art latent diffusion techniques, ComfyUI-AudioSR can transform low-quality audio into a more vibrant and detailed sound experience. This extension is seamlessly integrated into the ComfyUI environment, making it accessible and easy to use for those familiar with this platform.

How ComfyUI-AudioSR Works

At its core, ComfyUI-AudioSR uses a process called latent diffusion to enhance audio quality. Imagine your audio file as a painting with faded colors. Latent diffusion acts like a digital artist, carefully restoring and enhancing the colors to make the painting vibrant again. Similarly, this extension analyzes the audio file, identifies areas that lack detail, and fills in the gaps to produce a clearer and more detailed sound. It does this by resampling the audio to a higher frequency, enhancing high frequencies, and reducing artifacts that often plague low-quality audio files.

ComfyUI-AudioSR Features

Audio Super Resolution: Upscales audio to 48kHz, enhancing high frequencies for a richer sound.
Native ComfyUI Integration: Works smoothly with ComfyUI's Load, Preview, and Save Audio nodes.
Spectrogram Visualization: Provides a visual comparison of audio before and after processing.
Automatic Sample Rate Handling: Accepts various input sample rates and adjusts them to 48kHz.
Stereo Support: Processes both mono and stereo audio, handling each channel independently.
Long Audio Support: Uses smart chunking to process long audio files without length limitations.
Model Caching: Keeps the model in memory for faster processing of subsequent audio files.
torch.compile Optimization: Offers a speed boost for FP32 models through PyTorch compilation.
VRAM Management: Option to unload the model to free up GPU memory between runs.
Interruptible Processing: Allows you to cancel processing mid-run using ComfyUI's interrupt button.
Progress Reporting: Displays a real-time progress bar to track chunk processing status.

ComfyUI-AudioSR Models

ComfyUI-AudioSR offers different models tailored for specific audio types:

audiosr_basic_fp32.safetensors: A general-purpose model suitable for music, sound effects, and various audio types.
audiosr_speech_fp32.safetensors: Optimized for voice and speech content, providing enhanced clarity for spoken words. Choosing the right model depends on the type of audio you are working with. For instance, if you are enhancing a podcast or a speech recording, the speech model would be more appropriate.

What's New with ComfyUI-AudioSR

Version 1.1.1

Fixed tensor dimension mismatch for small chunks, ensuring smoother processing.

Version 1.1.0

Introduced SageAttention support for faster processing on compatible GPUs.
Added a dtype selector for better control over compute precision.

Version 1.0.6

Resolved chunk positioning issues to eliminate volume drops in long audio files.
Improved overlap-add normalization for consistent amplitude across chunks.

Troubleshooting ComfyUI-AudioSR

Common Issues and Solutions

Model Not Found Error: Ensure models are downloaded from HuggingFace and placed in the correct directory.
CUDA Out of Memory: Enable unload_model to free VRAM or reduce chunk_size.
Poor Audio Quality: Adjust guidance_scale and ddim_steps for better results.
No Output Audio: Verify connections in ComfyUI and check for error messages in the console.
Slow Processing: Use torch.compile for a speed boost and ensure GPU usage.

Learn More about ComfyUI-AudioSR

For further learning and support, consider exploring the following resources:

AudioSR Paper (arXiv) for an in-depth understanding of the underlying technology.
Project Page for additional insights and updates.
ComfyUI GitHub Repository for community support and discussions. These resources provide valuable information and community support to help you make the most of ComfyUI-AudioSR in your audio enhancement projects.

ComfyUI-AudioSR Related Nodes

AudioSR

Table of Content

Description
ComfyUI-AudioSR Introduction
How ComfyUI-AudioSR Works
ComfyUI-AudioSR Features
ComfyUI-AudioSR Models
What's New with ComfyUI-AudioSR
Troubleshooting ComfyUI-AudioSR
Learn More about ComfyUI-AudioSR
Related Nodes

Wan 2.1 | Revolutionary Video Generation

Create incredible videos from text or images with breakthrough AI running on everyday CPUs.

VACE 14B: All-in-One Video Creation & Editing

Create, edit and transform videos with the powerful VACE Wan2.1 14B.

ByteDance USO | Unified Style & Subject Generator

ByteDance USO makes subject and style fusion simple and powerful.

Wan Alpha | Transparent Video Generator

Alpha magic: instant transparent background videos for VFX and design.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy