A sophisticated audio processing node for the ChatterBox framework, enabling seamless TTS integration and multilingual audio analysis.
The ChatterBoxAudioAnalyzer is a node for processing and analyzing audio data within the ChatterBox framework. Its primary purpose is to convert and analyze audio segments, enabling straightforward integration with text-to-speech (TTS) systems. The node supports the analysis of audio streams and the generation of audio segments from text input, making it useful for anyone looking to extend their audio processing pipeline. Because it handles multiple languages and processes audio in chunks, generation stays efficient and scalable, which makes it a practical tool for AI artists and developers building multilingual TTS applications.
The enable_chunking parameter determines whether the input text should be divided into smaller chunks for processing. This is particularly useful for handling long text inputs that exceed the maximum character limit per chunk. When enabled, the node will split the text into manageable segments, ensuring smooth and efficient audio generation. The default value is typically set to true, allowing for automatic chunking, but it can be disabled if the input text is already within acceptable length limits.
The max_chars_per_chunk parameter specifies the maximum number of characters allowed in each text chunk. This parameter is crucial for controlling the size of the text segments processed by the node, directly impacting the performance and quality of the audio output. A typical default value might be around 200 characters, but this can be adjusted based on the specific requirements of the audio processing task. Setting an appropriate value ensures that the node can handle text efficiently without overloading the system.
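As a rough illustration of how these two parameters might interact, the Python sketch below splits text at sentence boundaries so that no chunk exceeds max_chars_per_chunk; the split_into_chunks helper is hypothetical and not the node's actual implementation.

```python
import re

def split_into_chunks(text: str, max_chars_per_chunk: int = 200) -> list[str]:
    """Split text into chunks no longer than max_chars_per_chunk,
    preferring sentence boundaries so each chunk stays natural for TTS."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # Start a new chunk when adding this sentence would exceed the limit.
        if current and len(current) + 1 + len(sentence) > max_chars_per_chunk:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}".strip()
    if current:
        chunks.append(current)
    return chunks

# With enable_chunking disabled, the whole text would be passed through as one chunk.
long_text = "First sentence of a long script. " * 20
print([len(c) for c in split_into_chunks(long_text, max_chars_per_chunk=200)])
```

The node applies its own chunking logic internally; the point of the sketch is that sentence-aware splitting keeps each chunk short enough to process while remaining natural for speech synthesis.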
The voice_refs parameter provides a reference to the audio prompts associated with specific characters or voices. This parameter is essential for generating audio that matches the desired voice characteristics, allowing for personalized and contextually appropriate audio output. The voice references are typically pre-defined and stored in a database or file, ensuring consistency and accuracy in voice generation across different segments.
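A minimal sketch of how voice references could be organized, assuming a simple dictionary that maps character names to reference audio prompts; the file paths and the resolve_voice helper are illustrative, not the node's actual storage format.

```python
from pathlib import Path

# Hypothetical mapping: each character name points at the reference audio prompt
# the TTS engine should imitate when voicing that character's lines.
voice_refs = {
    "narrator": Path("voices/narrator_reference.wav"),
    "alice": Path("voices/alice_reference.wav"),
    "bob": Path("voices/bob_reference.wav"),
}

def resolve_voice(character: str, refs: dict[str, Path]) -> Path:
    """Look up the reference prompt for a character, falling back to the narrator."""
    return refs.get(character.lower(), refs["narrator"])

print(resolve_voice("Alice", voice_refs))    # voices/alice_reference.wav
print(resolve_voice("Unknown", voice_refs))  # falls back to the narrator reference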
The audio_segments output parameter represents the processed audio data generated by the node. Each segment corresponds to a chunk of text that has been converted into audio, providing a clear and coherent representation of the input text. These audio segments are crucial for applications that require high-quality audio output, as they ensure that the generated speech is both intelligible and natural-sounding. The segments can be further processed or combined to create complete audio narratives or dialogues.
The segment_audio_chunks output parameter contains the individual audio chunks generated from each text segment. These chunks are the building blocks of the final audio output, allowing for detailed analysis and manipulation of the audio data. By providing access to these smaller audio units, users can fine-tune the audio output, apply effects, or make adjustments to specific parts of the audio, enhancing the overall quality and customization of the generated speech.
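To illustrate how the per-segment chunks might be post-processed and joined downstream, the sketch below fades and concatenates NumPy waveforms; the 24 kHz sample rate and the apply_fade and assemble helpers are assumptions made for the example, not ChatterBox's actual output format.

```python
import numpy as np

SAMPLE_RATE = 24_000  # assumed sample rate; the actual model may differ

def apply_fade(chunk: np.ndarray, fade_ms: int = 20) -> np.ndarray:
    """Apply a short linear fade-in/out to one chunk to avoid clicks at joins."""
    fade_samples = int(SAMPLE_RATE * fade_ms / 1000)
    faded = chunk.astype(np.float32).copy()
    ramp = np.linspace(0.0, 1.0, fade_samples, dtype=np.float32)
    faded[:fade_samples] *= ramp
    faded[-fade_samples:] *= ramp[::-1]
    return faded

def assemble(segment_audio_chunks: list[np.ndarray], gap_ms: int = 100) -> np.ndarray:
    """Concatenate per-segment chunks with a short silence between them."""
    gap = np.zeros(int(SAMPLE_RATE * gap_ms / 1000), dtype=np.float32)
    pieces = []
    for chunk in segment_audio_chunks:
        pieces.extend([apply_fade(chunk), gap])
    return np.concatenate(pieces[:-1]) if pieces else np.zeros(0, dtype=np.float32)

# Stand-in chunks: half a second of a sine tone each, in place of real TTS output.
t = np.linspace(0, 0.5, SAMPLE_RATE // 2, endpoint=False)
chunks = [np.sin(2 * np.pi * 220 * t).astype(np.float32) for _ in range(3)]
print(assemble(chunks).shape)
```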
Set enable_chunking to true for long text inputs, allowing the node to process text efficiently without exceeding system limits. Adjust the max_chars_per_chunk parameter based on the complexity and length of the input text to maintain a balance between processing speed and audio quality. Use the voice_refs parameter to match the audio output with the desired voice characteristics, ensuring consistency and personalization in multilingual TTS applications.
If the input text exceeds the max_chars_per_chunk parameter, enable the enable_chunking parameter to automatically split the text into smaller chunks, or manually reduce the length of the input text. If the voice_refs parameter is missing or incorrect, verify that the required voice references are defined and point to valid audio prompts.
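A small pre-flight check along these lines can catch both of the issues described above before generation starts; the validate_inputs helper and its checks are a hypothetical sketch, not part of the node.

```python
from pathlib import Path

def validate_inputs(text: str, max_chars_per_chunk: int,
                    enable_chunking: bool, voice_refs: dict[str, Path]) -> list[str]:
    """Collect problems that commonly cause the errors described above."""
    problems = []
    if not enable_chunking and len(text) > max_chars_per_chunk:
        problems.append(
            f"Text is {len(text)} chars but max_chars_per_chunk is "
            f"{max_chars_per_chunk}; enable chunking or shorten the text."
        )
    for name, ref in voice_refs.items():
        if not ref.exists():
            problems.append(f"Voice reference for '{name}' not found: {ref}")
    return problems

print(validate_inputs("A" * 500, 200, enable_chunking=False,
                      voice_refs={"narrator": Path("voices/narrator_reference.wav")}))
```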