ComfyUI > Nodes > TTS Audio Suite > 🔧 Audio Analyzer Options

ComfyUI Node: 🔧 Audio Analyzer Options

Class Name

ChatterBoxAudioAnalyzerOptions

Category
TTS Audio Suite/🎵 Audio Processing
Author
diogod (Account age: 667days)
Extension
TTS Audio Suite
Latest Updated
2025-12-13
Github Stars
0.46K

How to Install TTS Audio Suite

Install this extension via the ComfyUI Manager by searching for TTS Audio Suite
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter TTS Audio Suite in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

🔧 Audio Analyzer Options Description

Enhance audio analysis precision with customizable segmentation and detection parameters.

🔧 Audio Analyzer Options:

ChatterBoxAudioAnalyzerOptions is a node designed to enhance the audio analysis capabilities within the ChatterBox suite, providing users with advanced options to fine-tune the detection and segmentation of audio regions. This node is particularly beneficial for those looking to process audio with precision, allowing for the customization of how audio segments are grouped and analyzed. By offering a range of parameters, it enables users to control the sensitivity and thresholds of audio detection, ensuring that the analysis is tailored to the specific needs of the project. This flexibility is crucial for applications where the clarity and accuracy of audio segmentation can significantly impact the quality of the output, such as in text-to-speech systems or audio editing tasks.

🔧 Audio Analyzer Options Input Parameters:

silence_threshold

The silence_threshold parameter determines the level of audio energy below which a region is considered silent. This threshold is crucial for identifying pauses or breaks in the audio, which can be used to segment the audio into meaningful parts. Adjusting this parameter allows you to control how sensitive the node is to quiet sounds, with lower values detecting more subtle silences and higher values requiring more pronounced quietness.

silence_min_duration

The silence_min_duration parameter specifies the minimum duration that a region must be silent to be considered a separate segment. This helps in filtering out short, insignificant pauses that might not be relevant for the analysis. By setting this duration, you can ensure that only meaningful silences are used to segment the audio, which is particularly useful in speech processing where natural pauses occur.

invert_silence_regions

The invert_silence_regions parameter allows you to invert the detected silence regions, effectively treating them as sound regions. This can be useful in scenarios where you are more interested in the silent parts of the audio rather than the sound, such as in noise reduction or silence detection tasks.

energy_sensitivity

The energy_sensitivity parameter controls how sensitive the node is to changes in audio energy. This affects the detection of peaks and troughs in the audio waveform, which are used to identify significant audio events. Higher sensitivity can detect more subtle changes, while lower sensitivity focuses on more pronounced variations.

peak_threshold

The peak_threshold parameter sets the minimum level of audio energy required for a peak to be considered significant. This is important for identifying key audio events, such as beats or syllables, and can be adjusted to focus on more prominent peaks or to include smaller ones.

peak_min_distance

The peak_min_distance parameter defines the minimum time interval between detected peaks. This helps in avoiding the detection of multiple peaks that are too close together, which might not be meaningful. By setting this distance, you can ensure that only distinct peaks are considered, which is useful in rhythm analysis or beat detection.

peak_region_size

The peak_region_size parameter determines the size of the region around each detected peak that is considered part of the peak. This affects how peaks are grouped and can be adjusted to include more or less of the surrounding audio, depending on the desired level of detail in the analysis.

group_regions_threshold

The group_regions_threshold parameter specifies the maximum gap between detected regions that can be grouped into a larger segment. This is useful for combining closely spaced audio events into a single segment, which can simplify the analysis and make it more coherent. The threshold can be adjusted to control the level of grouping, from keeping all regions separate to combining them into larger phrases or sentences.

🔧 Audio Analyzer Options Output Parameters:

ADV_AUDIO_OPTIONS

The ADV_AUDIO_OPTIONS output parameter provides a set of advanced audio options that have been configured based on the input parameters. This output is essential for further processing within the ChatterBox suite, as it encapsulates the customized settings that dictate how audio analysis should be performed. By using this output, you can ensure that subsequent nodes in the workflow are aligned with the specific analysis requirements set by the input parameters.

🔧 Audio Analyzer Options Usage Tips:

  • Adjust the silence_threshold and silence_min_duration parameters to fine-tune the detection of pauses in speech, which can improve the segmentation of spoken content.
  • Use the group_regions_threshold to control the granularity of audio segmentation, allowing you to focus on individual words or larger phrases depending on your analysis needs.

🔧 Audio Analyzer Options Common Errors and Solutions:

"Invalid silence threshold value"

  • Explanation: The silence_threshold value provided is outside the acceptable range.
  • Solution: Ensure that the silence_threshold is set within the valid range specified by the node's documentation.

"Peak detection failed due to high sensitivity"

  • Explanation: The energy_sensitivity is set too high, causing the node to detect too many insignificant peaks.
  • Solution: Lower the energy_sensitivity to focus on more significant audio events and reduce false positives.

🔧 Audio Analyzer Options Related Nodes

Go back to the extension to check out more related nodes.
TTS Audio Suite
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.