RunComfy

ComfyUI Node: 🌊 Audio Wave Analyzer

Class Name: AudioAnalyzerNode
Category: ChatterBox Audio
Author: diodiogod (Account age: 768 days)
Extension: ComfyUI_ChatterBox_SRT_Voice
Latest Updated: 2026-03-21
GitHub Stars: 0.08K

Table of Contents

Description
AudioAnalyzerNode:
AudioAnalyzerNode Input Parameters:
AudioAnalyzerNode Output Parameters:
AudioAnalyzerNode Usage Tips:
AudioAnalyzerNode Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_ChatterBox_SRT_Voice

Install this extension via the ComfyUI Manager by searching for ComfyUI_ChatterBox_SRT_Voice:

1. Click the Manager button in the main menu.
2. Click the Custom Nodes Manager button.
3. Enter ComfyUI_ChatterBox_SRT_Voice in the search bar, then install the extension from the results.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

🌊 Audio Wave Analyzer Description

AudioAnalyzerNode enables precise audio analysis with waveform visualization and timing extraction.

🌊 Audio Wave Analyzer:

The AudioAnalyzerNode analyzes audio by combining an interactive waveform visualization with timing extraction. It suits tasks that need detailed timing data, such as speech recognition or audio editing, because it can extract word timings with high precision. The node handles various audio formats and lets you choose how the analysis is conducted. Its results are formatted for ComfyUI, so they can be passed directly to further processing or visualization steps.

🌊 Audio Wave Analyzer Input Parameters:

audio

The audio parameter is the primary input for the node, representing the audio data to be analyzed. This can be provided as a direct audio input or specified through a file path. The quality and format of the audio input can significantly impact the accuracy of the analysis, so it is important to ensure that the audio is clear and in a supported format. There are no explicit minimum or maximum values for this parameter, but the audio should be of sufficient quality for meaningful analysis.

analysis_method

The analysis_method parameter selects the technique used to analyze the audio. One option is "silence" detection, which finds pauses in the audio and uses them to split it into segments. The method determines both how the audio is processed and what kind of results are produced, so choose it based on the audio content and the outcome you need.
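To make the idea of silence-based segmentation concrete, here is a minimal sketch in plain Python. It is not the node's actual implementation: the function name, the `threshold` and `min_silence` parameters, and their defaults are all assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def find_speech_regions(
    samples: Sequence[float],
    sample_rate: int,
    threshold: float = 0.02,   # hypothetical amplitude below which audio counts as silent
    min_silence: float = 0.1,  # hypothetical minimum pause length, in seconds
) -> List[Tuple[float, float]]:
    """Return (start_sec, end_sec) spans separated by pauses of at least min_silence."""
    min_gap = max(1, round(min_silence * sample_rate))
    regions: List[Tuple[float, float]] = []
    start = None      # index where the current region began
    last_loud = None  # index of the most recent sample above the threshold

    for i, value in enumerate(samples):
        if abs(value) >= threshold:
            if start is None:
                start = i
            last_loud = i
        elif start is not None and i - last_loud >= min_gap:
            # The pause is long enough: close the current region.
            regions.append((start / sample_rate, (last_loud + 1) / sample_rate))
            start = None

    if start is not None:
        # Audio ended while a region was still open.
        regions.append((start / sample_rate, (last_loud + 1) / sample_rate))
    return regions
```

Raising `min_silence` merges regions that are separated only by short pauses, while raising `threshold` makes quieter passages count as silence.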

precision_level

The precision_level parameter specifies the granularity of the timing extraction, with options such as "milliseconds" allowing for very detailed timing information. This parameter influences the resolution of the timing data, which can be critical for applications requiring precise synchronization, such as lip-syncing or detailed audio editing. The default value is typically set to a level that balances performance and precision.
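The effect of a precision setting can be illustrated with a small sketch that quantizes a timestamp to different granularities. Only "milliseconds" is named in the description above; the "seconds" and "samples" levels, the function name, and the rounding rules here are assumptions for illustration, not the node's documented behavior.

```python
def quantize_time(seconds: float, precision: str = "milliseconds",
                  sample_rate: int = 44100):
    """Express a time in seconds at the requested granularity (hypothetical levels)."""
    if precision == "seconds":
        # Assumed coarse level: seconds rounded to two decimal places.
        return round(seconds, 2)
    if precision == "milliseconds":
        # Whole milliseconds.
        return round(seconds * 1000)
    if precision == "samples":
        # Assumed finest level: index of the nearest audio sample.
        return round(seconds * sample_rate)
    raise ValueError(f"Unknown precision level: {precision}")
```

Finer levels preserve more timing detail, which matters for uses such as lip-syncing, at the cost of larger or less readable values.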

visualization_points

The visualization_points parameter controls the number of points used in the waveform visualization. A higher number of points results in a more detailed waveform but may require more computational resources. This parameter allows you to adjust the level of detail in the visualization to suit your needs, with a typical default value being 2000 points.
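One common way to reduce a long waveform to a fixed number of display points is to keep the minimum and maximum of each bucket of samples. The sketch below shows that technique; whether the node uses this exact approach is an assumption, and the function name is hypothetical.

```python
from typing import List, Sequence, Tuple


def downsample_peaks(samples: Sequence[float],
                     points: int = 2000) -> List[Tuple[float, float]]:
    """Reduce samples to at most `points` (min, max) pairs for drawing a waveform."""
    n = len(samples)
    if n == 0 or points <= 0:
        return []
    points = min(points, n)  # never create more buckets than there are samples
    peaks = []
    for k in range(points):
        lo = k * n // points
        hi = (k + 1) * n // points
        bucket = samples[lo:hi]
        peaks.append((min(bucket), max(bucket)))
    return peaks
```

Each extra point costs one more bucket to scan and draw, which is the trade-off between detail and computation described above.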

options

The options parameter provides additional configuration settings for the analysis process. These settings can include various flags or parameters that modify the behavior of the analysis, allowing for customization based on specific requirements. The exact options available may vary depending on the implementation and the analysis method chosen.

manual_regions

The manual_regions parameter allows you to specify regions of interest within the audio manually. This can be useful for focusing the analysis on specific parts of the audio, such as particular words or phrases. By defining these regions, you can tailor the analysis to extract the most relevant information for your task.

region_labels

The region_labels parameter is used to assign labels to the manually defined regions. These labels can help in organizing and interpreting the results, especially when dealing with complex audio data that contains multiple segments of interest.
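The pairing of manual regions with labels can be sketched as follows. The text format is an assumption made for this example (one "start,end" pair in seconds per line, one label per line); the node's real input format may differ, and `parse_regions` is a hypothetical helper.

```python
from typing import Dict, List, Union


def parse_regions(manual_regions: str,
                  region_labels: str = "") -> List[Dict[str, Union[float, str]]]:
    """Parse 'start,end' lines (assumed format) and attach labels in order."""
    labels = [line.strip() for line in region_labels.splitlines() if line.strip()]
    regions: List[Dict[str, Union[float, str]]] = []
    for line in manual_regions.splitlines():
        line = line.strip()
        if not line:
            continue
        start, end = (float(part) for part in line.split(","))
        if end <= start:
            raise ValueError(f"Region end must be after start: {line}")
        index = len(regions)
        # Fall back to a generated name when no label was supplied.
        label = labels[index] if index < len(labels) else f"region_{index + 1}"
        regions.append({"start": start, "end": end, "label": label})
    return regions
```

For example, `parse_regions("0.5,1.2\n2.0,3.5", "hello")` yields two regions, the first labeled "hello" and the second given the fallback name "region_2".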

export_format

The export_format parameter determines the format in which the analysis results are exported. Options like "f5tts" are available, which may be tailored for specific applications or systems. Choosing the correct export format ensures compatibility with downstream processes or tools that will use the analysis results.

node_id

The node_id parameter is a unique identifier for the node instance, which can be useful for tracking and managing multiple instances of the node within a larger workflow. This identifier helps in organizing and referencing the node's output in complex projects.

🌊 Audio Wave Analyzer Output Parameters:

segmented_audio

The segmented_audio output provides the audio segments extracted based on the analysis. These segments are formatted for compatibility with ComfyUI, ensuring they can be easily integrated into subsequent processing steps. The segmented audio is crucial for tasks that require isolated audio parts, such as editing or further analysis.
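As a rough illustration of what extracting segments involves, the sketch below slices a list of samples by time ranges and joins the pieces. It works on a plain Python list rather than ComfyUI's audio type, and both the concatenation behavior and the `extract_segments` name are assumptions, not a description of the node's internals.

```python
from typing import List, Sequence, Tuple


def extract_segments(samples: Sequence[float], sample_rate: int,
                     regions: Sequence[Tuple[float, float]]) -> List[float]:
    """Concatenate the samples that fall inside each (start_sec, end_sec) region."""
    out: List[float] = []
    for start, end in regions:
        # Convert times to sample indices and clamp them to the valid range.
        lo = max(0, round(start * sample_rate))
        hi = min(len(samples), round(end * sample_rate))
        out.extend(samples[lo:hi])
    return out
```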

🌊 Audio Wave Analyzer Usage Tips:

Ensure that the audio input is of high quality and in a supported format to achieve the best analysis results.
Choose the analysis_method and precision_level based on the specific requirements of your task to optimize performance and accuracy.
Utilize the manual_regions and region_labels parameters to focus the analysis on specific parts of the audio that are most relevant to your project.

🌊 Audio Wave Analyzer Common Errors and Solutions:

Audio file not found: `<file_path>`

Explanation: This error occurs when the specified audio file path does not exist or cannot be found.
Solution: Verify that the file path is correct and that the file exists at the specified location. Ensure that the path is absolute or correctly resolved relative to the ComfyUI input directory.

No audio input provided

Explanation: This error indicates that no audio input was given to the node, either as a direct input or through a file path.
Solution: Provide a valid audio input by connecting an audio source or specifying a correct file path for the audio file.
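The two errors above correspond to checks of roughly the following shape. This is a sketch only: the parameter names `audio_file` and `input_dir`, the default directory, and the function itself are hypothetical, and only the two error messages are taken from this page.

```python
from pathlib import Path


def resolve_audio_source(audio=None, audio_file: str = "", input_dir: str = "input"):
    """Return the connected audio, or a validated path to an audio file."""
    if audio is not None:
        # A directly connected audio input takes priority over a file path.
        return audio
    if not audio_file:
        raise ValueError("No audio input provided")
    path = Path(audio_file)
    if not path.is_absolute():
        # Assumption: relative paths are resolved against the input directory.
        path = Path(input_dir) / path
    if not path.is_file():
        raise FileNotFoundError(f"Audio file not found: {path}")
    return path
```

Printing the resolved path in the error message makes it easier to see which location was actually checked when a relative path was given.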

🌊 Audio Wave Analyzer Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_ChatterBox_SRT_Voice


RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.
