RunComfy

Wan 2.2 FLF2V | First-Last Frame Video Generation

Generate smooth videos from a start and end frame using Wan 2.2 FLF2V.

ComfyUI Grounding | Object Tracking Workflow

Track any subject with pixel-perfect accuracy for stunning VFX results.

CatVTON | Amazing Virtual Try-On

CatVTON for easy and accurate virtual try-on.

IPAdapter Plus (V2) | Change Clothes

Use IPAdapter Plus for your fashion model creation, easily changing outfits and styles

ComfyUI > Nodes > ComfyUI-Creepy_nodes > Gemini Audio Analyzer (Creepybits)

ComfyUI Node: Gemini Audio Analyzer (Creepybits)

Class Name

GeminiAudioAnalyzer

Category
Creepybits/Audio

Author
Creepybits (Account age: 2146days) Extension
ComfyUI-Creepy_nodes Latest Updated
2025-12-07 Github Stars
0.02K

Github Ask Creepybits Current Questions Past Questions

Table of Content

Description
GeminiAudioAnalyzer:
GeminiAudioAnalyzer Input Parameters:
GeminiAudioAnalyzer Output Parameters:
GeminiAudioAnalyzer Usage Tips:
GeminiAudioAnalyzer Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-Creepy_nodes

Install this extension via the ComfyUI Manager by searching for ComfyUI-Creepy_nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-Creepy_nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Gemini Audio Analyzer (Creepybits) Description

Specialized audio data processing node for AI applications, optimizing audio for analysis with 16kHz sample rate.

Gemini Audio Analyzer (Creepybits):

The GeminiAudioAnalyzer is a specialized node designed to process and analyze audio data, particularly for use in AI-driven applications. Its primary function is to prepare audio content for analysis by ensuring that the audio waveform is in the correct format and sample rate required by the Gemini model, which is 16kHz. This node is capable of handling audio inputs with varying channel configurations, converting them to mono if necessary, and resampling them to meet the model's specifications. By doing so, it ensures that the audio data is optimized for analysis, allowing for more accurate and reliable results. The GeminiAudioAnalyzer is particularly beneficial for applications that require precise audio analysis, such as speech recognition, audio classification, or any AI model that relies on audio input. Its ability to seamlessly integrate audio content with text for multimodal models further enhances its utility, making it a versatile tool for AI artists and developers working with complex audio-visual data.

Gemini Audio Analyzer (Creepybits) Input Parameters:

prompt

The prompt parameter is a textual input that provides context or instructions for the audio analysis process. It guides the node on what specific aspects of the audio to focus on or analyze, ensuring that the output is relevant to the user's needs. This parameter does not have a predefined set of values, as it is highly dependent on the specific requirements of the task at hand.

input_type

The input_type parameter specifies the type of input being provided to the node, which in this case is "audio". This parameter ensures that the node processes the input correctly, distinguishing between different types of data that might be handled by the system. It is crucial for the node to recognize the input type to apply the appropriate processing methods.

Additional_Context

The Additional_Context parameter allows for the inclusion of supplementary information that might be relevant to the audio analysis. This can enhance the node's ability to interpret the audio data by providing additional background or situational details. This parameter is optional and can be tailored to the specific needs of the analysis.

audio

The audio parameter is the core input for the GeminiAudioAnalyzer, consisting of the audio data to be analyzed. It includes the waveform and sample rate, which are essential for processing. The node ensures that the audio is in the correct format and sample rate, converting it to mono and resampling it to 16kHz if necessary. This parameter is critical for the node's operation, as it directly affects the quality and accuracy of the analysis.

api_key

The api_key parameter is used for authentication purposes when interacting with external services or APIs. It ensures that the node can securely access the necessary resources for audio analysis. This parameter is essential for enabling the node to function within a secure and authorized environment.

max_output_tokens

The max_output_tokens parameter defines the maximum number of tokens that the node can generate in its output. This parameter helps manage the length and complexity of the output, ensuring that it remains within manageable limits. It is particularly useful for controlling the verbosity of the analysis results.

safety_threshold

The safety_threshold parameter sets the level of safety filtering applied to the output, with options such as "Block None". This parameter helps ensure that the output is appropriate and free from potentially harmful or inappropriate content. It is an important consideration for maintaining the quality and safety of the analysis results.

temperature

The temperature parameter controls the randomness of the output, with a default value of 0.4. A lower temperature results in more deterministic outputs, while a higher temperature allows for more variability and creativity. This parameter is useful for fine-tuning the balance between consistency and diversity in the analysis results.

Gemini Audio Analyzer (Creepybits) Output Parameters:

content_parts

The content_parts output parameter is a list that contains the processed audio data along with any associated text content. This output is crucial for applications that require a combination of audio and text data, as it provides a comprehensive representation of the analyzed content. The content_parts parameter ensures that the output is ready for further processing or integration into multimodal models.

Gemini Audio Analyzer (Creepybits) Usage Tips:

Ensure that your audio input is clear and free from excessive noise to improve the accuracy of the analysis.
Use the prompt parameter effectively to guide the analysis process and obtain results that are relevant to your specific needs.
Adjust the temperature parameter to balance between consistent and creative outputs, depending on the requirements of your application.

Gemini Audio Analyzer (Creepybits) Common Errors and Solutions:

Error: Missing expected key in audio input

Explanation: This error occurs when the audio input does not contain the necessary keys, such as "waveform" or "sample_rate", required for processing.
Solution: Ensure that your audio input is correctly formatted and includes all the necessary keys. Verify that the input data structure matches the expected format.

Error processing audio

Explanation: This error indicates a failure in processing the audio input, which could be due to incorrect formatting or unsupported audio configurations.
Solution: Check the audio input for any formatting issues or unsupported configurations. Ensure that the audio is in a compatible format and meets the node's requirements for processing.

Gemini Audio Analyzer (Creepybits) Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-Creepy_nodes

Table of Content

Description
GeminiAudioAnalyzer:
GeminiAudioAnalyzer Input Parameters:
GeminiAudioAnalyzer Output Parameters:
GeminiAudioAnalyzer Usage Tips:
GeminiAudioAnalyzer Common Errors and Solutions:
Related Nodes

Face Restore + ControlNet + Reactor | Restore Old Photos

Revive faded photos into vibrant memories, preserving every detail for cherished reminiscence.

Flux Kontext Zoom Out ComfyUI Workflow | Seamless Outpainting

Zoom Out LoRA enlarges images seamlessly with natural continuation.

Easy Video Upscaler for Footage | Pro HD Enhancement

Turn low-res clips into sharp, natural HD videos fast.

FLUX Inpainting | Seamless Image Editing

Effortlessly fill, remove, and refine images, seamlessly integrating new content.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: Gemini Audio Analyzer (Creepybits)

GeminiAudioAnalyzer

How to Install ComfyUI-Creepy_nodes

Gemini Audio Analyzer (Creepybits) Description

Gemini Audio Analyzer (Creepybits):

Gemini Audio Analyzer (Creepybits) Input Parameters:

prompt

input_type

Additional_Context

audio

api_key

max_output_tokens

safety_threshold

temperature

Gemini Audio Analyzer (Creepybits) Output Parameters:

content_parts

Gemini Audio Analyzer (Creepybits) Usage Tips:

Gemini Audio Analyzer (Creepybits) Common Errors and Solutions:

Error: Missing expected key in audio input

Error processing audio

Gemini Audio Analyzer (Creepybits) Related Nodes