Dynamic audio-reactive visual mask dilation for synchronized audio-visual experiences with infinite variations.
The AK_AudioreactiveDilateMaskInfinite node dynamically alters visual masks in response to audio input, creating visuals that stay synchronized with the sound. It is particularly useful for artists and creators who want to integrate audio-reactive elements into their projects. By analyzing the amplitude of the audio signal, the node adjusts the dilation of masks, producing effectively infinite variations and patterns that evolve with the music. This makes it well suited to live performances, music videos, and interactive installations: the node detects beats or significant changes in audio amplitude and uses these cues to modify the visual representation of the masks, blending audio and visual art seamlessly.
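As a rough sketch of the idea only (not the node's actual implementation), the helper below grows a binary mask by an amount driven by the current frame's amplitude, using max-pooling as a cheap stand-in for morphological dilation; the function name and the max_radius argument are hypothetical.

```python
import torch
import torch.nn.functional as F

def dilate_mask_by_amplitude(mask: torch.Tensor, amp: float, max_radius: int = 15) -> torch.Tensor:
    """mask: [H, W] float tensor in [0, 1]; amp: normalized amplitude in [0, 1]."""
    radius = int(round(amp * max_radius))  # louder audio -> larger dilation radius
    if radius == 0:
        return mask
    kernel = 2 * radius + 1
    # Max-pooling a window over the mask expands its bright regions (a cheap dilation).
    dilated = F.max_pool2d(mask[None, None], kernel_size=kernel, stride=1, padding=radius)
    return dilated[0, 0]
```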
The mask parameter represents the initial visual mask that will be subject to dilation. It is a crucial input as it defines the base shape and structure that will be modified in response to audio signals. The mask should be provided as a tensor, typically in a format compatible with PyTorch, and it serves as the canvas for the audio-reactive transformations.
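For illustration, and assuming the common ComfyUI convention of masks as float tensors shaped [frames, height, width] with values between 0 and 1 (verify the shape your workflow actually produces), a base mask could be built like this:

```python
import torch

frames, height, width = 64, 512, 512
mask = torch.zeros(frames, height, width)

# Fill every frame with a centred filled circle as the base shape to be dilated.
ys, xs = torch.meshgrid(torch.arange(height), torch.arange(width), indexing="ij")
circle = ((ys - height / 2) ** 2 + (xs - width / 2) ** 2) <= (height / 6) ** 2
mask[:] = circle.float()
```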
The normalized_amp parameter consists of normalized amplitude values derived from the audio input. These values dictate the extent of the mask's dilation, with higher amplitudes resulting in more significant changes. This parameter is essential for synchronizing the visual effects with the audio, ensuring that the mask's transformations are directly influenced by the sound's dynamics.
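One way to produce such values (an assumption about preprocessing, not the node's own code) is to take the RMS energy of the audio within each video frame's window and scale by the peak:

```python
import numpy as np

def normalized_amplitude(samples: np.ndarray, sample_rate: int, fps: float, num_frames: int) -> np.ndarray:
    hop = int(sample_rate / fps)  # audio samples per video frame
    amps = []
    for i in range(num_frames):
        chunk = samples[i * hop:(i + 1) * hop]
        amps.append(np.sqrt(np.mean(chunk ** 2)) if chunk.size else 0.0)  # RMS energy
    amps = np.asarray(amps, dtype=np.float64)
    peak = amps.max() if amps.size else 0.0
    return amps / peak if peak > 0 else amps  # scale into the 0..1 range
```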
The mask_colors parameter specifies the colors to be applied to the dilated masks. This allows for the creation of colorful and vibrant visual effects that can change over time, adding an additional layer of dynamism to the audio-reactive visuals. The colors are cycled through as the mask dilates, providing a visually appealing transition.
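A small, purely illustrative picture of that cycling behaviour (the palette and helper are hypothetical):

```python
colors = [(255, 0, 128), (0, 200, 255), (255, 220, 0)]  # example RGB palette

def color_for_step(step: int) -> tuple:
    # Wrap around the palette so the cycle can continue indefinitely.
    return colors[step % len(colors)]
```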
The threshold parameter determines the amplitude level required to trigger the dilation process. By setting this threshold, you can control the sensitivity of the node to audio input, ensuring that only significant audio events cause visual changes. This helps in filtering out background noise and focusing on prominent beats or sounds.
The dilation_speed parameter controls the rate at which the mask dilates in response to audio input. A higher speed results in faster transformations, while a lower speed creates more gradual changes. This parameter allows you to fine-tune the responsiveness of the visual effects to match the tempo and style of the audio.
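A conceptual sketch of how threshold and dilation_speed might interact (a hypothetical helper, not the node's internals): growth only happens on frames whose amplitude clears the threshold, and dilation_speed scales how much the radius grows per triggered frame.

```python
def update_radius(radius: float, amp: float, threshold: float = 0.5, dilation_speed: float = 2.0) -> float:
    if amp >= threshold:                # a beat / loud moment triggers growth
        radius += dilation_speed * amp  # louder frames push the dilation further
    return radius
```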
The quality_factor parameter affects the resolution and smoothness of the dilation process. A higher quality factor results in more detailed and refined visual effects, while a lower factor may lead to faster processing but with less precision. This parameter is crucial for balancing performance and visual quality.
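One plausible way to picture this trade-off (an assumption, not the node's implementation) is dilating at a reduced resolution and upsampling the result, trading edge smoothness for speed:

```python
import torch
import torch.nn.functional as F

def dilate_at_quality(mask: torch.Tensor, radius: int, quality_factor: float = 0.5) -> torch.Tensor:
    """mask: [H, W] float tensor; lower quality_factor means faster but coarser edges."""
    h, w = mask.shape
    small = F.interpolate(mask[None, None], scale_factor=quality_factor,
                          mode="bilinear", align_corners=False)
    r = max(1, int(radius * quality_factor))
    small = F.max_pool2d(small, kernel_size=2 * r + 1, stride=1, padding=r)
    return F.interpolate(small, size=(h, w), mode="bilinear", align_corners=False)[0, 0]
```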
The should_composite_subject parameter is a boolean that determines whether the subject of the mask should be composited with the background. This allows for the integration of the mask with other visual elements, creating a cohesive and unified visual presentation.
The subject_mask_color parameter defines the color of the subject mask when compositing with the background. This color choice can enhance the visual contrast and ensure that the subject stands out against the background, contributing to the overall aesthetic of the visual output.
The initial_background_color parameter sets the starting color of the background before any dilation occurs. This provides a base color that can complement the mask colors and enhance the overall visual composition. The background color is specified as a tuple of RGB values.
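A rough compositing sketch under assumed conventions (RGB tuples in 0-255, masks as [H, W] tensors); the function and argument names are illustrative only, and the actual node may layer things differently:

```python
import torch

def composite(dilated_mask, subject_mask, dilation_color=(0, 200, 255),
              subject_mask_color=(255, 255, 255), initial_background_color=(0, 0, 0)):
    h, w = dilated_mask.shape
    # Start from a solid background, paint the dilated effect, then the subject on top.
    img = torch.tensor(initial_background_color, dtype=torch.float32).view(3, 1, 1).expand(3, h, w).clone()
    dil = torch.tensor(dilation_color, dtype=torch.float32).view(3, 1, 1).expand(3, h, w)
    img = torch.where(dilated_mask.bool(), dil, img)
    sub = torch.tensor(subject_mask_color, dtype=torch.float32).view(3, 1, 1).expand(3, h, w)
    img = torch.where(subject_mask.bool(), sub, img)
    return img / 255.0  # ComfyUI images are typically 0..1 floats
```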
The start_frame parameter specifies the frame at which the dilation process should begin. This allows for precise control over the timing of the visual effects, ensuring that they align perfectly with the desired audio segments. It is particularly useful for synchronizing visuals with specific parts of a song or audio track.
The end_frame parameter defines the frame at which the dilation process should stop. This provides a way to limit the duration of the visual effects, allowing for targeted and controlled visual transformations that match the length of the audio input.
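A simple way to picture the frame gating (a hypothetical helper; whether the node treats end_frame as inclusive is not specified here):

```python
def gate_amplitudes(amps, start_frame=0, end_frame=None):
    # Zero the amplitude outside [start_frame, end_frame) so no dilation is triggered there.
    end_frame = len(amps) if end_frame is None else end_frame
    return [a if start_frame <= i < end_frame else 0.0 for i, a in enumerate(amps)]
```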
The result_images parameter is the primary output of the node, consisting of a sequence of images that represent the dilated masks over time. These images are generated based on the input parameters and the audio-reactive transformations, providing a dynamic visual representation that evolves with the audio. The output can be used directly in visual projects or further processed to create complex visual compositions.
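Outside the graph, assuming the usual ComfyUI IMAGE convention of a [frames, height, width, 3] float batch in the 0 to 1 range, the frames could be written out like this (or routed to a video-combine node inside ComfyUI instead):

```python
import numpy as np
from PIL import Image

def save_frames(result_images, prefix="dilate"):
    # Convert the 0..1 float batch to 8-bit RGB and write one PNG per frame.
    frames = (result_images.cpu().numpy() * 255).clip(0, 255).astype(np.uint8)
    for i, frame in enumerate(frames):
        Image.fromarray(frame).save(f"{prefix}_{i:04d}.png")
```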
Usage tips:
- Experiment with the mask_colors parameter to create visually striking effects that complement the audio. Consider using contrasting colors for more dynamic visuals.
- Adjust the threshold parameter to fine-tune the sensitivity of the node to audio input. This can help in isolating specific beats or sounds that you want to emphasize visually.
- Set the dilation_speed parameter to match the tempo of the audio. Faster speeds work well with upbeat music, while slower speeds can create a more relaxed visual flow.

Common issues:
- The normalized_amp parameter must consist of values between 0 and 1. Solution: Normalize the amplitude values before passing them to the node by scaling the raw amplitude values to fit within the 0 to 1 range (see the sketch after this list).
- The node may fail to apply the colors supplied in the mask_colors parameter, possibly due to incorrect formatting; double-check that each entry matches the color format the node expects.
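A minimal normalization sketch for the point above, assuming the raw amplitude values arrive as a NumPy array:

```python
import numpy as np

def normalize_amplitudes(raw: np.ndarray) -> np.ndarray:
    # Min-max scale the raw values into the 0..1 range the node expects.
    lo, hi = raw.min(), raw.max()
    return np.zeros_like(raw) if hi == lo else (raw - lo) / (hi - lo)
```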