ICEdit | Fast AI Image Editing with Nunchaku

ICEdit+Nunchaku: A solution for ultra-fast, precise AI image editing.

Hunyuan Video | Video to Video

Combine text prompt and source video to generate new video.

AP Workflow 12.0 | Ready-to-Use Complete AI Media Suite

Pre-set all-in-one system for image & video generation, enhancement, and manipulation. Zero setup required.

Dance Video Transform | Scene Customization & Face Swap

Transform dance videos with scene editing, face-swapping, and motion preservation.

ComfyUI > Nodes > DJZ-Nodes > 🎤 Voice Effects

ComfyUI Node: 🎤 Voice Effects

Class Name

VoiceEffects

Category
None

Author
DriftJohnson (Account age: 4052days) Extension
DJZ-Nodes Latest Updated
2025-04-25 Github Stars
0.04K

Github Ask DriftJohnson Current Questions Past Questions

Table of Content

Description
VoiceEffects:
VoiceEffects Input Parameters:
VoiceEffects Output Parameters:
VoiceEffects Usage Tips:
VoiceEffects Common Errors and Solutions:
Related Nodes

How to Install DJZ-Nodes

Install this extension via the ComfyUI Manager by searching for DJZ-Nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter DJZ-Nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

🎤 Voice Effects Description

Enhance audio with various voice effects for creative sound manipulation and artistic expression.

🎤 Voice Effects:

The VoiceEffects node is designed to enhance and transform audio by applying a variety of voice effects. This node is particularly useful for AI artists and audio enthusiasts who wish to experiment with sound manipulation without needing a deep technical background. By utilizing this node, you can modify audio characteristics such as room ambiance, vibrato, pitch, and more, allowing for creative expression and unique soundscapes. The node processes audio inputs and applies specified effects, ensuring that the output is normalized and formatted correctly for further use in audio projects. Its primary goal is to provide a flexible and user-friendly interface for audio transformation, making it an essential tool for anyone looking to explore the artistic possibilities of sound.

🎤 Voice Effects Input Parameters:

audio

The audio parameter is a dictionary that contains the audio data to be processed. It must include a key named waveform, which holds the actual audio waveform data. This parameter is crucial as it serves as the primary input for the node, and the waveform data is what the node will manipulate to apply the desired effects. The audio dictionary may also include a sample_rate key, which specifies the sample rate of the audio. If not provided, a default sample rate of 44100 Hz is used. This parameter ensures that the audio is correctly interpreted and processed by the node.

room_size

The room_size parameter controls the perceived size of the room in which the audio is played, affecting the reverberation characteristics. A larger room size will result in more pronounced echoes and a sense of spaciousness, while a smaller room size will create a more intimate sound. This parameter allows you to simulate different acoustic environments, enhancing the realism or artistic quality of the audio.

damping

The damping parameter affects the absorption of sound within the simulated room, influencing how quickly the reverberations decay. Higher damping values result in quicker decay and less echo, while lower values allow the sound to linger longer. This parameter is useful for fine-tuning the reverberation effect to match the desired acoustic environment.

cutoff_freq

The cutoff_freq parameter determines the frequency at which the audio signal is filtered. It is used in conjunction with the filter_type parameter to apply either a low-pass or high-pass filter, shaping the tonal quality of the audio. Adjusting the cutoff frequency can help emphasize or attenuate certain frequency ranges, allowing for creative sound design.

filter_type

The filter_type parameter specifies the type of filter to apply to the audio signal. Common options include low-pass and high-pass filters, which respectively allow low or high frequencies to pass through while attenuating others. This parameter works with cutoff_freq to shape the audio's frequency response, enabling you to sculpt the sound to your liking.

vibrato_freq

The vibrato_freq parameter sets the frequency of the vibrato effect, which modulates the pitch of the audio signal. A higher vibrato frequency results in a faster pitch modulation, while a lower frequency creates a slower, more pronounced effect. This parameter is useful for adding expressiveness and movement to the audio.

vibrato_depth

The vibrato_depth parameter controls the intensity of the vibrato effect, determining how much the pitch is modulated. A greater vibrato depth results in more noticeable pitch variations, while a smaller depth creates a subtler effect. This parameter allows you to adjust the vibrato to suit the desired artistic expression.

shift_factor

The shift_factor parameter adjusts the pitch of the audio signal, allowing you to transpose the audio up or down. This can be used to change the perceived pitch of a voice or instrument, enabling creative transformations and harmonizations.

delay_time

The delay_time parameter sets the time interval for the delay effect, which creates echoes by repeating the audio signal after a specified duration. Longer delay times result in more distinct echoes, while shorter times create a more continuous effect. This parameter is essential for creating rhythmic patterns and spatial effects.

decay

The decay parameter controls the rate at which the delayed echoes diminish in volume. A longer decay time results in echoes that persist longer, while a shorter decay causes them to fade quickly. This parameter is important for shaping the character of the delay effect.

gain

The gain parameter adjusts the overall volume of the audio signal. It can be used to boost or attenuate the audio level, ensuring that the processed output is at the desired loudness. This parameter is crucial for maintaining audio balance and preventing distortion.

threshold

The threshold parameter sets the level at which certain effects, such as compression, are triggered. It determines the point at which the audio signal is affected by the effect, allowing for dynamic control over the audio processing. This parameter is key for managing the impact of effects on the audio.

🎤 Voice Effects Output Parameters:

waveform

The waveform output parameter is a tensor that contains the processed audio data. It represents the transformed audio waveform after all specified effects have been applied. This output is crucial as it is the final product of the node's processing, ready for further use or analysis in audio projects.

sample_rate

The sample_rate output parameter indicates the sample rate of the processed audio. It ensures that the audio is played back at the correct speed and pitch, maintaining the integrity of the sound. This output is important for compatibility with other audio processing tools and playback systems.

path

The path output parameter is currently set to None, indicating that the processed audio is not saved to a file by default. This output is included for potential future use, where the audio might be saved to a specified file path for storage or distribution.

🎤 Voice Effects Usage Tips:

Experiment with different room_size and damping settings to simulate various acoustic environments, from small rooms to large halls.
Use the vibrato_freq and vibrato_depth parameters to add expressiveness to vocal tracks, creating a more dynamic and engaging sound.
Adjust the cutoff_freq and filter_type to shape the tonal balance of the audio, emphasizing or reducing specific frequency ranges for creative effects.
Utilize the delay_time and decay parameters to create rhythmic echo patterns, adding depth and complexity to the audio.

🎤 Voice Effects Common Errors and Solutions:

Error: Input audio is not a dictionary

Explanation: The input audio must be provided as a dictionary containing the waveform data.
Solution: Ensure that the audio input is structured as a dictionary with the required keys, such as waveform.

Error: Input audio missing 'waveform' key

Explanation: The audio dictionary must include a waveform key with the audio data.
Solution: Verify that the input dictionary contains a waveform key and that it holds the correct audio data.

Error: Input audio waveform is None

Explanation: The waveform data in the audio dictionary cannot be None.
Solution: Check that the waveform key in the audio dictionary contains valid audio data before processing.

🎤 Voice Effects Related Nodes

Go back to the extension to check out more related nodes.

DJZ-Nodes

Table of Content

Description
VoiceEffects:
VoiceEffects Input Parameters:
VoiceEffects Output Parameters:
VoiceEffects Usage Tips:
VoiceEffects Common Errors and Solutions:
Related Nodes

SUPIR + Foolhardy Remacri | 8K Image/Video Upscaler

Upscale images to 8K with SUPIR and 4x Foolhardy Remacri model.

Hunyuan Image to Video | Breathtaking Motion Creator

Create magnificent movies out of still images through cinematic motion and customizable effects.

Hunyuan3D-1 | ComfyUI 3D Pack

Create multi-view RGB images first, then transform them into 3D assets.

FramePack Wrapper | Efficient long Video Generation

Create stable, 60s+ long videos with minimal cloud resources.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.