🎛️ Geeky Kokoro Advanced Voice (2025):
The GeekyKokoroAdvancedVoice node is a sophisticated tool designed for the ComfyUI's Geeky Kokoro Text-to-Speech (TTS) system, offering advanced voice transformation capabilities. This node is particularly beneficial for users seeking to enhance their audio projects with professional-grade voice modifications. It provides a range of features including guided voice morphing, which allows for seamless integration of secondary audio files to guide the transformation process. Additionally, it includes autotune-style pitch correction, advanced spectral and formant morphing, and a suite of audio processing effects such as reverb, echo, and distortion. These features enable users to create unique and high-quality voice outputs, making it an essential tool for AI artists and audio designers looking to push the boundaries of their creative projects.
🎛️ Geeky Kokoro Advanced Voice (2025) Input Parameters:
audio
This parameter represents the primary audio input that you wish to transform. It serves as the base audio file upon which all modifications and effects will be applied. The quality and characteristics of this input will significantly influence the final output.
effect_blend
This parameter controls the intensity of the applied effects, ranging from 0.0 (no effect) to 1.0 (full effect). Adjusting this allows you to fine-tune the balance between the original audio and the modified version.
output_volume
This parameter sets the volume level of the output audio. It allows you to adjust the loudness of the final audio product, ensuring it meets your desired specifications.
voice_profile
This parameter allows you to select a predefined voice profile, such as "Alien" or "Deep Voice," which applies a set of specific transformations to the audio. Each profile has unique settings for pitch, formant, and other effects.
profile_intensity
This parameter determines the strength of the selected voice profile's effects, with a typical range from 0.0 to 1.0. A higher intensity results in a more pronounced transformation.
guide_audio
This optional parameter allows you to provide a secondary audio file to guide the voice morphing process. It is particularly useful for achieving specific voice characteristics by mimicking the guide audio.
enable_guided_morph
This boolean parameter enables or disables the guided morphing feature. When set to true, the node uses the guide audio to influence the transformation process.
pitch_morph_amount
This parameter specifies the degree of pitch morphing applied to the audio, allowing for subtle or dramatic changes in pitch.
formant_morph_amount
This parameter controls the amount of formant morphing, which affects the tonal quality and timbre of the voice, enabling you to alter the perceived size and shape of the vocal tract.
spectral_morph_amount
This parameter adjusts the extent of spectral morphing, which modifies the frequency spectrum of the audio to achieve various effects.
amplitude_morph_amount
This parameter determines the level of amplitude morphing, affecting the dynamics and loudness variations within the audio.
manual_mode
This boolean parameter allows you to manually adjust individual transformation settings, providing greater control over the audio modification process.
pitch_shift
This parameter allows you to shift the pitch of the audio up or down, measured in semitones, to achieve desired pitch alterations.
formant_shift
This parameter enables you to shift the formants of the audio, affecting the vocal characteristics without altering the pitch.
reverb_amount
This parameter controls the amount of reverb effect applied, simulating the acoustics of different environments.
reverb_room_size
This parameter specifies the perceived size of the room for the reverb effect, influencing the echo and decay characteristics.
echo_delay
This parameter sets the delay time for the echo effect, determining how quickly the echo repeats.
echo_feedback
This parameter controls the feedback level of the echo effect, affecting the number of echo repetitions.
distortion
This parameter applies a distortion effect to the audio, adding harmonic content and altering the sound's texture.
compression
This parameter adjusts the dynamic range compression, balancing the loud and soft parts of the audio for a more consistent output.
eq_bass
This parameter modifies the bass frequencies of the audio, allowing you to enhance or reduce low-end sounds.
eq_mid
This parameter adjusts the midrange frequencies, which are crucial for the clarity and presence of the audio.
eq_treble
This parameter controls the treble frequencies, affecting the brightness and sharpness of the sound.
time_stretch
This parameter allows you to alter the playback speed of the audio without affecting its pitch, useful for time-based adjustments.
brightness
This parameter influences the perceived brightness of the audio, affecting the high-frequency content.
warmth
This parameter adjusts the warmth of the audio, enhancing the low-mid frequencies for a richer sound.
use_gpu
This boolean parameter enables GPU acceleration for processing, potentially improving performance and reducing processing time.
🎛️ Geeky Kokoro Advanced Voice (2025) Output Parameters:
transformed_audio
The transformed_audio output is the final audio product after all selected modifications and effects have been applied. It reflects the cumulative impact of the input parameters, providing a unique and customized audio experience based on your specifications.
🎛️ Geeky Kokoro Advanced Voice (2025) Usage Tips:
- Experiment with different voice profiles to quickly achieve a variety of effects and find the one that best suits your project.
- Use the guided morphing feature with a carefully chosen guide audio to achieve specific voice characteristics that are difficult to replicate manually.
- Adjust the effect_blend parameter to find the perfect balance between the original and modified audio, ensuring the transformation enhances rather than overwhelms the source material.
- Utilize the manual_mode to fine-tune individual parameters for precise control over the audio transformation process.
🎛️ Geeky Kokoro Advanced Voice (2025) Common Errors and Solutions:
Guided morphing features not available
- Explanation: This error occurs when the guided morphing utilities are not successfully imported, possibly due to missing dependencies.
- Solution: Ensure that all required dependencies for guided morphing are installed and correctly configured in your environment.
Invalid voice profile selected
- Explanation: This error indicates that the specified voice profile does not exist in the available profiles.
- Solution: Double-check the spelling and availability of the voice profile you wish to use, and select a valid profile from the provided list.
Audio processing failed
- Explanation: This error may occur if there is an issue with the audio input or processing parameters.
- Solution: Verify that the audio input is correctly formatted and that all parameters are set within their valid ranges. Adjust settings as necessary and try again.
