ComfyUI > Nodes > TTS Audio Suite > 🔄 Voice Changer

ComfyUI Node: 🔄 Voice Changer

Class Name

UnifiedVoiceChangerNode

Category
TTS Audio Suite/🎤 Text to Speech
Author
diogod (Account age: 667days)
Extension
TTS Audio Suite
Latest Updated
2025-12-13
Github Stars
0.46K

How to Install TTS Audio Suite

Install this extension via the ComfyUI Manager by searching for TTS Audio Suite
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter TTS Audio Suite in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

🔄 Voice Changer Description

Versatile voice conversion tool supporting multiple engines for transforming audio to mimic target voices with ease.

🔄 Voice Changer:

The UnifiedVoiceChangerNode is a versatile and engine-agnostic tool designed for voice conversion within the TTS Audio Suite. It has been refactored from the ChatterBox VC to support multiple voice conversion engines, including the current ChatterBox and future RVC engines. This node allows you to transform a source audio file to mimic the characteristics of a target voice, making it an invaluable asset for applications requiring voice customization and transformation. By supporting multiple engines, it provides flexibility and future-proofing, ensuring compatibility with evolving technologies. The node's primary goal is to facilitate seamless voice conversion with minimal technical complexity, making it accessible to users who may not have a deep technical background.

🔄 Voice Changer Input Parameters:

TTS_engine

This parameter specifies the Text-to-Speech (TTS) or Voice Conversion (VC) engine configuration to be used for the conversion process. It supports the ChatterBox TTS Engine and is prepared for future integration with the RVC Engine. The choice of engine can significantly impact the quality and characteristics of the converted voice, as different engines may have unique processing capabilities and language support.

source_audio

The source_audio parameter is the original voice audio that you wish to convert. It accepts audio input directly or output from a Character Voices node. This audio serves as the base for the conversion process, and its quality and clarity can affect the final output. The node processes this audio to extract the necessary features for conversion.

narrator_target

This parameter represents the reference voice audio whose characteristics will be applied to the source audio. Like source_audio, it accepts audio input or output from a Character Voices node. The narrator_target is crucial as it defines the desired voice characteristics, such as tone, pitch, and style, that will be applied to the source audio.

refinement_passes

The refinement_passes parameter determines the number of conversion iterations to be performed. Each pass refines the output to sound more like the target voice. The default value is 1, with a minimum of 1 and a maximum of 30. It is recommended to use a maximum of 5 passes, as more can cause distortions. Each iteration is deterministic, which helps reduce degradation in the output quality.

🔄 Voice Changer Output Parameters:

converted_audio

The converted_audio is the primary output of the UnifiedVoiceChangerNode. It is the transformed version of the source_audio, modified to mimic the characteristics of the narrator_target. This output is crucial for applications that require voice transformation, as it provides the final audio result after processing through the selected engine and refinement passes.

🔄 Voice Changer Usage Tips:

  • To achieve the best results, ensure that both the source_audio and narrator_target are of high quality and free from background noise, as this can affect the conversion process.
  • Experiment with different numbers of refinement_passes to find the optimal balance between quality and processing time. While more passes can improve the likeness to the target voice, they may also introduce distortions if overused.

🔄 Voice Changer Common Errors and Solutions:

Failed to create engine VC node instance

  • Explanation: This error occurs when the node is unable to instantiate the specified TTS or VC engine.
  • Solution: Ensure that the engine configuration is correct and that all necessary dependencies for the chosen engine are installed and properly configured.

Warning: narrator_target should be RVC Character Model for RVC conversion

  • Explanation: This warning indicates that the narrator_target is not recognized as an RVC Character Model, which is required for RVC conversions.
  • Solution: Verify that the narrator_target is correctly set as an RVC Character Model. If not, adjust the input to match the expected format for RVC conversions.

🔄 Voice Changer Related Nodes

Go back to the extension to check out more related nodes.
TTS Audio Suite
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.