RunComfy

ComfyUI Node: 🔄 ChatterBox Voice Conversion

Class Name: ChatterBoxVoiceVC
Category: ChatterBox Voice
Author: ShmuelRonen (Account age: 1863 days)
Extension: ComfyUI_ChatterBox_Voice
Latest Updated: 2025-06-04
Github Stars: 0.02K

Table of Content

Description
ChatterBoxVoiceVC:
ChatterBoxVoiceVC Input Parameters:
ChatterBoxVoiceVC Output Parameters:
ChatterBoxVoiceVC Usage Tips:
ChatterBoxVoiceVC Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_ChatterBox_Voice

Install this extension via the ComfyUI Manager by searching for ComfyUI_ChatterBox_Voice:

1. Click the Manager button in the main menu.
2. Click the Custom Nodes Manager button.
3. Enter ComfyUI_ChatterBox_Voice in the search bar and install the matching extension.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

🔄 ChatterBox Voice Conversion Description

ChatterBoxVoiceVC enables realistic voice conversion, transforming audio to match target profiles.

🔄 ChatterBox Voice Conversion:

ChatterBoxVoiceVC is a voice conversion node: it transforms the voice characteristics of an audio input to match a target voice profile while preserving the original content and context of the speech. Typical applications include dubbing, voice cloning, and personalized voice experiences.

🔄 ChatterBox Voice Conversion Input Parameters:

source_audio

The source_audio parameter is the audio that contains the original voice you want to convert. It serves as the baseline for the conversion, and its quality and clarity can significantly affect the final output, so use a clean, high-quality recording.

target_audio

The target_audio parameter is the audio that contains the voice characteristics you want to apply to the source audio. It acts as the target voice profile that guides the conversion. As with the source audio, a high-quality recording helps produce a convincing, natural-sounding result.

device

The device parameter specifies the compute device on which the voice conversion model runs. It can be set to a CPU or a GPU, depending on the available hardware and performance requirements. A GPU can significantly speed up conversion, especially for large audio files or batch processing.
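As a rough illustration of how a device setting like this is commonly resolved, the sketch below picks a GPU when PyTorch reports one and uses the CPU otherwise. The helper name resolve_device and the option strings "auto", "cuda", and "cpu" are assumptions made for this example; they are not confirmed values of the node.

```python
def resolve_device(requested: str = "auto") -> str:
    """Map a requested device setting to "cuda" or "cpu" (hypothetical helper)."""
    try:
        import torch  # optional: only used to detect a GPU

        cuda_available = torch.cuda.is_available()
    except ImportError:
        # Without PyTorch there is no way to use a GPU here.
        cuda_available = False

    if requested == "cpu":
        return "cpu"
    if requested in ("auto", "cuda"):
        # Use the CPU when no usable GPU is detected.
        return "cuda" if cuda_available else "cpu"
    raise ValueError(f"Unknown device setting: {requested!r}")


print(resolve_device("auto"))
```

Resolving the setting up front means a GPU request on a machine without one degrades to CPU processing instead of failing later.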

🔄 ChatterBox Voice Conversion Output Parameters:

waveform

The waveform output is a tensor that holds the converted audio waveform. It includes a batch dimension, which makes it compatible with further processing or playback in ComfyUI. The waveform retains the speech content of the source audio while adopting the vocal characteristics of the target audio.

sample_rate

The sample_rate output is the sampling rate of the converted audio. Downstream nodes need this value to play the audio back at the correct speed and pitch. The sample rate is typically consistent with the original audio files, maintaining audio fidelity and synchronization.
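To make the relationship between the two outputs concrete, here is a small sketch that reads the dimensions of a batched waveform and derives its duration from the sample rate. It uses nested Python lists in place of a tensor and assumes a [batch, channels, samples] layout. That layout, the helper names, and the 24000 Hz figure are illustrative assumptions, not details confirmed for this node.

```python
def describe_audio(waveform, sample_rate):
    """Summarize a waveform laid out as [batch, channels, samples] (assumed layout)."""
    batch = len(waveform)
    channels = len(waveform[0])
    samples = len(waveform[0][0])
    return {
        "batch": batch,
        "channels": channels,
        "samples": samples,
        "duration_s": samples / sample_rate,
    }


def playback_speed(true_rate, assumed_rate):
    """Speed factor heard when audio is played at a rate other than its own."""
    return assumed_rate / true_rate


# One batch item, one channel, 24000 samples of silence (arbitrary example values).
audio = [[[0.0] * 24000]]
print(describe_audio(audio, 24000)["duration_s"])  # 1.0
print(playback_speed(24000, 48000))  # 2.0
```

A speed factor of 2.0 means the clip plays in half the time and sounds an octave higher, which is why the sample rate has to travel with the waveform.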

🔄 ChatterBox Voice Conversion Usage Tips:

Ensure that both the source and target audio files are of high quality to achieve the best voice conversion results.
Utilize a GPU for processing if available, as it can significantly reduce the time required for voice conversion, especially for longer audio files.
Experiment with different target voices to explore the creative possibilities of voice conversion and find the best match for your project needs.

🔄 ChatterBox Voice Conversion Common Errors and Solutions:

FileNotFoundError

Explanation: This error occurs when the specified audio file paths for either the source or target audio do not exist or are incorrect.
Solution: Verify that the file paths are correct and that the files are accessible from the specified location.
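A quick way to rule out this error is to check the paths before running the workflow. The sketch below is a minimal example using only the Python standard library; the helper name and the file names source.wav and target.wav are placeholders.

```python
from pathlib import Path


def missing_audio_files(*paths):
    """Return the given paths that do not point to an existing file."""
    return [str(p) for p in paths if not Path(p).is_file()]


# Placeholder file names: replace them with your own source and target audio.
missing = missing_audio_files("source.wav", "target.wav")
if missing:
    print("Missing files:", missing)
else:
    print("Both audio files were found.")
```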

RuntimeError: CUDA error

Explanation: This error may arise if the GPU is not properly configured or if there is insufficient memory to process the audio files.
Solution: Ensure that your GPU drivers are up to date and that there is enough available GPU memory. If the error persists, switch the device to CPU; this is slower but remains practical for smaller files.
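The CPU fallback described here can also be automated in a script. The following sketch retries a conversion on the CPU when the GPU attempt raises a CUDA-related RuntimeError. The convert callable and fake_convert stand-in are hypothetical and only simulate a failure; this is not the node's own implementation.

```python
def run_with_cpu_fallback(convert, device="cuda"):
    """Call convert(device); on a CUDA RuntimeError, retry once on the CPU."""
    try:
        return convert(device), device
    except RuntimeError as exc:
        # Re-raise errors unrelated to CUDA, or failures already on the CPU.
        if device == "cpu" or "CUDA" not in str(exc):
            raise
        return convert("cpu"), "cpu"


def fake_convert(device):
    """Stand-in for a real conversion call; fails on the GPU to show the fallback."""
    if device == "cuda":
        raise RuntimeError("CUDA error: out of memory")
    return f"converted on {device}"


result, used_device = run_with_cpu_fallback(fake_convert, "cuda")
print(result, used_device)  # converted on cpu cpu
```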

ValueError: Invalid audio format

Explanation: This error indicates that the provided audio files are in an unsupported format or have incompatible properties.
Solution: Convert the audio files to a supported format, such as WAV, and ensure they have consistent sample rates and bit depths.
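To check whether two WAV files have consistent properties, you can inspect them with Python's standard wave module, as sketched below. The helper names are made up for this example, and the wave module reads only uncompressed PCM WAV files, so other formats need to be converted first.

```python
import wave


def wav_properties(path):
    """Read the sample rate, channel count, and bit depth of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "sample_rate": wav.getframerate(),
            "channels": wav.getnchannels(),
            "bit_depth": wav.getsampwidth() * 8,
        }


def property_mismatches(source_path, target_path):
    """Return the properties that differ as {name: (source_value, target_value)}."""
    source = wav_properties(source_path)
    target = wav_properties(target_path)
    return {
        name: (source[name], target[name])
        for name in source
        if source[name] != target[name]
    }
```

Calling property_mismatches with your source and target paths returns an empty dictionary when the sample rate, channel count, and bit depth all match.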

🔄 ChatterBox Voice Conversion Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_ChatterBox_Voice

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.
