ComfyUI > Nodes > ComfyUI-FreeVC_wrapper > FreeVC Voice Converter 🎤

ComfyUI Node: FreeVC Voice Converter 🎤

Class Name

FreeVC Voice Conversion

Category
audio/voice conversion
Author
ShmuelRonen (Account age: 1536days)
Extension
ComfyUI-FreeVC_wrapper
Latest Updated
2025-04-03
Github Stars
0.06K

How to Install ComfyUI-FreeVC_wrapper

Install this extension via the ComfyUI Manager by searching for ComfyUI-FreeVC_wrapper
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-FreeVC_wrapper in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

FreeVC Voice Converter 🎤 Description

Sophisticated tool for high-quality, text-free voice conversion using advanced machine learning models for natural, expressive results.

FreeVC Voice Converter 🎤:

FreeVC Voice Conversion is a sophisticated tool designed to facilitate high-quality, text-free, one-shot voice conversion. This node allows you to transform the voice characteristics of a source audio file to match those of a reference audio file, without the need for textual input. The primary goal of FreeVC is to enable seamless voice conversion that maintains the naturalness and expressiveness of the original audio while adopting the vocal traits of the reference. This is particularly beneficial for applications in voice synthesis, dubbing, and personalized voice assistants. The node leverages advanced machine learning models to ensure that the converted voice sounds authentic and retains the emotional nuances of the original speech. By using FreeVC, you can achieve impressive voice conversion results with minimal input, making it an invaluable tool for AI artists and developers working with audio content.

FreeVC Voice Converter 🎤 Input Parameters:

model_type

The model_type parameter allows you to select the specific model variant to use for voice conversion. The available options are "FreeVC", "FreeVC-s", and "FreeVC (24kHz)". Each model variant is optimized for different scenarios, with "FreeVC" being the standard model, "FreeVC-s" offering a streamlined version, and "FreeVC (24kHz)" providing higher sampling rate support for enhanced audio quality. Choosing the right model type can significantly impact the quality and characteristics of the converted audio.

source_audio

The source_audio parameter is the file path to the audio file whose voice characteristics you wish to convert. This audio serves as the base for the conversion process, and its vocal traits will be transformed to match those of the reference audio. The quality and clarity of the source audio can affect the final output, so it is recommended to use high-quality recordings.

reference_audio

The reference_audio parameter is the file path to the audio file that provides the target voice characteristics. The voice traits from this audio will be applied to the source audio during the conversion process. It is important to ensure that the reference audio is clear and free of excessive silence, as this can affect the speaker similarity in the output.

secondary_reference

The secondary_reference parameter is an optional input that allows you to provide an additional reference audio file. This can be used to further refine the voice conversion process by incorporating additional vocal traits from another source. If not provided, the conversion will rely solely on the primary reference audio.

noise_reduction_strength

The noise_reduction_strength parameter controls the level of noise reduction applied to the audio during conversion. It accepts a value between 0 and 1, with 0.5 as the default. A higher value results in more aggressive noise reduction, which can be useful for improving audio clarity in noisy recordings, but may also affect the naturalness of the voice.

clarity_enhancement

The clarity_enhancement parameter adjusts the enhancement of audio clarity. It accepts a value between 0 and 1, with 0.3 as the default. Increasing this value can help make the converted voice sound clearer and more distinct, but excessive enhancement may introduce artifacts.

temperature

The temperature parameter influences the randomness of the voice conversion process. It accepts a value between 0 and 1, with 0.7 as the default. A lower temperature results in more deterministic outputs, while a higher temperature introduces more variation, which can be useful for achieving a more natural-sounding voice.

normalize_output

The normalize_output parameter is a boolean that determines whether the output audio should be normalized. When set to True, the audio is adjusted to a consistent volume level, with a normalization level of 0.95. This helps ensure that the output audio is neither too quiet nor too loud.

FreeVC Voice Converter 🎤 Output Parameters:

waveform

The waveform output parameter provides the converted audio waveform as a tensor. This waveform represents the audio data after the voice conversion process, ready for playback or further processing. It is important for users to understand that this output is the transformed version of the source audio, now bearing the vocal characteristics of the reference audio.

sample_rate

The sample_rate output parameter indicates the sampling rate of the output audio. This is crucial for ensuring compatibility with audio playback systems and further processing tools. The sample rate is typically set to match the original audio or the selected model's requirements, ensuring high-quality audio output.

FreeVC Voice Converter 🎤 Usage Tips:

  • Ensure that both the source and reference audio files are of high quality and free from excessive noise or silence to achieve the best conversion results.
  • Experiment with different model types to find the one that best suits your specific audio conversion needs, as each model offers unique benefits.
  • Adjust the noise_reduction_strength and clarity_enhancement parameters to balance between audio clarity and naturalness, especially when working with noisy recordings.
  • Use the temperature parameter to control the variability of the conversion, which can help achieve a more natural-sounding voice.

FreeVC Voice Converter 🎤 Common Errors and Solutions:

"Error: Reference audio contains too much silence"

  • Explanation: The reference audio file has excessive silent segments, which can negatively impact the speaker similarity in the conversion process.
  • Solution: Trim the reference audio to remove unnecessary silence before submitting it for conversion.

"Error: Model type not recognized"

  • Explanation: The specified model type is not one of the recognized options ("FreeVC", "FreeVC-s", "FreeVC (24kHz)").
  • Solution: Ensure that the model_type parameter is set to one of the valid options provided by the node.

"Error: Source or reference audio file not found"

  • Explanation: The file path provided for the source or reference audio does not point to a valid file.
  • Solution: Verify that the file paths for both the source and reference audio are correct and that the files exist at the specified locations.

FreeVC Voice Converter 🎤 Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-FreeVC_wrapper
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.