ComfyUI > Nodes > ComfyUI_FL-CosyVoice3 > FL CosyVoice3 Voice Conversion

ComfyUI Node: FL CosyVoice3 Voice Conversion

Class Name

FL_CosyVoice3_VoiceConversion

Category
🔊FL CosyVoice3/Synthesis
Author
filliptm (Account age: 2386days)
Extension
ComfyUI_FL-CosyVoice3
Latest Updated
2026-03-21
Github Stars
0.11K

How to Install ComfyUI_FL-CosyVoice3

Install this extension via the ComfyUI Manager by searching for ComfyUI_FL-CosyVoice3
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_FL-CosyVoice3 in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

FL CosyVoice3 Voice Conversion Description

Transforms source audio to match target voice using advanced machine learning models.

FL CosyVoice3 Voice Conversion:

The FL_CosyVoice3_VoiceConversion node is designed to transform the voice characteristics of a source audio file to match those of a target voice reference. This process, known as voice-to-voice conversion, allows you to take an audio recording of one voice and modify it so that it sounds like another specified voice. This node leverages advanced machine learning models to achieve high-quality voice conversion, making it a powerful tool for applications such as dubbing, voice cloning, and personalized voice assistants. By using this node, you can seamlessly convert voices while maintaining the naturalness and expressiveness of the original speech, providing a versatile solution for creative audio projects.

FL CosyVoice3 Voice Conversion Input Parameters:

model

The model parameter specifies the CosyVoice model to be used for voice conversion. This model is responsible for processing the audio data and performing the conversion. It is crucial to select a model that supports voice conversion to ensure successful execution. The model is loaded from the ModelLoader and should be compatible with the node's requirements.

source_audio

The source_audio parameter is the audio file that you wish to convert. This is the original voice recording that will be transformed to sound like the target voice. The quality and clarity of the source audio can significantly impact the conversion results, so it is recommended to use high-quality recordings.

target_audio

The target_audio parameter serves as the reference voice that the source audio will be converted to resemble. This audio file provides the voice characteristics that the conversion process will aim to replicate. Like the source audio, the target audio should be clear and representative of the desired voice characteristics.

speed

The speed parameter allows you to adjust the speech speed of the converted audio. It is a float value with a default of 1.0, meaning no change in speed. You can set it between 0.5 and 2.0, with increments of 0.05, to slow down or speed up the speech. Adjusting this parameter can help match the pacing of the target voice or achieve specific artistic effects.

seed

The seed parameter is an optional integer that sets the random seed for the conversion process. By default, it is set to 42, but you can choose any value between -1 and 2147483647. Setting a specific seed ensures reproducibility of results, while a seed of -1 allows for random variations in the conversion output.

FL CosyVoice3 Voice Conversion Output Parameters:

audio

The audio output parameter is the result of the voice conversion process. It contains the converted audio in the ComfyUI AUDIO format, ready for playback or further processing. This output retains the content of the source audio but with the voice characteristics of the target audio, providing a seamless transformation that can be used in various creative applications.

FL CosyVoice3 Voice Conversion Usage Tips:

  • Ensure that both the source and target audio files are of high quality to achieve the best conversion results. Clear and noise-free recordings will enhance the naturalness of the converted voice.
  • Experiment with the speed parameter to find the optimal speech rate that matches the target voice's natural pacing. This can help in achieving a more authentic and convincing voice conversion.
  • Use the seed parameter to control the randomness of the conversion process. Setting a specific seed can help you achieve consistent results across multiple conversions, which is useful for projects requiring uniformity.

FL CosyVoice3 Voice Conversion Common Errors and Solutions:

Model does not support voice conversion

  • Explanation: This error occurs when the selected model does not have the capability to perform voice conversion.
  • Solution: Ensure that you are using a CosyVoice model that supports voice conversion. Check the model documentation or consult with the provider to confirm compatibility.

Error in voice conversion: <specific error message>

  • Explanation: This error indicates that an unexpected issue occurred during the voice conversion process.
  • Solution: Review the error message for specific details and check the input parameters for correctness. Ensure that the audio files are accessible and in the correct format. If the problem persists, consult the traceback for more information and consider reaching out for technical support.

FL CosyVoice3 Voice Conversion Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI_FL-CosyVoice3
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

FL CosyVoice3 Voice Conversion