ComfyUI > Nodes > ComfyUI_FL-CosyVoice3

ComfyUI Extension: ComfyUI_FL-CosyVoice3

Repo Name

ComfyUI_FL-CosyVoice3

Author
filliptm (Account age: 2386 days)
Nodes
View all nodes(10)
Latest Updated
2026-03-21
Github Stars
0.11K

How to Install ComfyUI_FL-CosyVoice3

Install this extension via the ComfyUI Manager by searching for ComfyUI_FL-CosyVoice3
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_FL-CosyVoice3 in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI_FL-CosyVoice3 Description

ComfyUI_FL-CosyVoice3 enhances ComfyUI with advanced text-to-speech capabilities, offering zero-shot voice cloning, cross-lingual synthesis, and voice conversion. It supports 9 languages and 18+ Chinese dialects, utilizing the CosyVoice3 model family for seamless voice synthesis and automatic model management.

ComfyUI_FL-CosyVoice3 Introduction

ComfyUI_FL-CosyVoice3 is an advanced extension designed to enhance the text-to-speech capabilities within the ComfyUI environment. Powered by the CosyVoice3 model family, this extension offers a suite of features that allow you to create highly realistic and versatile voice outputs. Whether you're looking to clone a voice, synthesize speech in multiple languages, or convert one voice to sound like another, ComfyUI_FL-CosyVoice3 provides the tools you need. This extension is particularly beneficial for AI artists who want to incorporate dynamic and expressive audio elements into their projects without needing extensive technical knowledge.

How ComfyUI_FL-CosyVoice3 Works

At its core, ComfyUI_FL-CosyVoice3 leverages large language models to perform text-to-speech synthesis. The extension uses a process called zero-shot voice cloning, which means it can replicate a voice using only a short audio sample as a reference. This is akin to an artist capturing the essence of a subject with just a few brushstrokes. Additionally, the extension supports cross-lingual synthesis, allowing a voice to speak in different languages while maintaining its unique characteristics. This is achieved through sophisticated algorithms that analyze and reproduce the nuances of speech, such as tone and accent, across various languages.

ComfyUI_FL-CosyVoice3 Features

  • Zero-Shot Voice Cloning: This feature allows you to clone any voice using a reference audio clip of just 3 to 30 seconds. It's perfect for creating personalized voiceovers or character voices.
  • Cross-Lingual Synthesis: Speak in nine different languages, including Chinese, English, Japanese, and more, while preserving the original voice's characteristics. This feature is ideal for multilingual projects.
  • Voice Conversion: Transform one voice to sound like another, enabling creative audio transformations and character development.
  • Auto Transcription: Integrated with Whisper, this feature automatically transcribes reference audio, making it easier to work with text and audio simultaneously.
  • Speed Control: Adjust the speech rate from 0.5x to 2.0x, allowing for creative control over the pacing of the audio output.

ComfyUI_FL-CosyVoice3 Models

The extension supports several models, each tailored for different needs:

  • Fun-CosyVoice3-0.5B: This is the recommended model, offering a balance of performance and quality with a size of approximately 2GB.
  • CosyVoice2-0.5B: An alternative model of similar size, providing different stylistic outputs.
  • CosyVoice-300M: A lightweight model at around 1.2GB, though it may not perform as well as the others.

These models are automatically downloaded and cached for ease of use.

What's New with ComfyUI_FL-CosyVoice3

Recent updates have introduced new nodes and features to enhance functionality:

  • Instruct2 Node: Allows voice cloning with instructive text, expanding the creative possibilities for voice synthesis.
  • Save Speaker Node: Enables saving of voice presets, making it easier to reuse specific voice settings across projects.
  • Speaker Clone and Speaker Instruct2 Nodes: These nodes facilitate voice cloning using saved presets, streamlining the workflow for repeated use of specific voices.

Troubleshooting ComfyUI_FL-CosyVoice3

Here are some common issues and solutions:

  • Model Loading Issues: Ensure that your internet connection is stable for downloading models. If a model fails to load, try restarting ComfyUI and reloading the model.
  • Voice Cloning Errors: If the voice cloning does not sound accurate, check the quality and length of your reference audio. A clear audio sample of 3-30 seconds is recommended.
  • Language Synthesis Problems: Ensure that the text input is correctly formatted for the desired language. Double-check language settings in the Cross-Lingual node.

Learn More about ComfyUI_FL-CosyVoice3

For further learning and support, consider exploring the following resources:

  • CosyVoice Original Repository: Access the original codebase and documentation for deeper insights into the technology behind ComfyUI_FL-CosyVoice3. - CosyVoice Demos: View demonstrations of the CosyVoice models in action to understand their capabilities.
  • Community Forums: Join discussions and seek help from other users and developers. These resources provide valuable information and community support to help you make the most of ComfyUI_FL-CosyVoice3 in your creative projects.

ComfyUI_FL-CosyVoice3 Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

ComfyUI_FL-CosyVoice3 detailed guide | ComfyUI