
ComfyUI Node: FL Qwen3 TTS Voice Clone Prompt

Class Name

FL_Qwen3TTS_VoiceClonePrompt

Category
FL/Qwen3TTS
Author
filliptm (Account age: 2372 days)
Extension
ComfyUI-FL-Qwen3TTS
Latest Updated
2026-03-18
Github Stars
0.12K

How to Install ComfyUI-FL-Qwen3TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-FL-Qwen3TTS:
  1. Click the Manager button in the main menu.
  2. Select the Custom Nodes Manager button.
  3. Enter ComfyUI-FL-Qwen3TTS in the search bar and install it from the results.
After installation, click the Restart button to restart ComfyUI. Then refresh your browser manually to clear the cache and load the updated list of nodes.


FL Qwen3 TTS Voice Clone Prompt Description

Creates voice clone prompts for Qwen3-TTS from reference audio and optional reference text.

FL Qwen3 TTS Voice Clone Prompt:

The FL_Qwen3TTS_VoiceClonePrompt node builds a voice clone prompt for the Qwen3-TTS model from reference audio and optional reference text. It pre-computes the data that Qwen3-TTS needs to mimic a specific voice, so that generated audio closely resembles the voice in the reference audio. This is useful for applications that need a personalized voice, such as virtual assistants and audiobooks. The model handles the sample rate of the reference audio internally, so users do not need to resample it beforehand.

FL Qwen3 TTS Voice Clone Prompt Input Parameters:

ref_audio

The ref_audio parameter is required. It supplies the reference audio from which the voice characteristics are extracted to build the voice clone prompt. The parameter accepts audio data in various formats, and the model manages the sample rate internally, so users do not need to adjust it manually. The quality of this audio directly affects the accuracy and quality of the cloned voice.

x_vector_only_mode

The x_vector_only_mode parameter is a boolean flag. When set to True, the node uses only the x-vector, a compact representation of the speaker's voice characteristics, for voice cloning. This can speed up the process, possibly at the cost of some detail in voice reproduction. When set to False, the node uses the more detailed voice cloning process.

ref_text

The ref_text parameter is optional. It lets users supply reference text alongside the reference audio. With reference text, the model can better align the audio characteristics with the speech content, which may improve the naturalness and accuracy of the generated voice. If it is not provided, the node relies solely on the audio input for voice cloning.

FL Qwen3 TTS Voice Clone Prompt Output Parameters:

prompt

The prompt output contains the generated voice clone prompt, a structured representation of the voice characteristics extracted from the reference audio and the optional reference text. The Qwen3-TTS model uses this prompt to synthesize audio that mimics the reference voice.

x_vector_only_mode

The x_vector_only_mode output reports whether x-vector only mode was used when the voice clone prompt was created. This is useful for understanding how a given prompt was generated and for debugging.

has_ref_text

The has_ref_text output parameter is a boolean value that indicates whether reference text was provided and used in the creation of the voice clone prompt. This output helps users verify if the text input was successfully integrated into the voice cloning process.
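
The relationship between the inputs and the three outputs can be sketched in a few lines of Python. This is only an illustrative model of the documented behavior, not the node's actual code: the VoiceClonePromptResult class and the summarize_outputs function are hypothetical names, the prompt value is a placeholder, and treating whitespace-only text as "no reference text" is an assumption.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class VoiceClonePromptResult:
    """Hypothetical container mirroring the node's three documented outputs."""
    prompt: Any
    x_vector_only_mode: bool
    has_ref_text: bool


def summarize_outputs(prompt: Any, x_vector_only_mode: bool,
                      ref_text: Optional[str] = None) -> VoiceClonePromptResult:
    # Assumption: empty or whitespace-only text counts as "no reference text".
    has_ref_text = bool(ref_text and ref_text.strip())
    # The mode flag is passed through unchanged, as the output description suggests.
    return VoiceClonePromptResult(prompt, x_vector_only_mode, has_ref_text)


# Placeholder prompt object; the real prompt is produced by the Qwen3-TTS model.
result = summarize_outputs(prompt={"placeholder": True},
                           x_vector_only_mode=False,
                           ref_text="Hello there.")
print(result.has_ref_text, result.x_vector_only_mode)
```

Checking has_ref_text downstream is a quick way to confirm that a connected text input actually reached the node.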

FL Qwen3 TTS Voice Clone Prompt Usage Tips:

  • Ensure that the reference audio provided is of high quality and free from background noise to achieve the best voice cloning results.
  • Experiment with the x_vector_only_mode parameter to find the right balance between processing speed and voice detail, depending on your specific application needs.
  • If possible, provide reference text that closely matches the intended speech content to enhance the naturalness and accuracy of the generated voice.
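
As a rough pre-check for the first tip, the sketch below reports the peak and RMS level of a list of audio samples and flags clipping or a very quiet recording. It is a minimal example under stated assumptions: samples are floats in the range -1.0 to 1.0, the level_report function and both thresholds are illustrative choices rather than values taken from the extension, and the check does not measure background noise.

```python
import math


def level_report(samples, clip_threshold=0.999, quiet_rms=0.01):
    """Summarize signal level for floats in [-1.0, 1.0].

    Both thresholds are illustrative assumptions, not values from the node.
    """
    if not samples:
        raise ValueError("no samples to analyze")
    peak = max(abs(s) for s in samples)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return {
        "peak": peak,
        "rms": rms,
        "clipped": peak >= clip_threshold,   # samples at or near full scale
        "too_quiet": rms < quiet_rms,        # very low average energy
    }


# One second of a 220 Hz tone at half amplitude, sampled at 16 kHz.
tone = [0.5 * math.sin(2 * math.pi * 220 * n / 16000) for n in range(16000)]
report = level_report(tone)
print(report["clipped"], report["too_quiet"])
```

A recording that is flagged as clipped or too quiet is worth re-recording or re-exporting before it is used as reference audio.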

FL Qwen3 TTS Voice Clone Prompt Common Errors and Solutions:

Failed to create voice clone prompt: <error_message>

  • Explanation: This error occurs when the node encounters an issue during the creation of the voice clone prompt, possibly due to invalid input parameters or issues with the reference audio.
  • Solution: Verify that the reference audio is correctly formatted and free from corruption. Ensure that all required parameters are provided and correctly configured. Check the logs for more detailed error messages that can guide troubleshooting.
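
One way to check a reference file for corruption before loading it into ComfyUI is to open it with Python's standard wave module, as sketched below. This assumes the reference audio is an uncompressed PCM WAV file; the inspect_wav helper and the generated demo file are illustrative and are not part of the extension.

```python
import math
import os
import struct
import tempfile
import wave


def inspect_wav(path):
    """Return basic properties of a PCM WAV file, or raise if it cannot be parsed."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate": rate,
            "duration_s": frames / rate,
        }


# Build a short mono test file so the sketch runs on its own.
tmp_dir = tempfile.mkdtemp()
demo_path = os.path.join(tmp_dir, "reference.wav")
with wave.open(demo_path, "wb") as wav:
    wav.setnchannels(1)
    wav.setsampwidth(2)        # 16-bit PCM
    wav.setframerate(16000)
    samples = (int(12000 * math.sin(2 * math.pi * 220 * n / 16000))
               for n in range(16000))
    wav.writeframes(b"".join(struct.pack("<h", s) for s in samples))

info = inspect_wav(demo_path)
print(info)
```

If the file is not a readable WAV, wave.open raises wave.Error (or EOFError for an empty or truncated file), which points to a problem with the file rather than with the node.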

Either voice_clone_prompt or ref_audio must be provided.

  • Explanation: This error indicates that neither a pre-computed voice clone prompt nor reference audio was provided. At least one of the two is required.
  • Solution: Provide either a valid reference audio file or a pre-computed voice clone prompt to proceed with the voice cloning process.
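
The either/or requirement behind this message can be illustrated with a small check. The pick_voice_source function is a hypothetical helper written for this example, and preferring the pre-computed prompt over raw audio when both are supplied is an assumption, not confirmed behavior of the extension.

```python
def pick_voice_source(voice_clone_prompt=None, ref_audio=None):
    """Hypothetical helper: choose which voice input to use for cloning."""
    # Assumption: a pre-computed prompt takes priority when both are given.
    if voice_clone_prompt is not None:
        return "prompt", voice_clone_prompt
    if ref_audio is not None:
        return "audio", ref_audio
    # Neither input was supplied, which matches the documented error message.
    raise ValueError("Either voice_clone_prompt or ref_audio must be provided.")


kind, value = pick_voice_source(ref_audio="reference.wav")
print(kind)
```

Calling pick_voice_source() with no arguments raises the ValueError, which mirrors the situation where neither input is connected in the workflow.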
