FL Qwen3 TTS Voice Clone Prompt:
The FL_Qwen3TTS_VoiceClonePrompt node builds a voice clone prompt for the Qwen3-TTS model from reference audio and optional reference text. It pre-computes the data the model needs to mimic a specific voice, so that generated audio closely resembles the voice in the reference recording. This is useful for applications that need a personalized voice, such as virtual assistants, audiobooks, and other AI-driven voice applications. The model handles the sample rate internally, so the reference audio does not need to be resampled beforehand.
FL Qwen3 TTS Voice Clone Prompt Input Parameters:
ref_audio
The ref_audio parameter supplies the reference audio from which the voice characteristics are extracted. The node uses it to build the voice clone prompt that the model later uses to generate audio with similar voice traits. The parameter accepts audio data in various formats, and the model manages the sample rate internally, so users do not need to adjust it manually. This input is required, and its quality directly affects the accuracy of the voice cloning process.
x_vector_only_mode
The x_vector_only_mode parameter is a boolean flag that controls whether the node extracts only the x-vector, a compact representation of the speaker's voice characteristics. When set to True, the node uses only the x-vector for voice cloning, which may speed up the process at the cost of some detail in voice reproduction. This lets users choose between a more detailed voice cloning process and a faster, more streamlined one.
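To make the idea of a "compact representation" more concrete: an x-vector is a fixed-length speaker embedding, and such embeddings are commonly compared with cosine similarity. The sketch below is only an illustration of that idea. The four-dimensional vectors are made-up toy values, not output from Qwen3-TTS, and the helper name cosine_similarity is not part of the node.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for real speaker embeddings (assumed values).
speaker_a_take1 = [0.9, 0.1, 0.3, 0.2]
speaker_a_take2 = [0.8, 0.2, 0.35, 0.15]
speaker_b = [0.1, 0.9, 0.2, 0.7]

# Two takes of the same (toy) speaker score higher than two different speakers.
same = cosine_similarity(speaker_a_take1, speaker_a_take2)
different = cosine_similarity(speaker_a_take1, speaker_b)
print(f"same speaker: {same:.3f}, different speaker: {different:.3f}")
```

Because a single vector summarizes the whole recording, it captures overall timbre but not moment-to-moment detail, which is consistent with the trade-off described above.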
ref_text
The ref_text parameter is optional and allows users to provide a text reference that can be used alongside the audio to enhance the voice cloning process. By supplying reference text, the model can better align the audio characteristics with the intended speech content, potentially improving the naturalness and accuracy of the generated voice. If not provided, the node will rely solely on the audio input for voice cloning.
FL Qwen3 TTS Voice Clone Prompt Output Parameters:
prompt
The prompt output parameter contains the generated voice clone prompt, which is a structured representation of the voice characteristics extracted from the reference audio and optional text. This prompt is used by the Qwen3-TTS model to synthesize audio that mimics the reference voice. It is a critical component for achieving high-quality voice cloning results.
x_vector_only_mode
The x_vector_only_mode output parameter reflects the mode in which the node operated, indicating whether the x-vector only mode was used during the voice cloning prompt creation. This information can be useful for understanding the context of the generated prompt and for debugging purposes.
has_ref_text
The has_ref_text output parameter is a boolean value that indicates whether reference text was provided and used in the creation of the voice clone prompt. This output helps users verify if the text input was successfully integrated into the voice cloning process.
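The relationship between the three outputs can be sketched as follows. This is a minimal illustration, not the node's actual code: the VoiceClonePromptOutputs class and the describe_outputs helper are hypothetical names, the prompt is treated as an opaque object, and treating whitespace-only text as "no reference text" is an assumption.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class VoiceClonePromptOutputs:
    """Hypothetical container mirroring the node's three outputs."""
    prompt: Any                # opaque voice clone prompt object
    x_vector_only_mode: bool   # mode the prompt was created in
    has_ref_text: bool         # whether reference text was supplied


def describe_outputs(prompt: Any,
                     x_vector_only_mode: bool,
                     ref_text: Optional[str] = None) -> VoiceClonePromptOutputs:
    # Assumption: empty or whitespace-only text counts as "not provided".
    has_ref_text = bool(ref_text and ref_text.strip())
    return VoiceClonePromptOutputs(prompt, bool(x_vector_only_mode), has_ref_text)


outputs = describe_outputs(prompt={"placeholder": True},
                           x_vector_only_mode=False,
                           ref_text="Hello there.")
print(outputs.x_vector_only_mode, outputs.has_ref_text)
```

Checking has_ref_text downstream is a quick way to confirm that a connected text input actually reached the node.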
FL Qwen3 TTS Voice Clone Prompt Usage Tips:
- Ensure that the reference audio provided is of high quality and free from background noise to achieve the best voice cloning results.
- Experiment with the x_vector_only_mode parameter to find the right balance between processing speed and voice detail, depending on your specific application needs.
- If possible, provide reference text that closely matches the intended speech content to enhance the naturalness and accuracy of the generated voice.
FL Qwen3 TTS Voice Clone Prompt Common Errors and Solutions:
Failed to create voice clone prompt: <error_message>
- Explanation: This error occurs when the node encounters an issue during the creation of the voice clone prompt, possibly due to invalid input parameters or issues with the reference audio.
- Solution: Verify that the reference audio is correctly formatted and free from corruption. Ensure that all required parameters are provided and correctly configured. Check the logs for more detailed error messages that can guide troubleshooting.
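The "Failed to create voice clone prompt: <error_message>" format suggests that an underlying exception is caught and re-raised with a prefix. A minimal sketch of that pattern follows, assuming a hypothetical build_prompt callable in place of the real model call. The function names and the exception type are illustrative and not taken from the node's source.

```python
def create_prompt_safely(build_prompt, ref_audio, ref_text=None,
                         x_vector_only_mode=False):
    """Call a prompt builder and wrap any failure in a prefixed error."""
    try:
        return build_prompt(ref_audio, ref_text, x_vector_only_mode)
    except Exception as exc:
        # Keep the original exception as the cause so its traceback
        # remains available in the logs.
        raise RuntimeError(f"Failed to create voice clone prompt: {exc}") from exc


def broken_builder(ref_audio, ref_text, x_vector_only_mode):
    # Hypothetical stand-in that simulates a bad reference clip.
    raise ValueError("reference audio is empty")


try:
    create_prompt_safely(broken_builder, ref_audio=None)
except RuntimeError as err:
    message = str(err)
    cause = err.__cause__
    print(message)
```

The text after the colon is the original error, so it is usually the most useful part to read when troubleshooting.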
Either voice_clone_prompt or ref_audio must be provided.
- Explanation: This error indicates that neither a pre-computed voice clone prompt nor reference audio was provided. At least one of the two is required for voice cloning.
- Solution: Provide either a valid reference audio file or a pre-computed voice clone prompt to proceed with the voice cloning process.
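The check behind this message can be sketched as a simple guard. This is an illustration under assumptions: the require_voice_source helper is hypothetical, and giving the pre-computed prompt precedence when both inputs are present is a guess rather than documented behavior.

```python
def require_voice_source(voice_clone_prompt=None, ref_audio=None):
    """Return which input will be used, or raise if neither is present."""
    if voice_clone_prompt is None and ref_audio is None:
        raise ValueError("Either voice_clone_prompt or ref_audio must be provided.")
    # Assumption: a pre-computed prompt takes precedence over raw audio.
    if voice_clone_prompt is not None:
        return "voice_clone_prompt"
    return "ref_audio"


print(require_voice_source(ref_audio={"placeholder": "audio"}))
```

In a workflow, this usually means connecting either the prompt output of this node or a reference audio source to the node that performs the cloning.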
