
ComfyUI Node: Qwen3-TTS Prompt Maker

Class Name

Qwen3PromptMaker

Category
Qwen3-TTS
Author
wanaigc (Account age: 0 days)
Extension
ComfyUI-Qwen3-TTS
Latest Updated
2026-03-21
Github Stars
0.09K

How to Install ComfyUI-Qwen3-TTS

Install this extension through the ComfyUI Manager:
  1. Click the Manager button in the main menu.
  2. Click the Custom Nodes Manager button.
  3. Enter ComfyUI-Qwen3-TTS in the search bar and install the extension from the results.
After installation, click the Restart button to restart ComfyUI, then manually refresh your browser to clear the cache and load the updated node list.

Qwen3-TTS Prompt Maker Description

Creates voice clone prompts for Qwen3-TTS from a reference audio clip and its transcript.

Qwen3-TTS Prompt Maker:

Qwen3PromptMaker builds a voice clone prompt for the Qwen3-TTS framework from a reference audio clip and its transcript. Downstream synthesis nodes can use that prompt to generate speech that closely matches the reference voice. This is useful wherever a personalized voice is needed, such as virtual assistants, audiobooks, and other voice-driven interfaces. How closely the output matches the reference depends largely on the quality of the reference audio and the accuracy of its transcript.

Qwen3-TTS Prompt Maker Input Parameters:

ref_audio

The ref_audio parameter supplies the reference audio whose voice characteristics will be cloned. The model captures the vocal traits present in this sample, so the clip's quality and length directly affect the accuracy of the resulting voice clone. Use a clear recording that is representative of the target voice. The maximum length is controlled by the ref_audio_max_seconds parameter, which defaults to 30 seconds so that overly long clips do not stall generation.

ref_text

The ref_text parameter provides the transcript of the reference audio. It gives the model the linguistic context of the clip, which helps it reproduce speech patterns and intonation. The text should be a faithful transcription of what is spoken in the reference audio; an accurate, well-aligned transcript improves the coherence and naturalness of the voice clone.
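
As a rough illustration of checking a transcript before use, the sketch below normalizes whitespace and flags a transcript whose length looks implausible for the clip duration. The function names and the characters-per-second bounds are assumptions made for this example; they are not part of the node or of Qwen3-TTS.

```python
import re


def normalize_transcript(text: str) -> str:
    """Collapse runs of whitespace into single spaces and strip the ends."""
    return re.sub(r"\s+", " ", text).strip()


def transcript_looks_plausible(text: str, audio_seconds: float,
                               min_cps: float = 2.0,
                               max_cps: float = 30.0) -> bool:
    """Return True if characters-per-second falls inside a loose band.

    The bounds are illustrative guesses, not values taken from Qwen3-TTS.
    """
    cleaned = normalize_transcript(text)
    if not cleaned or audio_seconds <= 0:
        return False
    chars_per_second = len(cleaned) / audio_seconds
    return min_cps <= chars_per_second <= max_cps
```

A check like this only catches gross mismatches, such as pasting the wrong transcript; it cannot confirm that the words actually match the audio.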

ref_audio_max_seconds

The ref_audio_max_seconds parameter sets the maximum duration of the reference audio, in seconds. It limits the processing load and keeps generation from hanging on excessively long inputs. The default is 30 seconds, which is generally enough to capture the essential characteristics of a voice. Raising the value trades longer processing time for a richer voice sample; lowering it does the reverse.
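
The effect of a duration cap can be sketched in a few lines. This page does not say whether the node truncates or rejects clips that are too long, so the example below assumes simple truncation from the start of the clip. The function name and the use of a plain Python list of samples are hypothetical, chosen to keep the example self-contained.

```python
def trim_reference(samples, sample_rate, max_seconds=30):
    """Return at most max_seconds of audio from the start of samples.

    Assumes truncation; the actual node may handle long clips differently.
    """
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    if max_seconds <= 0:
        raise ValueError("max_seconds must be positive")
    max_samples = int(sample_rate * max_seconds)
    return samples[:max_samples]


# Usage: a 45-second clip at 16 kHz is cut down to 30 seconds.
clip = [0.0] * (16000 * 45)
trimmed = trim_reference(clip, 16000)
duration_seconds = len(trimmed) / 16000
```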

Qwen3-TTS Prompt Maker Output Parameters:

prompt

The prompt output is the generated voice clone prompt: a data structure that encapsulates the voice characteristics extracted from the reference audio and text. Pass it to downstream voice synthesis nodes to produce speech that closely mimics the reference voice.
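
For orientation, the inputs and output described above could be declared along the following lines using ComfyUI's usual custom-node conventions (INPUT_TYPES, RETURN_TYPES, FUNCTION, CATEGORY). This is a sketch, not the extension's actual source: the model input, the type names QWEN3_MODEL and QWEN3_PROMPT, the method name make_prompt, the INT type and minimum for ref_audio_max_seconds, and the placeholder body are all assumptions.

```python
class Qwen3PromptMakerSketch:
    """Illustrative ComfyUI-style node declaration; not the extension's code."""

    CATEGORY = "Qwen3-TTS"
    RETURN_TYPES = ("QWEN3_PROMPT",)  # hypothetical custom type name
    RETURN_NAMES = ("prompt",)
    FUNCTION = "make_prompt"          # hypothetical method name

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                # Assumed input: the error messages imply a loaded model.
                "model": ("QWEN3_MODEL",),  # hypothetical custom type name
                "ref_audio": ("AUDIO",),
                "ref_text": ("STRING", {"multiline": True, "default": ""}),
                # Default of 30 comes from this page; type and min are guesses.
                "ref_audio_max_seconds": ("INT", {"default": 30, "min": 1}),
            }
        }

    def make_prompt(self, model, ref_audio, ref_text, ref_audio_max_seconds):
        # Placeholder body: a real node would ask the model to build the prompt.
        prompt = {
            "ref_text": ref_text.strip(),
            "max_seconds": ref_audio_max_seconds,
        }
        return (prompt,)
```

ComfyUI nodes return a tuple whose entries line up with RETURN_TYPES, which is why make_prompt returns a one-element tuple.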

Qwen3-TTS Prompt Maker Usage Tips:

  • Use high-quality reference audio that is free from background noise to achieve the best voice cloning results.
  • Make sure the reference text matches the reference audio word for word so the model can capture and reproduce its speech patterns.
  • Adjust ref_audio_max_seconds to suit the length of your reference clip, balancing processing time against output quality.

Qwen3-TTS Prompt Maker Common Errors and Solutions:

Model Type Error: This model does not support creating voice clone prompts. Please load a 'Base' model.

  • Explanation: This error occurs when the loaded model does not support voice clone prompt creation, which is restricted to certain model types.
  • Solution: Load a compatible 'Base' model that supports voice clone prompt creation. Check the model documentation for compatibility details.
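
A guard of the following shape would produce this error. It is a minimal sketch: how the extension actually detects the model type is not stated here, so the model_type string argument and the function name are hypothetical. Only the error message is taken from this page.

```python
def ensure_base_model(model_type: str) -> None:
    """Raise if the loaded model is not a 'Base' variant.

    model_type is a hypothetical label; the real check may differ.
    """
    if model_type.strip().lower() != "base":
        raise ValueError(
            "Model Type Error: This model does not support creating "
            "voice clone prompts. Please load a 'Base' model."
        )
```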

No prompt files available. Save a prompt first using Qwen3-TTS Save Prompt.

  • Explanation: This error indicates that no saved prompt files were found when a node tried to load one.
  • Solution: Use the Qwen3-TTS Save Prompt node to save a prompt before attempting to load it. Ensure that the prompt is saved in the correct directory and format.
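
The check behind this error can be pictured as a directory scan that fails when nothing is found. The sketch below is an assumption-laden illustration: the function name, the prompt_dir argument, and the ".pt" file suffix are hypothetical, since this page does not say where or in what format prompts are saved.

```python
from pathlib import Path


def list_prompt_files(prompt_dir, suffix=".pt"):
    """Return sorted prompt file names, or raise if none exist.

    The directory and suffix are hypothetical placeholders.
    """
    directory = Path(prompt_dir)
    files = []
    if directory.is_dir():
        files = sorted(p.name for p in directory.glob(f"*{suffix}"))
    if not files:
        raise FileNotFoundError(
            "No prompt files available. "
            "Save a prompt first using Qwen3-TTS Save Prompt."
        )
    return files
```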

Copyright 2025 RunComfy. All Rights Reserved.
