ComfyUI > Nodes > ComfyUI-QwenTTS > Custom Voice (QwenTTS)

ComfyUI Node: Custom Voice (QwenTTS)

Class Name

AILab_Qwen3TTSCustomVoice

Category
🧪AILab/🎙️QwenTTS
Author
1038lab (Account age: 0days)
Extension
ComfyUI-QwenTTS
Latest Updated
2026-03-18
Github Stars
0.2K

How to Install ComfyUI-QwenTTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-QwenTTS
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-QwenTTS in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Custom Voice (QwenTTS) Description

Generates custom synthetic speech from text using QwenTTS for personalized voice outputs.

Custom Voice (QwenTTS):

The AILab_Qwen3TTSCustomVoice node is designed to facilitate the creation of custom voice outputs using the QwenTTS system. This node allows you to generate synthetic speech from text inputs, providing a versatile tool for AI artists who wish to incorporate unique voice elements into their projects. By leveraging advanced text-to-speech technology, this node can produce high-quality audio that mimics human speech, offering a wide range of customization options to tailor the voice output to specific artistic needs. The primary goal of this node is to enable users to create personalized voice experiences, enhancing the auditory dimension of their creative works.

Custom Voice (QwenTTS) Input Parameters:

text

The text parameter is the primary input for the node, representing the textual content that you wish to convert into speech. This parameter directly influences the spoken output, as the node will synthesize audio based on the provided text. There are no specific minimum or maximum values for this parameter, but the length and complexity of the text can affect processing time and the resulting audio quality.

speaker

The speaker parameter allows you to select the voice profile that will be used for the speech synthesis. This parameter is crucial for defining the characteristics of the voice, such as tone, pitch, and style. Different speaker profiles can be chosen to match the desired voice characteristics for your project. The available options depend on the pre-configured speaker profiles within the QwenTTS system.

model_size

The model_size parameter determines the complexity and resource requirements of the model used for speech synthesis. Larger models may produce higher quality audio but require more computational resources. This parameter allows you to balance between audio quality and processing efficiency, depending on your available resources and quality requirements.

language

The language parameter specifies the language in which the text will be synthesized. This is important for ensuring that the pronunciation and intonation are appropriate for the language of the input text. The QwenTTS system supports multiple languages, and selecting the correct language is essential for accurate and natural-sounding speech output.

instruct

The instruct parameter provides additional guidance or instructions to the speech synthesis model, allowing for further customization of the voice output. This can include specific stylistic or emotional cues that you want the synthesized voice to convey. The use of this parameter can enhance the expressiveness and relevance of the generated speech.

unload_models

The unload_models parameter is a boolean option that determines whether the models should be unloaded from memory after processing. Setting this to True can help manage memory usage, especially when working with large models or limited resources. However, unloading models may increase processing time for subsequent operations.

seed

The seed parameter is used to initialize the random number generator for the synthesis process. By setting a specific seed value, you can ensure that the generated speech is reproducible, which is useful for consistency in iterative projects. If not specified, a random seed will be used, leading to variations in the output.

Custom Voice (QwenTTS) Output Parameters:

audio

The audio parameter is the primary output of the node, representing the synthesized speech in audio format. This output is the result of converting the input text into spoken words, using the specified voice characteristics and language settings. The audio output can be used directly in multimedia projects or further processed for additional effects.

Custom Voice (QwenTTS) Usage Tips:

  • Experiment with different speaker profiles to find the voice that best fits your project's theme or character.
  • Use the instruct parameter to add emotional depth or stylistic nuances to the synthesized speech, enhancing the overall impact of the audio.
  • Consider the model_size and unload_models parameters to optimize performance based on your available computational resources and desired audio quality.

Custom Voice (QwenTTS) Common Errors and Solutions:

"Model loading failed"

  • Explanation: This error may occur if the specified model size is too large for the available system resources.
  • Solution: Try reducing the model_size or ensure that your system has sufficient memory and processing power to handle the selected model.

"Unsupported language"

  • Explanation: This error indicates that the chosen language is not supported by the QwenTTS system.
  • Solution: Verify that the language parameter is set to a supported language and adjust it accordingly.

"Invalid speaker profile"

  • Explanation: This error occurs when the specified speaker profile does not exist or is not configured correctly.
  • Solution: Check the available speaker profiles and ensure that the speaker parameter matches one of the valid options.

Custom Voice (QwenTTS) Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-QwenTTS
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Custom Voice (QwenTTS)