ComfyUI Node: Qwen3-TTS Custom Voice

Class Name: Qwen3CustomVoice
Category: Qwen3-TTS
Author: wanaigc
Extension: ComfyUI-Qwen3-TTS
Latest Updated: 2026-03-21
Github Stars: 0.09K

How to Install ComfyUI-Qwen3-TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-Qwen3-TTS:
  1. Click the Manager button in the main menu.
  2. Select the Custom Nodes Manager button.
  3. Enter ComfyUI-Qwen3-TTS in the search bar and install the extension.
After installation, click the Restart button to restart ComfyUI. Then manually refresh your browser to clear the cache and load the updated list of nodes.

Qwen3-TTS Custom Voice Description

Generates speech from text with a Qwen3-TTS custom voice model, using a selectable speaker profile and optional style instructions.

Qwen3-TTS Custom Voice:

The Qwen3CustomVoice node generates speech with the Qwen3-TTS model. You supply the text, the language, a speaker profile, and optional style instructions, and the node returns the synthesized audio. It is intended for users who want voice output tailored to a specific speaker and delivery style rather than a single default voice.

Qwen3-TTS Custom Voice Input Parameters:

text

The text parameter is the text to convert into speech. It must not be empty; an empty or invalid value raises a ValueError (see Common Errors and Solutions). The clarity of the generated voice depends heavily on this text, so keep it well-structured and free of typos. No maximum length is documented, but longer text takes longer to process and may affect output quality.

language

The language parameter specifies the language in which the text is spoken. The available options depend on the languages supported by the Qwen3-TTS model. Choose the language that matches the input text; a mismatch can make the output sound unnatural or hard to understand.

target_speaker

The target_speaker parameter selects the speaker profile used for generation. The profile determines voice characteristics such as tone and pitch. The options are typically the predefined speaker profiles that ship with the model.

instruct

The instruct parameter supplies additional instructions or context for generation. Use it to influence the style or emotion of the voice. Which instructions have an effect depends on the capabilities of the Qwen3-TTS model.

max_new_tokens

The max_new_tokens parameter sets the maximum number of tokens the model may generate for the voice output, which caps the length of the generated audio. Lower values limit processing time; a value that is too low for the input text can cut the audio short.
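As a rough illustration of how the five inputs above could be declared, here is a minimal sketch that follows ComfyUI's general custom-node convention (an INPUT_TYPES classmethod plus RETURN_TYPES and FUNCTION attributes). It is not the extension's source code: the class name Qwen3CustomVoiceSketch, the option lists, the defaults, and the numeric limits are all assumptions made for illustration. The real node likely also takes a model input, which this page does not document, so it is left out.

```python
class Qwen3CustomVoiceSketch:
    """Illustrative stand-in for the node; not the extension's actual code."""

    CATEGORY = "Qwen3-TTS"
    RETURN_TYPES = ("AUDIO",)
    RETURN_NAMES = ("audio",)
    FUNCTION = "generate"

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                "text": ("STRING", {"multiline": True, "default": ""}),
                # Hypothetical option lists; the real ones come from the model.
                "language": (["Auto", "English", "Chinese"],),
                "target_speaker": (["speaker_a", "speaker_b"],),
                "instruct": ("STRING", {"multiline": True, "default": ""}),
                # Hypothetical default and limits.
                "max_new_tokens": ("INT", {"default": 2048, "min": 1, "max": 8192}),
            }
        }

    def generate(self, text, language, target_speaker, instruct, max_new_tokens):
        # The model call is intentionally omitted in this sketch.
        raise NotImplementedError("model call omitted in this sketch")


if __name__ == "__main__":
    # List the declared input names and their widget types.
    for name, spec in Qwen3CustomVoiceSketch.INPUT_TYPES()["required"].items():
        kind = spec[0] if isinstance(spec[0], str) else "COMBO"
        print(f"{name}: {kind}")
```

Declaring language and target_speaker as lists makes ComfyUI render them as dropdowns, which matches the description of predefined options above.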

Qwen3-TTS Custom Voice Output Parameters:

audio

The audio output is the only output of the Qwen3CustomVoice node: the speech generated from the input text with the selected language, speaker profile, and instructions. It can be passed to other audio nodes or used in voiceovers, podcasts, and other projects that need custom voice content.
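If the audio output uses ComfyUI's standard AUDIO type (an assumption; this page does not state the type), it is a dictionary with a "waveform" tensor shaped [batch, channels, samples] and an integer "sample_rate". Under that assumption, the clip duration can be checked with a small helper such as the hypothetical audio_duration_seconds below. The example uses a stand-in object with a shape attribute instead of a real tensor so that it runs without PyTorch.

```python
from types import SimpleNamespace


def audio_duration_seconds(audio):
    """Return the clip length in seconds for a ComfyUI-style AUDIO dict.

    Assumes audio["waveform"] has a .shape whose last axis is the sample
    count and audio["sample_rate"] is the number of samples per second.
    """
    num_samples = audio["waveform"].shape[-1]
    sample_rate = audio["sample_rate"]
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    return num_samples / sample_rate


if __name__ == "__main__":
    # Stand-in for a tensor of shape [batch=1, channels=1, samples=48000].
    fake_audio = {
        "waveform": SimpleNamespace(shape=(1, 1, 48000)),
        "sample_rate": 24000,  # illustrative value, not taken from the model
    }
    print(audio_duration_seconds(fake_audio))
```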

Qwen3-TTS Custom Voice Usage Tips:

  • Keep the input text clear, well-structured, and free of typos to get the best audio quality.
  • Try different speaker profiles and instructions to find the voice characteristics that suit your project.
  • Adjust max_new_tokens to control the length of the generated audio; raise it for longer texts so the output is not cut short.
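To make the last tip concrete, the sketch below estimates a max_new_tokens value for a target audio duration. It rests on an unverified assumption: that the model emits about 12 audio tokens per second, a figure read from the "12Hz" in the model name Qwen3-TTS-12Hz-1.7B-CustomVoice. The function name token_budget and the 20% headroom are likewise illustrative choices, so treat the result as a starting point to tune rather than an exact limit.

```python
import math

# Assumption: roughly 12 generated tokens per second of audio,
# inferred from "12Hz" in the model name; not confirmed by this page.
ASSUMED_TOKENS_PER_SECOND = 12


def token_budget(target_seconds, tokens_per_second=ASSUMED_TOKENS_PER_SECOND, headroom=1.2):
    """Estimate max_new_tokens for a clip of about target_seconds.

    headroom > 1 leaves slack so speech is not cut off mid-sentence.
    """
    if target_seconds <= 0:
        raise ValueError("target_seconds must be positive")
    if headroom < 1:
        raise ValueError("headroom must be at least 1")
    return math.ceil(target_seconds * tokens_per_second * headroom)


if __name__ == "__main__":
    for seconds in (10, 30, 60):
        print(seconds, "s ->", token_budget(seconds), "tokens")
```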

Qwen3-TTS Custom Voice Common Errors and Solutions:

Model Type Error: You are trying to use 'Custom Voice' with an incompatible model.

  • Explanation: This error occurs when you attempt to use the Qwen3CustomVoice node with a model that does not support custom voice generation.
  • Solution: Load a compatible model, such as Qwen3-TTS-12Hz-1.7B-CustomVoice, to ensure that the node functions correctly.

ValueError: Text input is empty or invalid.

  • Explanation: This error indicates that the input text provided to the node is either empty or not properly formatted.
  • Solution: Verify that the input text is non-empty and correctly formatted before processing it through the node.
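Both errors above can be caught before the node runs. The following is a minimal pre-flight check, assuming that a compatible model can be recognized by "CustomVoice" in its name, as in Qwen3-TTS-12Hz-1.7B-CustomVoice. That name test is a heuristic invented for this example, as is the function name check_custom_voice_inputs; the extension may detect the model type differently.

```python
def check_custom_voice_inputs(text, model_name):
    """Validate inputs before a Custom Voice run; return the trimmed text.

    Raises ValueError with a message mirroring the errors listed above.
    """
    # Reject None, non-strings, and whitespace-only text.
    if not isinstance(text, str) or not text.strip():
        raise ValueError("Text input is empty or invalid.")
    # Heuristic (assumption): compatible models carry "CustomVoice" in the name.
    if "CustomVoice" not in model_name:
        raise ValueError(
            "Model Type Error: 'Custom Voice' requires a CustomVoice model, "
            f"got {model_name!r}."
        )
    return text.strip()


if __name__ == "__main__":
    ok = check_custom_voice_inputs("  Hello there.  ", "Qwen3-TTS-12Hz-1.7B-CustomVoice")
    print(repr(ok))
    try:
        check_custom_voice_inputs("", "Qwen3-TTS-12Hz-1.7B-CustomVoice")
    except ValueError as err:
        print("rejected:", err)
```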

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.