Custom Voice (QwenTTS):
The AILab_Qwen3TTSCustomVoice node generates synthetic speech from text using the QwenTTS system, giving AI artists a way to incorporate unique voice elements into their projects. It produces high-quality audio that mimics human speech and exposes a range of customization options, so the voice output can be tailored to specific artistic needs. The goal of this node is to let users create personalized voice experiences that enhance the auditory dimension of their creative work.
Custom Voice (QwenTTS) Input Parameters:
text
The text parameter is the primary input for the node, representing the textual content that you wish to convert into speech. This parameter directly influences the spoken output, as the node will synthesize audio based on the provided text. There are no specific minimum or maximum values for this parameter, but the length and complexity of the text can affect processing time and the resulting audio quality.
speaker
The speaker parameter allows you to select the voice profile that will be used for the speech synthesis. This parameter is crucial for defining the characteristics of the voice, such as tone, pitch, and style. Different speaker profiles can be chosen to match the desired voice characteristics for your project. The available options depend on the pre-configured speaker profiles within the QwenTTS system.
model_size
The model_size parameter determines the complexity and resource requirements of the model used for speech synthesis. Larger models may produce higher quality audio but require more computational resources. This parameter allows you to balance between audio quality and processing efficiency, depending on your available resources and quality requirements.
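The quality-versus-resources trade-off described above can be sketched as a simple selection helper. The size names and memory figures below are illustrative assumptions, not actual QwenTTS requirements:

```python
# Hypothetical sketch: choosing a model_size that fits available memory.
# The tier names and memory figures are assumptions for illustration,
# not documented QwenTTS values.
MODEL_MEMORY_GB = {"small": 2, "medium": 5, "large": 10}

def pick_model_size(available_gb: float) -> str:
    """Return the largest hypothetical model tier that fits in available_gb."""
    fitting = [name for name, need in MODEL_MEMORY_GB.items()
               if need <= available_gb]
    if not fitting:
        raise MemoryError("No model_size fits the available memory")
    # Prefer the highest-quality (largest) model that still fits.
    return max(fitting, key=MODEL_MEMORY_GB.get)

print(pick_model_size(6))  # "medium": large needs 10 GB, too big for 6
```

A check like this, run before loading, is one way to avoid the "Model loading failed" error discussed later in this page.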
language
The language parameter specifies the language in which the text will be synthesized. This is important for ensuring that the pronunciation and intonation are appropriate for the language of the input text. The QwenTTS system supports multiple languages, and selecting the correct language is essential for accurate and natural-sounding speech output.
instruct
The instruct parameter provides additional guidance or instructions to the speech synthesis model, allowing for further customization of the voice output. This can include specific stylistic or emotional cues that you want the synthesized voice to convey. The use of this parameter can enhance the expressiveness and relevance of the generated speech.
unload_models
The unload_models parameter is a boolean option that determines whether the models should be unloaded from memory after processing. Setting this to True can help manage memory usage, especially when working with large models or limited resources. However, unloading models may increase processing time for subsequent operations.
seed
The seed parameter is used to initialize the random number generator for the synthesis process. By setting a specific seed value, you can ensure that the generated speech is reproducible, which is useful for consistency in iterative projects. If not specified, a random seed will be used, leading to variations in the output.
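The reproducibility behavior described above can be demonstrated with Python's standard random generator standing in for the model's sampling step (real models seed their samplers similarly, e.g. via a framework-level seeding call):

```python
import random

# Sketch of why a fixed seed makes stochastic synthesis reproducible.
# fake_sample stands in for the model's random sampling step.
def fake_sample(seed: int, n: int = 3) -> list:
    rng = random.Random(seed)  # isolated generator, seeded explicitly
    return [rng.random() for _ in range(n)]

a = fake_sample(seed=42)
b = fake_sample(seed=42)
print(a == b)  # True: same seed, identical "synthesis"
```

Fixing the seed is what makes iterative workflows repeatable; leaving it unset yields a different sampling path, and therefore different audio, on each run.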
Custom Voice (QwenTTS) Output Parameters:
audio
The audio parameter is the primary output of the node, representing the synthesized speech in audio format. This output is the result of converting the input text into spoken words, using the specified voice characteristics and language settings. The audio output can be used directly in multimedia projects or further processed for additional effects.
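For downstream processing it helps to know the shape of the output. ComfyUI audio sockets commonly carry a dictionary with a waveform and a sample rate; the sketch below assumes that convention, with a plain list standing in for the usual tensor and an assumed sample rate:

```python
# Hedged sketch of a ComfyUI-style AUDIO payload. The dict layout follows
# the common ComfyUI convention; the 24000 Hz default is an assumption,
# not a documented QwenTTS value.
def make_audio_output(samples, sample_rate=24000):
    return {"waveform": samples, "sample_rate": sample_rate}

audio = make_audio_output([0.0, 0.1, -0.1])
print(audio["sample_rate"])  # 24000
```

Nodes that save or post-process audio would read both keys, so keep them together when passing the output along.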
Custom Voice (QwenTTS) Usage Tips:
- Experiment with different speaker profiles to find the voice that best fits your project's theme or character.
- Use the instruct parameter to add emotional depth or stylistic nuances to the synthesized speech, enhancing the overall impact of the audio.
- Consider the model_size and unload_models parameters to optimize performance based on your available computational resources and desired audio quality.
Custom Voice (QwenTTS) Common Errors and Solutions:
"Model loading failed"
- Explanation: This error may occur if the specified model size is too large for the available system resources.
- Solution: Try reducing the model_size, or ensure that your system has sufficient memory and processing power to handle the selected model.
"Unsupported language"
- Explanation: This error indicates that the chosen language is not supported by the QwenTTS system.
- Solution: Verify that the language parameter is set to a supported language and adjust it accordingly.
"Invalid speaker profile"
- Explanation: This error occurs when the specified speaker profile does not exist or is not configured correctly.
- Solution: Check the available speaker profiles and ensure that the speaker parameter matches one of the valid options.
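A defensive check can catch this error before synthesis starts. The profile names below are placeholders, not actual QwenTTS speakers:

```python
# Sketch of a pre-flight check for the "Invalid speaker profile" error.
# AVAILABLE_SPEAKERS is a placeholder list; in practice it would come
# from the node's configured speaker profiles.
AVAILABLE_SPEAKERS = ["alice", "bob", "narrator"]

def validate_speaker(speaker: str) -> str:
    if speaker not in AVAILABLE_SPEAKERS:
        raise ValueError(
            f"Invalid speaker profile {speaker!r}; "
            f"choose one of {AVAILABLE_SPEAKERS}"
        )
    return speaker

print(validate_speaker("alice"))  # passes validation unchanged
```

Failing fast with a clear message like this is cheaper than discovering the problem mid-synthesis.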
