Voice Design (QwenTTS) Advanced:
AILab_Qwen3TTSVoiceDesign_Advanced is a node for building advanced voice designs in text-to-speech (TTS) projects. It uses QwenTTS models to generate natural-sounding audio from text and is particularly useful for AI artists who want to craft distinctive voice styles and tones, since it exposes a wide range of customization options. The node's primary function is to turn text plus voice instructions into audio output, giving you fine-grained control over characteristics such as tone, pace, and style. That control makes it a valuable tool for projects that require precise voice modulation and design.
Voice Design (QwenTTS) Advanced Input Parameters:
text
This parameter represents the textual content that you want to convert into speech. It is crucial as it forms the basis of the audio output. The text should be clear and concise to ensure accurate voice generation. There are no specific minimum or maximum values, but the quality of the input text directly impacts the clarity and effectiveness of the generated speech.
instruct
The instruct parameter allows you to provide specific instructions or guidelines on how the text should be spoken. This can include details about the desired tone, pace, or style of the voice. It is essential for tailoring the voice output to meet specific artistic or project requirements. Like the text parameter, there are no strict limits, but the instructions should be clear to achieve the desired effect.
model_size
This parameter determines the size of the model used for voice generation. Larger models may offer more nuanced and detailed voice outputs but require more computational resources. The choice of model size can impact both the quality of the audio and the performance of the node.
device
The device parameter specifies the hardware on which the voice generation process will run. Options typically include CPU or GPU, with GPUs generally providing faster processing times. Selecting the appropriate device can optimize the node's performance based on your available resources.
precision
Precision refers to the numerical precision used during the voice generation process. Higher precision can lead to more accurate audio outputs but may also increase computational demands. This parameter allows you to balance quality and performance according to your needs.
language
This parameter sets the language for the voice output. It ensures that the generated speech is in the correct language, which is crucial for projects targeting specific linguistic audiences. The language should be chosen based on the text input and the intended audience.
seed
The seed parameter is used to initialize the random number generator, ensuring reproducibility of results. By setting a specific seed value, you can achieve consistent voice outputs across different runs. The default value is -1, which means no specific seed is set.
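The effect of a fixed seed can be shown with a minimal sketch. This uses Python's standard `random` module as a stand-in for the node's actual sampler, and the `generate_with_seed` helper is hypothetical, not part of the node's API:

```python
import random

def generate_with_seed(seed: int) -> list[float]:
    """Toy stand-in for the node's sampler: the same seed yields the
    same pseudo-random draws, and therefore the same generated voice."""
    # A seed of -1 mirrors the node's default: no fixed seed, so runs differ.
    rng = random.Random(seed if seed != -1 else None)
    return [round(rng.random(), 4) for _ in range(3)]

# The same seed reproduces the same draws across runs:
assert generate_with_seed(42) == generate_with_seed(42)
```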
max_new_tokens
This parameter defines the maximum number of new tokens the model may generate. For a TTS model these are generation tokens rather than whole words, so the limit roughly bounds the duration of the audio output. The default value is 2048 tokens; lowering it shortens the generated speech.
do_sample
A boolean parameter that determines whether sampling is used during voice generation. When set to true, it allows for more varied and creative outputs, while false results in more deterministic outputs. This parameter is useful for exploring different voice styles.
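The difference between sampled and deterministic generation can be sketched on a toy next-token distribution (plain Python, not the node's implementation; the `pick` helper and the example probabilities are illustrative only):

```python
import random

probs = {"hello": 0.6, "hi": 0.3, "hey": 0.1}  # toy next-token distribution

def pick(do_sample: bool, rng: random.Random) -> str:
    if do_sample:
        # Sampling: tokens are drawn in proportion to their probability,
        # so repeated calls can produce different (more varied) outputs.
        return rng.choices(list(probs), weights=list(probs.values()))[0]
    # Greedy decoding: always take the most likely token -> deterministic.
    return max(probs, key=probs.get)

assert pick(False, random.Random(0)) == "hello"  # greedy is always "hello"
```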
top_p
Top-p sampling, also known as nucleus sampling, is controlled by this parameter. It sets the cumulative probability threshold for token selection, with a default value of 0.9. Adjusting top_p can influence the diversity and creativity of the voice output.
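The cumulative-probability cutoff behind top-p can be illustrated with a short sketch of standard nucleus filtering (the function and example distribution are illustrative, not the node's code):

```python
def nucleus(probs: dict[str, float], top_p: float = 0.9) -> dict[str, float]:
    """Keep the smallest set of most-probable tokens whose cumulative
    probability reaches top_p, then renormalize (nucleus sampling)."""
    kept, total = {}, 0.0
    for tok, p in sorted(probs.items(), key=lambda kv: -kv[1]):
        kept[tok] = p
        total += p
        if total >= top_p:
            break
    return {tok: p / total for tok, p in kept.items()}

probs = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}
# With top_p=0.9 the unlikely tail token "d" is cut; a, b, c remain.
filtered = nucleus(probs, top_p=0.9)
```

Lowering `top_p` trims more of the low-probability tail, making outputs more focused; raising it admits more candidates and more variety.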
top_k
This parameter limits the number of tokens considered for each step in the generation process. A lower top_k value results in more focused outputs, while a higher value allows for more diversity. The default is 50, balancing creativity and coherence.
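Top-k filtering is simpler than top-p: it keeps a fixed count of candidates rather than a probability mass. A minimal sketch (illustrative, not the node's code):

```python
def top_k_filter(probs: dict[str, float], k: int = 50) -> dict[str, float]:
    """Keep only the k most probable tokens, renormalized."""
    kept = dict(sorted(probs.items(), key=lambda kv: -kv[1])[:k])
    total = sum(kept.values())
    return {tok: p / total for tok, p in kept.items()}

probs = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}
# top_k=2 keeps only the two most likely tokens, "a" and "b".
filtered = top_k_filter(probs, k=2)
```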
temperature
Temperature controls the randomness of the voice generation process. A higher temperature results in more varied outputs, while a lower temperature produces more predictable results. The default value is 0.9, offering a balance between creativity and stability.
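How temperature sharpens or flattens the token distribution can be shown with a standard temperature-scaled softmax (a sketch using toy logits, not the node's internals):

```python
import math

def softmax_with_temperature(logits: list[float], temperature: float) -> list[float]:
    """Divide logits by the temperature before softmax: T < 1 sharpens the
    distribution (more predictable), T > 1 flattens it (more varied)."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    z = sum(exps)
    return [e / z for e in exps]

logits = [2.0, 1.0, 0.5]
cool = softmax_with_temperature(logits, 0.5)  # peaked on the top logit
hot = softmax_with_temperature(logits, 2.0)   # closer to uniform
assert cool[0] > hot[0]
```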
repetition_penalty
This parameter applies a penalty to repeated tokens, helping to reduce redundancy in the generated speech. A value of 1.0 means no penalty, while higher values discourage repetition. It is useful for ensuring more natural-sounding outputs.
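A common way this penalty is applied (the convention used by Hugging Face-style samplers; whether the node uses exactly this formula is an assumption) is to scale down the logits of tokens that have already appeared:

```python
def apply_repetition_penalty(logits: list[float],
                             generated_ids: list[int],
                             penalty: float = 1.2) -> list[float]:
    """Divide positive logits of already-generated tokens by `penalty`
    (and multiply negative ones), making repeats less likely."""
    out = list(logits)
    for tid in set(generated_ids):
        out[tid] = out[tid] / penalty if out[tid] > 0 else out[tid] * penalty
    return out

logits = [3.0, 1.0, -0.5]
# Tokens 0 and 2 were already generated, so their logits are pushed down:
penalized = apply_repetition_penalty(logits, generated_ids=[0, 2], penalty=1.2)
```

With `penalty=1.0` the logits pass through unchanged, matching the documented no-penalty behavior.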
attention
The attention parameter specifies the attention mechanism used during voice generation. The default setting is "auto," which automatically selects the best attention mechanism based on the model and input parameters. This helps optimize the quality of the audio output.
unload_models
A boolean parameter that determines whether models should be unloaded from memory after use. Setting this to true can free up resources, especially when working with large models or limited hardware. It is useful for managing memory usage effectively.
Voice Design (QwenTTS) Advanced Output Parameters:
audio
The audio parameter is the primary output of the node, representing the generated speech in audio format. This output is crucial as it is the final product of the voice design process, ready for use in various applications. The audio quality and characteristics depend on the input parameters and the model used, making it essential to configure the node correctly to achieve the desired results.
Voice Design (QwenTTS) Advanced Usage Tips:
- Experiment with different instruct values to explore a wide range of voice styles and tones, enhancing the creativity of your projects.
- Adjust the temperature and top_p parameters to balance between creative and coherent outputs, depending on the specific requirements of your project.
- Utilize the seed parameter to ensure reproducibility of results, especially when fine-tuning voice outputs for consistency across different sessions.
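Putting the parameters together, a typical configuration might look like the sketch below. The parameter names follow this document, but the exact values (model size string, device name, language label) are assumptions, not values confirmed by the node's source:

```python
# Hypothetical parameter set for the node, mirroring the inputs
# described above; adjust values to your hardware and project.
params = {
    "text": "Welcome to the gallery.",
    "instruct": "A warm, slow narrator voice with a gentle tone.",
    "model_size": "1.7B",        # assumed label; larger = more nuance, more VRAM
    "device": "cuda",            # or "cpu" if no GPU is available
    "precision": "fp16",         # trades a little accuracy for speed/memory
    "language": "English",
    "seed": 42,                  # fixed seed -> reproducible output
    "max_new_tokens": 2048,
    "do_sample": True,
    "top_p": 0.9,
    "top_k": 50,
    "temperature": 0.9,
    "repetition_penalty": 1.1,
    "attention": "auto",
    "unload_models": False,      # set True to free memory after generation
}
```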
Voice Design (QwenTTS) Advanced Common Errors and Solutions:
"Text and instruct are required"
- Explanation: This error occurs when either the text or instruct parameter is missing or empty.
- Solution: Ensure that both the text and instruct parameters are provided and contain valid content before executing the node.
"Model loading failed"
- Explanation: This error indicates an issue with loading the specified model, possibly due to incorrect model_size or device settings.
- Solution: Verify that the model_size and device parameters are correctly configured and that the necessary resources are available for model loading.
"Invalid language setting"
- Explanation: This error arises when the specified language is not supported or incorrectly mapped.
- Solution: Check the language parameter and ensure it matches one of the supported languages or is correctly mapped in the LANGUAGE_MAP.
