RunComfy

SeedVR2 V2.5 | AI Video Upscaling Workflow

Upscale videos fast with sharp, smooth, cinematic results.

Face Detailer | Fix Faces

Use Face Detailer first for facial restoration, followed by the 4x UltraSharp Model for superior upscaling.

SUPIR | Photo-Realistic Image/Video Upscaler

SUPIR enables photo-realistic image restoration, works with SDXL model, and supports text-prompt enhancement.

Qwen Image Edit | Precise AI Photo Editing

Edit photos fast with style, relighting, and object control precision.

ComfyUI > Nodes > ComfyUI-Qwen-TTS > 千问3TTS

ComfyUI Node: 千问3TTS

Class Name

Qwen3TTS

Category
Qwen

Author
flybirdxx (Account age: 0days) Extension
ComfyUI-Qwen-TTS Latest Updated
2026-03-21 Github Stars
1.3K

Github Ask flybirdxx Current Questions Past Questions

Table of Content

Description
Qwen3TTS:
Qwen3TTS Input Parameters:
Qwen3TTS Output Parameters:
Qwen3TTS Usage Tips:
Qwen3TTS Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-Qwen-TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-Qwen-TTS

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-Qwen-TTS in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

千问3TTS Description

Qwen3TTS converts text to high-quality audio with diverse voice and language options on ComfyUI.

千问3TTS:

Qwen3TTS is a sophisticated text-to-speech (TTS) node designed for the ComfyUI platform, offering seamless integration for converting text into high-quality audio. This node leverages advanced TTS models to provide a wide range of voice options and language support, making it an ideal tool for AI artists and developers looking to add realistic voice synthesis to their projects. With Qwen3TTS, you can easily transform written content into spoken words, choosing from various voices and languages to suit your creative needs. The node is designed to be user-friendly, allowing you to input text and select voice characteristics with ease, while the underlying technology ensures that the output audio is both natural and expressive. Whether you're creating voiceovers, interactive applications, or any project requiring speech synthesis, Qwen3TTS provides the flexibility and quality needed to bring your ideas to life.

千问3TTS Input Parameters:

model_id

The model_id parameter allows you to select the specific TTS model version you wish to use. This choice can impact the quality and characteristics of the generated audio. Available options include "qwen-tts-latest", "qwen-tts-2025-05-22", "qwen-tts-2025-04-10", and "qwen-tts". The default value is set to "qwen-tts-latest", ensuring you have access to the most recent advancements in TTS technology.

content

The content parameter is where you input the text that you want to convert into speech. This parameter supports multiline text, allowing you to input longer passages if needed. The default text is "你好，千问！", and you can use this field to input any text you wish to be spoken. The placeholder "TTS text" serves as a guide for what to enter.

voice

The voice parameter lets you choose from a variety of voice options, each offering a unique tone and style. Options include voices like "Cherry", "Serena", "Ethan", "Chelsie", "Dylan", "Jada", and "Sunny", among others. The default voice is "Sunny", but you can select any voice that best fits the mood or character of your project.

language

The language parameter specifies the language of the input text, ensuring that the TTS engine processes the text correctly. Supported languages include Chinese, English, German, Italian, Portuguese, Spanish, Japanese, Korean, French, and Russian. The default language is "Chinese", but you should select the language that matches your input text to achieve the best results.

千问3TTS Output Parameters:

音频

The 音频 (audio) output parameter provides the generated audio file resulting from the text-to-speech conversion. This audio output is the primary result of the node's operation, allowing you to use it in various applications, such as voiceovers or interactive media.

采样率

The 采样率 (sample rate) output parameter indicates the sample rate of the generated audio. This value is crucial for understanding the quality and fidelity of the audio output, as a higher sample rate generally corresponds to better audio quality.

千问3TTS Usage Tips:

To achieve the most natural-sounding speech, experiment with different voice options and select the one that best matches the tone of your project.
Ensure that the language parameter matches the language of your input text to avoid pronunciation errors and to enhance the clarity of the generated speech.

千问3TTS Common Errors and Solutions:

Invalid model_id selection

Explanation: This error occurs when an unsupported or incorrect model_id is selected.
Solution: Verify that the model_id you have chosen is one of the supported options: "qwen-tts-latest", "qwen-tts-2025-05-22", "qwen-tts-2025-04-10", or "qwen-tts".

Unsupported language input

Explanation: This error arises when the language parameter does not match the language of the input text.
Solution: Ensure that the language parameter is set to the correct language corresponding to your input text to avoid processing errors.

千问3TTS Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-Qwen-TTS

Table of Content

Description
Qwen3TTS:
Qwen3TTS Input Parameters:
Qwen3TTS Output Parameters:
Qwen3TTS Usage Tips:
Qwen3TTS Common Errors and Solutions:
Related Nodes

AnimateDiff + IPAdapter V1 | Image to Video

With IPAdapter, you can efficiently control the generation of animations using reference images.

Wan2.2 S2V | Sound to Video Generator

Turns your audio clip into lifelike, synced video from one image

Flex.1 LoRA Inference | AI Toolkit ComfyUI

Run your AI Toolkit-trained Flex.1 LoRA in ComfyUI with training-matched defaults using a single RC custom node.

LatentSync| Lip Sync Model

Advanced audio-driven lip sync technology.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: 千问3TTS

Qwen3TTS

How to Install ComfyUI-Qwen-TTS

千问3TTS Description

千问3TTS:

千问3TTS Input Parameters:

model_id

content

voice

language

千问3TTS Output Parameters:

音频

采样率

千问3TTS Usage Tips:

千问3TTS Common Errors and Solutions:

Invalid model_id selection

Unsupported language input

千问3TTS Related Nodes