ComfyUI > Nodes > ComfyUI-Qwen-TTS > 千问TTS

ComfyUI Node: 千问TTS

Class Name

QwenTTS

Category
Qwen
Author
flybirdxx (Account age: 0days)
Extension
ComfyUI-Qwen-TTS
Latest Updated
2026-03-21
Github Stars
1.3K

How to Install ComfyUI-Qwen-TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-Qwen-TTS
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-Qwen-TTS in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

千问TTS Description

QwenTTS converts text to natural-sounding speech in multiple languages and voices for ComfyUI.

千问TTS:

QwenTTS is a node designed for ComfyUI that facilitates text-to-speech (TTS) conversion using advanced models. It allows you to transform written content into natural-sounding audio, providing a versatile tool for creating voiceovers, narrations, or any application requiring synthesized speech. The node supports multiple languages and voices, offering flexibility in choosing the desired tone and style for your audio output. By leveraging the capabilities of QwenTTS, you can enhance your projects with high-quality audio that aligns with your creative vision.

千问TTS Input Parameters:

model_id

The model_id parameter specifies the version of the QwenTTS model to be used for generating speech. It allows you to select from various model versions, such as qwen-tts-latest, qwen-tts-2025-05-22, qwen-tts-2025-04-10, and qwen-tts. The default value is set to qwen-tts-latest, ensuring you use the most recent advancements in TTS technology. Choosing a specific model version can impact the quality and characteristics of the generated audio, so selecting the appropriate model for your needs is crucial.

content

The content parameter is where you input the text you wish to convert into speech. It supports multiline text, allowing you to input longer passages or scripts. The default text is set to "你好,千问!", and the placeholder suggests entering "TTS text". This parameter directly influences the spoken content of the audio output, making it essential to provide clear and well-structured text for optimal results.

voice

The voice parameter allows you to choose from a variety of voice options, such as Cherry, Serena, Ethan, Chelsie, Dylan, Jada, and Sunny. The default voice is Sunny. Each voice option offers a unique tone and style, enabling you to tailor the audio output to match the desired emotional or contextual tone of your project. Selecting the right voice can significantly enhance the listener's experience.

千问TTS Output Parameters:

音频

The 音频 (audio) output parameter provides the generated audio file resulting from the text-to-speech conversion. This audio file is the primary output of the QwenTTS node, containing the spoken version of the input text. It is essential for applications where audio playback is required, such as podcasts, video narrations, or interactive media.

采样率

The 采样率 (sample rate) output parameter indicates the sample rate of the generated audio. The sample rate is a critical factor in determining the audio quality, with higher sample rates generally providing better sound fidelity. This parameter helps ensure that the audio output is compatible with various playback systems and meets the desired quality standards for your project.

千问TTS Usage Tips:

  • Experiment with different voice options to find the one that best suits the tone and style of your project. Each voice has unique characteristics that can enhance the emotional impact of your audio.
  • Use the model_id parameter to select the most appropriate model version for your needs. Newer models may offer improved audio quality and naturalness, so consider using the latest version for the best results.
  • Ensure that the content parameter is well-structured and free of errors, as this directly affects the clarity and coherence of the generated speech.

千问TTS Common Errors and Solutions:

Invalid model_id

  • Explanation: This error occurs when an unsupported or incorrect model ID is specified in the model_id parameter.
  • Solution: Verify that the model ID is one of the supported options: qwen-tts-latest, qwen-tts-2025-05-22, qwen-tts-2025-04-10, or qwen-tts.

Unsupported voice selection

  • Explanation: This error arises when a voice option not listed in the voice parameter is selected.
  • Solution: Ensure that the voice selection is one of the available options: Cherry, Serena, Ethan, Chelsie, Dylan, Jada, or Sunny.

Text input too long

  • Explanation: This error may occur if the text input in the content parameter exceeds the maximum allowed length.
  • Solution: Break down the text into smaller segments and process them separately to avoid exceeding the input limit.

千问TTS Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-Qwen-TTS
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

千问TTS