千问3TTS:
Qwen3TTS (displayed in ComfyUI as 千问3TTS) is a text-to-speech (TTS) node for ComfyUI that converts text into audio. You enter the text, pick a model version, a voice, and a language, and the node returns the generated audio together with its sample rate. It is suited to voiceovers, interactive applications, and other projects that need speech synthesis.
千问3TTS Input Parameters:
model_id
The model_id parameter selects the TTS model version, which can affect the quality and characteristics of the generated audio. Available options are "qwen-tts-latest", "qwen-tts-2025-05-22", "qwen-tts-2025-04-10", and "qwen-tts". The default value is "qwen-tts-latest".
content
The content parameter holds the text you want to convert into speech. It accepts multiline text, so longer passages are supported. The default text is "你好,千问!" ("Hello, Qwen!"), and the placeholder "TTS text" indicates what to enter.
voice
The voice parameter lets you choose from a variety of voice options, each offering a unique tone and style. Options include voices like "Cherry", "Serena", "Ethan", "Chelsie", "Dylan", "Jada", and "Sunny", among others. The default voice is "Sunny", but you can select any voice that best fits the mood or character of your project.
language
The language parameter specifies the language of the input text, ensuring that the TTS engine processes the text correctly. Supported languages include Chinese, English, German, Italian, Portuguese, Spanish, Japanese, Korean, French, and Russian. The default language is "Chinese", but you should select the language that matches your input text to achieve the best results.
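The four inputs above can be sketched in the form ComfyUI custom nodes use to declare their inputs. This is a minimal illustration, not the node's actual source: the class name Qwen3TTSSketch is hypothetical, and the voice list contains only the voices named in this document (the real node offers more).

```python
# Illustrative sketch only; not the node's actual source code.
MODEL_IDS = ["qwen-tts-latest", "qwen-tts-2025-05-22", "qwen-tts-2025-04-10", "qwen-tts"]
# Only the voices named in this document; the real list is longer.
VOICES = ["Cherry", "Serena", "Ethan", "Chelsie", "Dylan", "Jada", "Sunny"]
LANGUAGES = ["Chinese", "English", "German", "Italian", "Portuguese",
             "Spanish", "Japanese", "Korean", "French", "Russian"]


class Qwen3TTSSketch:
    """Hypothetical class showing how the documented inputs could be declared."""

    @classmethod
    def INPUT_TYPES(cls):
        # ComfyUI convention: a list of strings renders as a dropdown,
        # and "STRING" with multiline=True renders as a text area.
        return {
            "required": {
                "model_id": (MODEL_IDS, {"default": "qwen-tts-latest"}),
                "content": ("STRING", {
                    "multiline": True,
                    "default": "你好,千问!",
                    "placeholder": "TTS text",
                }),
                "voice": (VOICES, {"default": "Sunny"}),
                "language": (LANGUAGES, {"default": "Chinese"}),
            }
        }


def default_inputs():
    """Collect the default value of every declared input."""
    spec = Qwen3TTSSketch.INPUT_TYPES()["required"]
    return {name: options["default"] for name, (_, options) in spec.items()}
```

Calling default_inputs() returns the documented defaults, which is a quick way to confirm what the node would send if you change nothing.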
千问3TTS Output Parameters:
音频
The 音频 (audio) output parameter provides the generated audio file resulting from the text-to-speech conversion. This audio output is the primary result of the node's operation, allowing you to use it in various applications, such as voiceovers or interactive media.
采样率
The 采样率 (sample rate) output reports the sample rate of the generated audio, in samples per second (Hz). Downstream nodes and tools need this value to play back, save, or resample the audio at the correct speed and pitch. A higher sample rate can also represent higher frequencies, which generally corresponds to better fidelity.
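To show why the sample rate matters downstream, here is a small sketch using only the Python standard library. It assumes you already have raw 16-bit PCM samples as bytes, which is an assumption for illustration; the node's actual audio format is not specified here, and the value 24000 in the usage note is an arbitrary example.

```python
import io
import wave


def duration_seconds(num_samples: int, sample_rate: int) -> float:
    """Length of a clip: samples per channel divided by samples per second."""
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    return num_samples / sample_rate


def pcm16_to_wav_bytes(pcm: bytes, sample_rate: int, channels: int = 1) -> bytes:
    """Wrap raw 16-bit PCM (assumed format) in a WAV container."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav_file.setframerate(sample_rate)  # wrong value here changes speed and pitch
        wav_file.writeframes(pcm)
    return buffer.getvalue()
```

For example, pcm16_to_wav_bytes(b"\x00\x00" * 24000, 24000) produces one second of silence. Writing the same samples with a different sample rate would make the clip play faster or slower.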
千问3TTS Usage Tips:
- To achieve the most natural-sounding speech, experiment with different voice options and select the one that best matches the tone of your project.
- Ensure that the language parameter matches the language of your input text to avoid pronunciation errors and to enhance the clarity of the generated speech.
千问3TTS Common Errors and Solutions:
Invalid model_id selection
- Explanation: This error occurs when an unsupported or incorrect model_id is selected.
- Solution: Verify that the model_id you have chosen is one of the supported options: "qwen-tts-latest", "qwen-tts-2025-05-22", "qwen-tts-2025-04-10", or "qwen-tts".
Unsupported language input
- Explanation: This error arises when the language parameter does not match the language of the input text.
- Solution: Set the language parameter to the language of your input text, choosing from the supported options: Chinese, English, German, Italian, Portuguese, Spanish, Japanese, Korean, French, or Russian.
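If you build workflows programmatically, both checks above can be run before queuing the node. The following is a minimal sketch: validate_inputs is a hypothetical helper, not part of the node, and it only compares values against the lists given in this document. It cannot detect whether the text is actually written in the selected language.

```python
SUPPORTED_MODEL_IDS = (
    "qwen-tts-latest",
    "qwen-tts-2025-05-22",
    "qwen-tts-2025-04-10",
    "qwen-tts",
)
SUPPORTED_LANGUAGES = (
    "Chinese", "English", "German", "Italian", "Portuguese",
    "Spanish", "Japanese", "Korean", "French", "Russian",
)


def validate_inputs(model_id: str, language: str) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if model_id not in SUPPORTED_MODEL_IDS:
        problems.append(
            f"Invalid model_id {model_id!r}; expected one of {SUPPORTED_MODEL_IDS}"
        )
    if language not in SUPPORTED_LANGUAGES:
        problems.append(
            f"Unsupported language {language!r}; expected one of {SUPPORTED_LANGUAGES}"
        )
    return problems


if __name__ == "__main__":
    # Hypothetical bad values, used only to show the messages.
    for problem in validate_inputs("qwen-tts-2024", "Dutch"):
        print(problem)
```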
