RunComfy

Wan 2.2 FLF2V | First-Last Frame Video Generation

Generate smooth videos from a start and end frame using Wan 2.2 FLF2V.

Image Bypass | Smart Image Detection Bypass Utility Workflow

Skip limits and process images faster with total creative control.

FLUX Img2Img | Merge Visuals and Prompts

Merge visuals and prompts for stunning, enhanced results.

FLUX.2 Klein Unified Image Editing | Smart Inpaint, Outpaint & Remove

Flawless editing. Remove, fill, and extend any image fast.

ComfyUI > Nodes > civitai-comfy-nodes > qwen3 / customVoice

ComfyUI Node: qwen3 / customVoice

Class Name

CivitaiTextToSpeechVllmOmniQwen3CustomVoice

Category
Civitai/Audio/qwen3

Author
civitai (Account age: 1322days) Extension
civitai-comfy-nodes Latest Updated
2026-06-18 Github Stars
0.02K

Github Ask civitai Current Questions Past Questions

Table of Content

Description
CivitaiTextToSpeechVllmOmniQwen3CustomVoice:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Input Parameters:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Output Parameters:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Usage Tips:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Common Errors and Solutions:
Related Nodes

How to Install civitai-comfy-nodes

Install this extension via the ComfyUI Manager by searching for civitai-comfy-nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter civitai-comfy-nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

qwen3 / customVoice Description

Sophisticated node for text-to-speech conversion with customizable voice synthesis and speaker options.

qwen3 / customVoice:

CivitaiTextToSpeechVllmOmniQwen3CustomVoice is a sophisticated node designed to convert text into speech using the Civitai Orchestration platform. This node leverages the vllm-omni engine within the qwen3 ecosystem to provide a customizable voice synthesis experience. It allows you to generate audio outputs from text inputs, offering a range of built-in speaker options to tailor the voice output to your specific needs. The node is particularly beneficial for creating personalized audio content, enabling you to specify language preferences and style instructions to achieve the desired speech characteristics. Its primary goal is to facilitate the seamless transformation of written content into high-quality spoken audio, making it an invaluable tool for AI artists and content creators looking to enhance their projects with custom voiceovers.

qwen3 / customVoice Input Parameters:

text

This parameter represents the text you wish to convert into speech. It is a required input and should be provided as a string. The text can be multiline, allowing for more extensive content to be synthesized. The quality and clarity of the output audio will depend on the text provided.

language

The language parameter allows you to specify the target language for the speech synthesis. It accepts a string value, such as "English" or "Chinese," and defaults to "Auto" if not specified. This parameter ensures that the synthesized speech matches the linguistic characteristics of the desired language.

max_new_tokens

This optional parameter sets a cap on the maximum number of tokens generated during the speech synthesis process. It is an integer value with a default of 0, meaning no cap is applied. The minimum value is 0, and the maximum is 2,147,483,647. Adjusting this parameter can help manage the length and complexity of the generated speech.

speaker

The speaker parameter allows you to choose from a list of built-in speaker names, such as "aiden," "dylan," "eric," and others. This selection determines the voice characteristics used in the CustomVoice mode, enabling you to personalize the audio output to suit your project's needs.

instruct

This optional parameter provides style instructions for the speech synthesis, such as "speak slowly and clearly." It accepts a string value and allows you to influence the delivery style of the generated speech, adding an extra layer of customization to the audio output.

api_config

The api_config parameter is optional and is used to configure the Civitai Auth connection. It defaults to using the CIVITAI_API_TOKEN or a stored OAuth login. This configuration is necessary for authenticating and authorizing access to the Civitai platform's resources.

qwen3 / customVoice Output Parameters:

audio_blob

This output parameter contains the synthesized audio in a binary format. It represents the actual speech generated from the input text, ready for playback or further processing.

model_type

The model_type output provides a string indicating the type of model used for the speech synthesis. This information can be useful for understanding the underlying technology and capabilities of the generated audio.

speaker

This output returns the name of the speaker used in the synthesis process. It confirms the voice characteristics applied to the audio output, ensuring that the desired speaker was utilized.

workflow_id

The workflow_id is a string that uniquely identifies the workflow instance used for the text-to-speech conversion. It can be helpful for tracking and managing different synthesis tasks within the Civitai platform.

raw_json

This output provides a raw JSON string containing detailed information about the synthesis process. It includes metadata and other relevant data that can be useful for debugging or analyzing the text-to-speech operation.

qwen3 / customVoice Usage Tips:

Experiment with different speaker options to find the voice that best suits your project's tone and style.
Use the instruct parameter to fine-tune the delivery style of the speech, such as adding pauses or emphasizing certain words.
Adjust the max_new_tokens parameter to control the length of the generated speech, especially for longer texts.

qwen3 / customVoice Common Errors and Solutions:

"Invalid API Configuration"

Explanation: This error occurs when the api_config parameter is not correctly set, preventing authentication with the Civitai platform.
Solution: Ensure that the api_config is properly configured with a valid CIVITAI_API_TOKEN or OAuth login credentials.

"Unsupported Language"

Explanation: The specified language is not supported by the text-to-speech engine.
Solution: Verify that the language parameter is set to a supported language, such as "English" or "Chinese," or use the default "Auto" setting.

"Speaker Not Found"

Explanation: The chosen speaker name does not exist in the list of available options.
Solution: Double-check the speaker parameter to ensure it matches one of the available speaker names, such as "aiden" or "serena."

qwen3 / customVoice Related Nodes

Go back to the extension to check out more related nodes.

civitai-comfy-nodes

Table of Content

Description
CivitaiTextToSpeechVllmOmniQwen3CustomVoice:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Input Parameters:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Output Parameters:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Usage Tips:
CivitaiTextToSpeechVllmOmniQwen3CustomVoice Common Errors and Solutions:
Related Nodes

Wan 2.2 VACE | Pose-Controlled Video Generator

Turn still images into stunning motion with pose-based control.

LTX-2 ComfyUI | Real-Time Video Generator

Create real-time videos instantly, faster than any other generator.

CHORD Model | AI PBR Texture Generator

Turns images into true PBR texture maps fast.

PuLID Flux II | Consistent Character Generation

Generate images with precise character control while preserving artistic style.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: qwen3 / customVoice

CivitaiTextToSpeechVllmOmniQwen3CustomVoice

How to Install civitai-comfy-nodes

qwen3 / customVoice Description

qwen3 / customVoice:

qwen3 / customVoice Input Parameters:

text

language

max_new_tokens

speaker

instruct

api_config

qwen3 / customVoice Output Parameters:

audio_blob

model_type

speaker

workflow_id

raw_json

qwen3 / customVoice Usage Tips:

qwen3 / customVoice Common Errors and Solutions:

"Invalid API Configuration"

"Unsupported Language"

"Speaker Not Found"

qwen3 / customVoice Related Nodes