RunComfy

Flux Klein Face Swap | Realistic AI Face Editor

Swap faces perfectly. Natural, lifelike, and fast AI-powered editing.

Qwen Image 2512 | Precision AI Image Generator

Ultra-detailed art creation with next-level visual accuracy and control.

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

FLUX Kontext Face Swap | Seamless Face Replacement

Photoreal face replacement with prompt-guided control and natural blending

ComfyUI > Nodes > civitai-comfy-nodes > omnivoice

ComfyUI Node: omnivoice

Class Name

CivitaiTextToSpeechVllmOmniOmnivoice

Category
Civitai/Audio/omnivoice

Author
civitai (Account age: 1322days) Extension
civitai-comfy-nodes Latest Updated
2026-06-18 Github Stars
0.02K

Github Ask civitai Current Questions Past Questions

Table of Content

Description
CivitaiTextToSpeechVllmOmniOmnivoice:
CivitaiTextToSpeechVllmOmniOmnivoice Input Parameters:
CivitaiTextToSpeechVllmOmniOmnivoice Output Parameters:
CivitaiTextToSpeechVllmOmniOmnivoice Usage Tips:
CivitaiTextToSpeechVllmOmniOmnivoice Common Errors and Solutions:
Related Nodes

How to Install civitai-comfy-nodes

Install this extension via the ComfyUI Manager by searching for civitai-comfy-nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter civitai-comfy-nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

omnivoice Description

Powerful node for text-to-speech conversion using vllm-omni engine in Civitai Orchestration suite.

omnivoice:

CivitaiTextToSpeechVllmOmniOmnivoice is a powerful node designed to convert text into speech using the advanced capabilities of the vllm-omni engine within the omnivoice ecosystem. This node is part of the Civitai Orchestration suite, which focuses on providing high-quality audio outputs from textual inputs. The primary goal of this node is to facilitate seamless text-to-speech conversion, making it an invaluable tool for AI artists and developers who need to generate audio content efficiently. By leveraging the omnivoice technology, this node ensures that the generated speech is natural and expressive, enhancing the overall user experience. Its integration into the Civitai ecosystem allows for easy orchestration and customization, making it suitable for a wide range of applications, from voiceovers to interactive audio experiences.

omnivoice Input Parameters:

text

The text parameter is the core input for the node, representing the textual content that you wish to convert into speech. This parameter directly influences the audio output, as it dictates the words and phrases that will be spoken. There are no specific minimum or maximum values for this parameter, but the length of the text may impact processing time and the resulting audio file size.

language

The language parameter specifies the language in which the text should be spoken. This is crucial for ensuring that the pronunciation and intonation are appropriate for the target language. The available options typically include a range of common languages, and selecting the correct one is essential for achieving natural-sounding speech.

ref_audio_url

The ref_audio_url parameter allows you to provide a reference audio file URL. This can be used to guide the speech synthesis process, potentially influencing the style or tone of the generated audio. While not mandatory, using a reference audio can enhance the customization of the speech output.

ref_text

The ref_text parameter serves as a reference text that can be used alongside the main text input. This can be particularly useful for maintaining consistency in style or tone when generating speech for multiple related texts. It helps in aligning the speech synthesis with specific textual nuances.

instruct

The instruct parameter is used to provide additional instructions or guidelines for the speech synthesis process. This can include directives on tone, pace, or emphasis, allowing for a more tailored audio output. Proper use of this parameter can significantly enhance the expressiveness and clarity of the generated speech.

omnivoice Output Parameters:

audio_blob

The audio_blob output is the primary result of the node, containing the synthesized speech in audio format. This output is crucial as it represents the final product that can be used in various applications, such as voiceovers or interactive media.

model_type

The model_type output provides information about the specific model used for the text-to-speech conversion. This can be useful for understanding the characteristics of the generated audio and for debugging or optimization purposes.

speaker

The speaker output indicates the voice or speaker profile used in the speech synthesis. This is important for applications where specific voice characteristics are required, such as gender or accent.

workflow_id

The workflow_id output is a unique identifier for the specific text-to-speech conversion process. This can be useful for tracking and managing multiple audio generation tasks within a larger workflow.

raw_json

The raw_json output contains the raw data and metadata associated with the text-to-speech process. This can be valuable for advanced users who need to analyze or manipulate the underlying data for further customization or integration.

omnivoice Usage Tips:

Ensure that the language parameter is set correctly to match the text input for optimal pronunciation and intonation.
Utilize the instruct parameter to fine-tune the expressiveness of the speech output, especially for applications requiring specific emotional tones.
Consider using the ref_audio_url and ref_text parameters to maintain consistency across multiple audio outputs, especially in projects with recurring themes or characters.

omnivoice Common Errors and Solutions:

"Invalid language selection"

Explanation: This error occurs when the specified language is not supported by the node.
Solution: Verify that the language parameter is set to one of the supported languages and adjust it accordingly.

"Text input too long"

Explanation: The text input exceeds the processing capacity of the node.
Solution: Break down the text into smaller segments and process them individually to avoid exceeding the length limit.

"Reference audio URL not accessible"

Explanation: The provided reference audio URL is invalid or cannot be accessed.
Solution: Check the URL for correctness and ensure that it is publicly accessible or properly authenticated if required.

omnivoice Related Nodes

Go back to the extension to check out more related nodes.

civitai-comfy-nodes

Table of Content

Description
CivitaiTextToSpeechVllmOmniOmnivoice:
CivitaiTextToSpeechVllmOmniOmnivoice Input Parameters:
CivitaiTextToSpeechVllmOmniOmnivoice Output Parameters:
CivitaiTextToSpeechVllmOmniOmnivoice Usage Tips:
CivitaiTextToSpeechVllmOmniOmnivoice Common Errors and Solutions:
Related Nodes

Flux Upscaler - Ultimate 32k | Image Upscaler

Flux Upscaler – Achieve 4k, 8k, 16k, and Ultimate 32k Resolution!

Cosmos-Predict2 | Text2Image & Video2World

Fast and real! NVIDIA Cosmos with true physics.

HiDream E1.1 | AI Image Editing

Edit images with natural language using HiDream E1.1 model

Wan Alpha | Transparent Video Generator

Alpha magic: instant transparent background videos for VFX and design.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: omnivoice

CivitaiTextToSpeechVllmOmniOmnivoice

How to Install civitai-comfy-nodes

omnivoice Description

omnivoice:

omnivoice Input Parameters:

text

language

ref_audio_url

ref_text

instruct

omnivoice Output Parameters:

audio_blob

model_type

speaker

workflow_id

raw_json

omnivoice Usage Tips:

omnivoice Common Errors and Solutions:

"Invalid language selection"

"Text input too long"

"Reference audio URL not accessible"

omnivoice Related Nodes