ComfyUI Node: omnivoice

Class Name: CivitaiTextToSpeechVllmOmniOmnivoice
Category: Civitai/Audio/omnivoice
Author: civitai (Account age: 1322 days)
Extension: civitai-comfy-nodes
Last Updated: 2026-06-18
GitHub Stars: 0.02K

How to Install civitai-comfy-nodes

Install this extension through the ComfyUI Manager:
  1. Click the Manager button in the main menu.
  2. Select the Custom Nodes Manager button.
  3. Enter civitai-comfy-nodes in the search bar and install the extension from the results.
After installation, click the Restart button to restart ComfyUI, then manually refresh your browser to clear the cache and load the updated list of nodes.


omnivoice Description

Text-to-speech node that uses the vllm-omni engine, part of the Civitai Orchestration suite.

omnivoice:

CivitaiTextToSpeechVllmOmniOmnivoice converts text into speech using the vllm-omni engine. It is part of the Civitai Orchestration suite, which handles orchestration of the generation job. Besides the text itself, the node accepts a target language, an optional reference audio URL and reference text, and free-form instructions for tone, pace, or emphasis. This makes it suited to tasks such as voiceovers and interactive audio, where the generated speech should sound natural and expressive.

omnivoice Input Parameters:

text

The text parameter is the core input for the node: the content you want converted into speech. It determines the words and phrases that are spoken. No minimum or maximum length is documented, but longer text may increase processing time and audio file size, and very long input can trigger the "Text input too long" error described under Common Errors and Solutions.

language

The language parameter specifies the language in which the text should be spoken, so that pronunciation and intonation suit the target language. The supported languages are not listed here; choose from the options the node offers, and make sure the selection matches the language of the text input.

ref_audio_url

The ref_audio_url parameter takes the URL of a reference audio file. The reference can guide the synthesis process and may influence the style or tone of the generated audio. It is optional, but supplying one gives you more control over how the speech output sounds.

ref_text

The ref_text parameter serves as a reference text that can be used alongside the main text input. This can be particularly useful for maintaining consistency in style or tone when generating speech for multiple related texts. It helps in aligning the speech synthesis with specific textual nuances.

instruct

The instruct parameter carries additional instructions for the speech synthesis process, such as directives on tone, pace, or emphasis. Used well, it can make the generated speech more expressive and clearer.
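
As a rough illustration of how the five inputs fit together, the sketch below builds one node entry in ComfyUI's API-format workflow JSON (a mapping of node IDs to `class_type` and `inputs`). The helper name `build_omnivoice_node`, the node ID "1", the language value "English", and the use of empty strings for unused optional inputs are all assumptions made for this example; the values the node actually accepts are not documented here.

```python
import json


def build_omnivoice_node(text, language, ref_audio_url="", ref_text="", instruct=""):
    """Return one node entry for an API-format ComfyUI workflow.

    Hypothetical helper: the input names come from this page, but the
    accepted values (for example the language options) are assumptions.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "class_type": "CivitaiTextToSpeechVllmOmniOmnivoice",
        "inputs": {
            "text": text,
            "language": language,
            "ref_audio_url": ref_audio_url,
            "ref_text": ref_text,
            "instruct": instruct,
        },
    }


# "1" is an arbitrary node ID; "English" is an assumed language value.
workflow = {
    "1": build_omnivoice_node(
        "Hello there.", "English", instruct="calm, slow pace"
    )
}
print(json.dumps(workflow, indent=2))
```

Building the entry through a small function keeps the input names in one place, which helps when the same node is reused for several lines of dialogue.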

omnivoice Output Parameters:

audio_blob

The audio_blob output is the primary result of the node: the synthesized speech as audio data. It is the output you pass on to downstream nodes or use in applications such as voiceovers and interactive media.

model_type

The model_type output provides information about the specific model used for the text-to-speech conversion. This can be useful for understanding the characteristics of the generated audio and for debugging or optimization purposes.

speaker

The speaker output indicates the voice or speaker profile used in the speech synthesis. This is important for applications where specific voice characteristics are required, such as gender or accent.

workflow_id

The workflow_id output is a unique identifier for the specific text-to-speech conversion process. This can be useful for tracking and managing multiple audio generation tasks within a larger workflow.

raw_json

The raw_json output contains the raw data and metadata associated with the text-to-speech process. This can be valuable for advanced users who need to analyze or manipulate the underlying data for further customization or integration.
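
Because the structure of raw_json is not documented here, a reasonable first step is to list which keys it contains before relying on any of them. The sketch below is one way to do that, assuming raw_json arrives as a JSON string. The function name `summarize_json` and the sample payload are made up for illustration and do not reflect the node's real output.

```python
import json


def summarize_json(raw, max_depth=2):
    """Return 'path: type' lines describing a JSON string, down to max_depth."""
    data = json.loads(raw)
    lines = []

    def walk(value, path, depth):
        lines.append(f"{path or '$'}: {type(value).__name__}")
        if depth >= max_depth:
            return
        if isinstance(value, dict):
            for key, child in value.items():
                walk(child, f"{path}.{key}" if path else key, depth + 1)
        elif isinstance(value, list) and value:
            # Only the first element is inspected, as a representative sample.
            walk(value[0], f"{path}[0]", depth + 1)

    walk(data, "", 0)
    return lines


# Hypothetical payload, NOT the node's real schema.
sample = '{"status": "done", "outputs": [{"kind": "audio"}]}'
for line in summarize_json(sample):
    print(line)
```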

omnivoice Usage Tips:

  • Ensure that the language parameter is set correctly to match the text input for optimal pronunciation and intonation.
  • Utilize the instruct parameter to fine-tune the expressiveness of the speech output, especially for applications requiring specific emotional tones.
  • Consider using the ref_audio_url and ref_text parameters to maintain consistency across multiple audio outputs, especially in projects with recurring themes or characters.

omnivoice Common Errors and Solutions:

"Invalid language selection"

  • Explanation: This error occurs when the specified language is not supported by the node.
  • Solution: Verify that the language parameter is set to one of the supported languages and adjust it accordingly.

"Text input too long"

  • Explanation: The text input exceeds the processing capacity of the node.
  • Solution: Break down the text into smaller segments and process them individually to avoid exceeding the length limit.
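
Splitting the text can be done before it reaches the node. The sketch below groups whole sentences into chunks that stay under a character limit, and hard-splits any single sentence that is longer than the limit. The function name `split_text` and the default of 500 characters are assumptions for illustration; the node's real length limit is not stated here.

```python
import re


def split_text(text, max_chars=500):
    """Split text into chunks of at most max_chars, preferring sentence breaks.

    max_chars=500 is a placeholder; the node's actual limit is not documented.
    """
    # Break after ., ! or ? when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut at fixed widths.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


for chunk in split_text("One. Two. Three.", max_chars=10):
    print(repr(chunk))
```

Each chunk can then be sent through the node separately and the resulting audio clips joined in order.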

"Reference audio URL not accessible"

  • Explanation: The provided reference audio URL is invalid or cannot be accessed.
  • Solution: Check the URL for correctness and ensure that it is publicly accessible or properly authenticated if required.
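
A quick local check can catch malformed or unreachable URLs before a job is submitted. The sketch below uses only the Python standard library: it first checks that the URL is a well-formed http(s) URL, then sends a HEAD request. The function names are hypothetical. The check runs from your own machine, so a URL that passes here may still be unreachable from the service that runs the node, and some servers reject HEAD requests even when a normal download would succeed.

```python
import http.client
import urllib.request
from urllib.parse import urlparse


def is_wellformed_http_url(url):
    """Return True if url has an http or https scheme and a host part."""
    parts = urlparse(url.strip())
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def is_reachable(url, timeout=10.0):
    """Return True if a HEAD request to url answers with a 2xx status."""
    if not is_wellformed_http_url(url):
        return False
    request = urllib.request.Request(url.strip(), method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return 200 <= response.status < 300
    except (OSError, ValueError, http.client.HTTPException):
        # URLError and HTTPError are subclasses of OSError, as are timeouts.
        return False


print(is_wellformed_http_url("https://example.com/voice.wav"))
print(is_wellformed_http_url("voice.wav"))
```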

Copyright 2025 RunComfy. All Rights Reserved.
