Flux 2 Dev | Photoreal Text-to-Image Generator

Next-level image realism with advanced generation control power

Mix 13 art styles instantly or plug in custom LoRAs!

FLUX.1 Dev LoRA Inference | AI Toolkit ComfyUI

Run your AI Toolkit-trained FLUX.1 Dev LoRA in ComfyUI with training-matched behavior using a single RCFluxDev custom node.

Flux Fill | Inpaint and Outpaint

Official Flux Tools - Flux Fill for Inpainting and Outpainting

ComfyUI > Nodes > ComfyUI-Index-TTS > Index TTS

ComfyUI Node: Index TTS

Class Name

IndexTTSNode

Category
audio

Author
chenpipi0807 (Account age: 609days) Extension
ComfyUI-Index-TTS Latest Updated
2025-05-11 Github Stars
0.07K

Github Ask chenpipi0807 Current Questions Past Questions

Table of Content

Description
IndexTTSNode:
IndexTTSNode Input Parameters:
IndexTTSNode Output Parameters:
IndexTTSNode Usage Tips:
IndexTTSNode Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-Index-TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-Index-TTS

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-Index-TTS in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Index TTS Description

ComfyUI IndexTTSNode: Converts text to speech with customization options for language, speed, and voice style.

Index TTS:

The IndexTTSNode is a component of the ComfyUI system designed to facilitate text-to-speech (TTS) synthesis. This node leverages the capabilities of the IndexTTS model to convert written text into spoken audio, providing a seamless way to generate speech from text inputs. It is particularly useful for applications that require voice synthesis, such as virtual assistants, audiobooks, or any interactive system that benefits from auditory feedback. The node allows for customization of the speech output by adjusting parameters like language and speed, making it versatile for different linguistic and pacing needs. By using a reference audio, it can also mimic the style or tone of a specific voice, enhancing the personalization of the generated speech.

Index TTS Input Parameters:

text

This parameter accepts a string input, which is the text you want to convert into speech. It supports multiline text, allowing for longer passages to be synthesized. The default text is "你好，我是IndexTTS语音合成系统。" which translates to "Hello, I am the IndexTTS speech synthesis system." This parameter is crucial as it forms the basis of the audio output.

reference_audio

This parameter takes an audio file as input, which serves as a reference for the voice style or tone that the synthesized speech should emulate. By providing a reference audio, you can achieve a more personalized and consistent voice output that matches the desired characteristics.

language

This parameter specifies the language of the text input. It offers options such as "auto," "zh" (Chinese), "en" (English), "ja" (Japanese), and "ko" (Korean), with "auto" being the default setting. Selecting the correct language ensures that the text is pronounced accurately according to the linguistic rules of the chosen language.

speed

This parameter controls the speed of the synthesized speech. It is a float value with a default of 1.0, representing normal speed. The speed can be adjusted between a minimum of 0.5 and a maximum of 2.0, with increments of 0.1. Modifying the speed allows you to tailor the pacing of the speech to suit different contexts or preferences.

Index TTS Output Parameters:

synthesized_audio

This output parameter provides the generated audio file, which is the result of the text-to-speech conversion process. The synthesized audio reflects the input text, reference audio style, language, and speed settings, delivering a customized speech output that can be used in various applications.

Index TTS Usage Tips:

To achieve a more natural and personalized voice output, use a high-quality reference audio that closely matches the desired voice characteristics.
Experiment with different speed settings to find the optimal pacing for your specific application, whether it requires a slower, more deliberate speech or a faster, more dynamic delivery.
Ensure that the language parameter is set correctly to match the text input, as this will significantly impact the pronunciation and overall quality of the synthesized speech.

Index TTS Common Errors and Solutions:

ImportError: 导入IndexTTS相关模块失败

Explanation: This error indicates that the necessary IndexTTS modules could not be imported, possibly due to missing dependencies.
Solution: Verify that all required dependencies for the IndexTTS module are installed. You may need to check the installation instructions or documentation for any additional packages that need to be installed.

NotImplementedError: 自定义实现尚未完成，请安装原始的IndexTTS模块

Explanation: This error occurs when the custom implementation of the IndexTTS is not complete, and the original IndexTTS module is not installed.
Solution: Install the original IndexTTS module to ensure that the node can function correctly. This may involve downloading and setting up the official IndexTTS package from a reliable source.

Index TTS Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-Index-TTS

Table of Content

Description
IndexTTSNode:
IndexTTSNode Input Parameters:
IndexTTSNode Output Parameters:
IndexTTSNode Usage Tips:
IndexTTSNode Common Errors and Solutions:
Related Nodes

Flux PuLID for Face Swapping

Take your face swapping projects to new heights with Flux PuLID.

Stable Video Infinity 2.0 | Long-Form Video Generator

Create long, smooth, story-driven AI videos effortlessly.

ComfyUI Grounding | Object Tracking Workflow

Track any subject with pixel-perfect accuracy for stunning VFX results.

Qwen Image Edit | Precise AI Photo Editing

Edit photos fast with style, relighting, and object control precision.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.