RunComfy

FLUX.2 [klein] 4B & 9B | Ultra-Fast Flux Image Generator

Blazing-fast visual creation with unified editing control.

Wan 2.2 Video Restyle | First Frame Restyle for Consistent and Cinematic Video Generation

Change the first frame, folks, your style makes the whole video look amazing. Pure magic.

Hunyuan3D 2.1 | Image to 3D Model

Big jump from 2.0: Turn photos into incredible 3D models instantly.

FLUX Dev ControlNet | Multi-Condition ControlNet

Controlled FLUX Dev image generation with Pose, Depth, Canny, and ReColor

ComfyUI > Nodes > ComfyUI-KaniTTS

ComfyUI Extension: ComfyUI-KaniTTS

Repo Name

ComfyUI-KaniTTS

Author
wildminder (Account age: 4772 days) Nodes
View all nodes(1) Latest Updated
2025-10-17 Github Stars
0.03K

Github Ask wildminder Current Questions Past Questions

Table of Content

Description
ComfyUI-KaniTTS Introduction
How ComfyUI-KaniTTS Works
ComfyUI-KaniTTS Features
ComfyUI-KaniTTS Models
Troubleshooting ComfyUI-KaniTTS
Learn More about ComfyUI-KaniTTS
Related Nodes

How to Install ComfyUI-KaniTTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-KaniTTS

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-KaniTTS in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI-KaniTTS Description

ComfyUI-KaniTTS enables the generation of natural, high-quality speech from text, enhancing user interaction by converting written content into lifelike audio output.

ComfyUI-KaniTTS Introduction

ComfyUI-KaniTTS is an innovative extension designed to integrate the KaniTTS family of Text-to-Speech (TTS) models into the ComfyUI platform. This extension is tailored for AI artists who wish to transform text into high-quality speech effortlessly. By leveraging the power of KaniTTS, ComfyUI-KaniTTS offers a seamless way to generate speech with remarkable speed and fidelity, making it ideal for real-time applications. Whether you're creating voiceovers for digital art, animations, or interactive media, this extension provides a versatile toolset to bring your text to life with a variety of voices and languages.

How ComfyUI-KaniTTS Works

At its core, ComfyUI-KaniTTS operates using a two-stage pipeline. First, it employs a sophisticated language model to interpret and process the input text. Then, it utilizes an efficient audio codec to convert this processed text into speech. This approach ensures that the generated audio is not only fast but also of high quality. Imagine it as a skilled translator who not only understands the nuances of language but also has the ability to deliver it with the right tone and clarity. This makes ComfyUI-KaniTTS a powerful tool for artists looking to add a vocal dimension to their projects.

ComfyUI-KaniTTS Features

ComfyUI-KaniTTS is packed with features that enhance its usability and flexibility:

Multi-Speaker Synthesis: With the kani-tts-370m model, you can choose from a diverse array of predefined voices across multiple languages, allowing for a rich variety of vocal expressions.
Variety of Models: The extension provides access to five different KaniTTS models, each catering to different needs, from creative voice generation to specific vocal characteristics.
Automatic Model Management: The extension automatically handles the downloading and management of KaniTTS and NeMo codec models, optimizing memory usage to save VRAM.
Fine-Grained Control: Users can adjust parameters such as temperature, top-p, and repetition penalty to fine-tune the style and performance of the generated speech.
High-Efficiency Synthesis: Designed for low-latency inference, ComfyUI-KaniTTS can generate 15 seconds of audio in under a second on modern GPUs, making it suitable for real-time applications.

ComfyUI-KaniTTS Models

ComfyUI-KaniTTS offers a selection of models, each with unique capabilities:

kani-tts-370m: A multi-speaker model supporting a wide range of voices in various languages. Ideal for projects requiring diverse vocal expressions.
kani-tts-450m-0.1-pt: A base model pretrained on English, suitable for generating generic or randomized voices.
kani-tts-450m-0.1-ft: A finetuned model producing a consistent male voice, perfect for projects needing a specific male vocal character.
kani-tts-450m-0.2-pt: Another base model with broader multilingual support, offering creative voice generation.
kani-tts-450m-0.2-ft: A finetuned model for a consistent female voice, ideal for projects requiring a specific female vocal character.

Troubleshooting ComfyUI-KaniTTS

If you encounter issues while using ComfyUI-KaniTTS, here are some common problems and solutions:

Model Download Issues: Ensure you have a stable internet connection. If a model fails to download, try restarting ComfyUI.
Audio Quality Concerns: Adjust the temperature and top-p settings to refine the speech output. Lowering the temperature can result in more coherent speech.
Performance Issues: If you experience lag, consider reducing the input text length or adjusting the max_new_tokens parameter.
Installation Problems on Windows: Manually install pre-built packages for dependencies like nemo_toolkit using the provided .whl files.

Learn More about ComfyUI-KaniTTS

To further explore the capabilities of ComfyUI-KaniTTS, consider visiting the following resources:

KaniTTS GitHub Repository for more technical details and updates.
Hugging Face Model Links to access and explore different KaniTTS models.
Community forums and discussion groups where you can connect with other AI artists and share insights or seek assistance. By utilizing these resources, you can enhance your understanding and make the most out of ComfyUI-KaniTTS in your creative projects.

ComfyUI-KaniTTS Related Nodes

Kani TTS

Table of Content

Description
ComfyUI-KaniTTS Introduction
How ComfyUI-KaniTTS Works
ComfyUI-KaniTTS Features
ComfyUI-KaniTTS Models
Troubleshooting ComfyUI-KaniTTS
Learn More about ComfyUI-KaniTTS
Related Nodes

FLUX ControlNet Depth-V3 & Canny-V3

Achieve better control with FLUX-ControlNet-Depth & FLUX-ControlNet-Canny for FLUX.1 [dev].

SAM 3D ComfyUI | Object & Body Animation

Create realistic 3D motion and animation from static images instantly.

Multitalk | Realistic Talking Video Maker

One-click create multi-speaker lip-sync videos from portraits and voices!

IPAdapter V1 FaceID Plus | Consistent Characters

Leverage IPAdapter FaceID Plus V2 model to create consistent characters.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Extension: ComfyUI-KaniTTS

ComfyUI-KaniTTS

How to Install ComfyUI-KaniTTS

ComfyUI-KaniTTS Description

ComfyUI-KaniTTS Introduction

How ComfyUI-KaniTTS Works

ComfyUI-KaniTTS Features

ComfyUI-KaniTTS Models

Troubleshooting ComfyUI-KaniTTS

Learn More about ComfyUI-KaniTTS

ComfyUI-KaniTTS Related Nodes