ComfyUI > Nodes > ComfyUI_KokoroTTS_MW

ComfyUI Extension: ComfyUI_KokoroTTS_MW

Repo Name

ComfyUI_KokoroTTS_MW

Author
mw (Account age: 2267 days)
Nodes
View all nodes(2)
Latest Updated
2025-04-27
Github Stars
0.02K

How to Install ComfyUI_KokoroTTS_MW

Install this extension via the ComfyUI Manager by searching for ComfyUI_KokoroTTS_MW
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_KokoroTTS_MW in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI_KokoroTTS_MW Description

ComfyUI_KokoroTTS_MW is a Text To Speech node for ComfyUI, utilizing Kokoro TTS to offer multilingual support across 8 languages and 150 distinct voices.

ComfyUI_KokoroTTS_MW Introduction

ComfyUI_KokoroTTS_MW is an extension designed to integrate high-quality text-to-speech (TTS) capabilities into the ComfyUI environment. This extension leverages the power of the Kokoro TTS model, which is known for its lightweight architecture and impressive performance. With ComfyUI_KokoroTTS_MW, you can transform written text into natural-sounding speech across multiple languages and voices, making it an invaluable tool for AI artists looking to add an auditory dimension to their projects. Whether you're creating interactive art installations, voiceovers for animations, or simply exploring new creative avenues, this extension provides a versatile and efficient solution.

How ComfyUI_KokoroTTS_MW Works

At its core, ComfyUI_KokoroTTS_MW utilizes the Kokoro TTS model, which is an open-weight model with 82 million parameters. Despite its relatively small size, Kokoro delivers audio quality comparable to larger models while being faster and more cost-effective. The extension works by taking input text and processing it through a pipeline that converts the text into phonemes, which are then synthesized into speech. This process is akin to how humans read aloud, where the brain interprets written words and converts them into spoken language. The extension supports multiple languages and voices, allowing for a wide range of expressive possibilities.

ComfyUI_KokoroTTS_MW Features

  • High-Quality Synthesis: Produces clear and natural-sounding speech, suitable for professional and creative applications.
  • Multilingual Support: Currently supports eight languages, including American English, British English, Japanese, Chinese, Spanish, French, Hindi, Italian, and Brazilian Portuguese.
  • Diverse Voice Options: Offers a variety of voices for each language, enabling you to choose the perfect tone and style for your project.
  • Seamless Integration: Easily integrates with ComfyUI workflows, allowing you to incorporate TTS functionality into your existing projects without hassle.

ComfyUI_KokoroTTS_MW Models

The extension supports two main models: Kokoro-82M and Kokoro-82M-v1.1-zh. The Kokoro-82M model is suitable for general use across all supported languages, while the Kokoro-82M-v1.1-zh is optimized for Mandarin Chinese. Depending on your project's language requirements, you can select the appropriate model to ensure optimal performance and quality.

What's New with ComfyUI_KokoroTTS_MW

  • [2025-03-22]: The code has been refactored to enhance generation speed, making the extension more efficient and responsive.
  • [2025-03-05]: Expanded language support to include Spanish, French, Hindi, Italian, and Brazilian Portuguese, along with 150 new voices. Additionally, 100 new Chinese voices have been added, providing even more options for customization and creativity.

Troubleshooting ComfyUI_KokoroTTS_MW

If you encounter issues while using the extension, here are some common problems and solutions:

  • Problem: The extension is not producing any sound.
  • Solution: Ensure that the models and voices are correctly downloaded and placed in the ComfyUI\models\Kokorotts directory. Verify that the file paths and configurations are correct.
  • Problem: The generated speech sounds unnatural or robotic.
  • Solution: Experiment with different voices and adjust the speed settings to find a more natural-sounding output. Ensure that the text input is clear and free of errors.
  • Problem: Language or voice not available.
  • Solution: Check the latest updates to ensure you have the most recent version of the extension, which includes new languages and voices.

Learn More about ComfyUI_KokoroTTS_MW

To further explore the capabilities of ComfyUI_KokoroTTS_MW, you can access additional resources such as tutorials and community forums. The Kokoro GitHub repository provides detailed documentation and examples. You can also join the Kokoro Discord server to connect with other users, share experiences, and seek support. For those interested in the technical aspects, the Kokoro-82M model page offers insights into the model's architecture and performance.

ComfyUI_KokoroTTS_MW Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.