ComfyUI > Nodes > ComfyUI_ChatterBox_SRT_Voice

ComfyUI Extension: ComfyUI_ChatterBox_SRT_Voice

Repo Name

ComfyUI_ChatterBox_SRT_Voice

Author
diodiogod (Account age: 768 days)
Nodes
View all nodes(10)
Latest Updated
2026-03-21
Github Stars
0.08K

How to Install ComfyUI_ChatterBox_SRT_Voice

Install this extension via the ComfyUI Manager by searching for ComfyUI_ChatterBox_SRT_Voice
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_ChatterBox_SRT_Voice in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI_ChatterBox_SRT_Voice Description

ComfyUI_ChatterBox_SRT_Voice is an extension that integrates voice capabilities into the ComfyUI interface, enabling users to interact using speech. It enhances user experience by converting spoken input into text, facilitating seamless communication within the application.

ComfyUI_ChatterBox_SRT_Voice Introduction

ComfyUI_ChatterBox_SRT_Voice is an extension designed to enhance your text-to-speech (TTS) experience within the ComfyUI framework. This extension integrates high-quality TTS and voice conversion capabilities using ResembleAI's ChatterBox technology. It allows you to generate speech from text with unlimited text length and provides a specialized node for handling SRT (SubRip Subtitle) timings. This means you can synchronize audio with subtitles seamlessly, making it ideal for projects that require precise timing, such as video dubbing or multimedia presentations.

How ComfyUI_ChatterBox_SRT_Voice Works

At its core, ComfyUI_ChatterBox_SRT_Voice leverages advanced TTS models to convert written text into spoken words. It uses ResembleAI's ChatterBoxTTS, which is known for its high-quality voice synthesis. The extension processes text input, applies any specified voice or language settings, and generates audio output. The SRT node ensures that the generated speech aligns perfectly with subtitle timings, providing a natural and coherent audio-visual experience.

ComfyUI_ChatterBox_SRT_Voice Features

  • ChatterBox TTS: Generate speech from text with optional voice cloning for personalized voice outputs.
  • SRT Timing Node: Aligns audio with subtitle timings using SRT files, ensuring precise synchronization.
  • Character & Narrator Switching: Seamlessly switch between different characters or narrators using tags like [CharacterName].
  • Language Switching: Use bracket syntax [language:character] to switch languages and models automatically.
  • Iterative Voice Conversion: Refine voice conversion outputs through multiple iterations for improved quality.
  • Pause Tags System: Insert pauses in speech using tags like [pause:1s] for natural timing control.
  • Multi-language Support: Supports multiple languages, including English, German, Spanish, French, and more.

ComfyUI_ChatterBox_SRT_Voice Models

The extension supports various models tailored for different languages and purposes. For instance, the F5-TTS model offers high-quality voice synthesis with support for multiple languages, while the ChatterBox model provides robust TTS capabilities with language and character switching features. Choosing the right model depends on your specific needs, such as the language of your text or the desired voice characteristics.

What's New with ComfyUI_ChatterBox_SRT_Voice

Recent updates have introduced several new features and improvements:

  • F5-TTS Integration: Enhanced voice synthesis with reference audio and text.
  • Audio Analyzer: Visualize audio waveforms for precise timing extraction.
  • Character & Language Switching: Improved syntax for seamless transitions between characters and languages.
  • Iterative Voice Conversion: Enhanced caching for faster experimentation with voice refinement.

Troubleshooting ComfyUI_ChatterBox_SRT_Voice

If you encounter issues while using the extension, consider the following solutions:

  • Audio Misalignment: Ensure your SRT files are correctly formatted and match the audio content.
  • Voice Model Errors: Verify that the required models are downloaded and placed in the correct directory.
  • Language Switching Issues: Double-check the syntax of your language tags and ensure the corresponding models are available.

Learn More about ComfyUI_ChatterBox_SRT_Voice

To further explore the capabilities of ComfyUI_ChatterBox_SRT_Voice, you can access additional resources such as:

  • ChatterBox Demo
  • Model Downloads on Hugging Face
  • Community forums and tutorials available through the ComfyUI and ResembleAI websites. These resources provide valuable insights and support for maximizing the potential of the ComfyUI_ChatterBox_SRT_Voice extension in your creative projects.

ComfyUI_ChatterBox_SRT_Voice Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.