ComfyUI Extension: ComfyUI_ChatterBox_SRT_Voice

Repo Name: ComfyUI_ChatterBox_SRT_Voice
Author: diodiogod (Account age: 768 days)
Nodes: 10
Latest Updated: 2026-03-21
Github Stars: 0.08K

Table of Contents

Description
ComfyUI_ChatterBox_SRT_Voice Introduction
How ComfyUI_ChatterBox_SRT_Voice Works
ComfyUI_ChatterBox_SRT_Voice Features
ComfyUI_ChatterBox_SRT_Voice Models
What's New with ComfyUI_ChatterBox_SRT_Voice
Troubleshooting ComfyUI_ChatterBox_SRT_Voice
Learn More about ComfyUI_ChatterBox_SRT_Voice
Related Nodes

How to Install ComfyUI_ChatterBox_SRT_Voice

Install this extension via the ComfyUI Manager by searching for ComfyUI_ChatterBox_SRT_Voice:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI_ChatterBox_SRT_Voice in the search bar and install the extension from the results.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

ComfyUI_ChatterBox_SRT_Voice Description

ComfyUI_ChatterBox_SRT_Voice is an extension that adds text-to-speech (TTS) and voice conversion to ComfyUI. It turns written text into spoken audio and can time that audio to SRT subtitle files.

ComfyUI_ChatterBox_SRT_Voice Introduction

ComfyUI_ChatterBox_SRT_Voice is a text-to-speech (TTS) extension for the ComfyUI framework. It integrates TTS and voice conversion built on ResembleAI's ChatterBox technology, generates speech from text of any length, and provides a specialized node for handling SRT (SubRip Subtitle) timings. That node synchronizes the generated audio with subtitle timestamps, which suits projects that require precise timing, such as video dubbing or multimedia presentations.
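One common way to handle text of arbitrary length is to split it into sentence-aligned chunks and synthesize each chunk separately. The sketch below illustrates that idea only; the function name `chunk_text` and the `max_chars` limit are hypothetical, and whether the extension chunks text in exactly this way is an assumption.

```python
import re


def chunk_text(text, max_chars=400):
    """Split text into chunks of whole sentences, each at most max_chars long.

    Hypothetical helper for illustration; not part of the extension's API.
    A single sentence longer than max_chars is kept whole rather than cut.
    """
    # Split after sentence-ending punctuation that is followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would exceed the limit: close the chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for piece in chunk_text("One. Two. Three.", max_chars=9):
        print(repr(piece))
```

Each chunk could then be sent to a TTS model in turn and the resulting audio segments concatenated.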

How ComfyUI_ChatterBox_SRT_Voice Works

At its core, ComfyUI_ChatterBox_SRT_Voice uses TTS models to convert written text into speech, primarily ResembleAI's ChatterBoxTTS. The extension takes text input, applies any specified voice or language settings, and generates audio output. The SRT node then aligns the generated speech with the subtitle timings so that audio and subtitles stay in sync.
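To make the subtitle-timing step concrete, here is a minimal sketch of reading SRT cues into start and end times in seconds. It follows the standard SRT layout (index line, `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, then text); the `Cue` class and `parse_srt` function are hypothetical names for illustration and are not the extension's own code.

```python
import re
from dataclasses import dataclass

# Matches one SRT timestamp such as 00:01:02,500 (a dot separator is tolerated).
_STAMP = re.compile(r"(\d{2}):(\d{2}):(\d{2})[,.](\d{3})")


@dataclass
class Cue:
    index: int
    start: float  # seconds
    end: float    # seconds
    text: str


def _to_seconds(stamp):
    match = _STAMP.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad SRT timestamp: {stamp!r}")
    hours, minutes, seconds, millis = map(int, match.groups())
    return hours * 3600 + minutes * 60 + seconds + millis / 1000


def parse_srt(content):
    """Parse SRT text into a list of Cue objects (illustrative sketch)."""
    cues = []
    normalized = content.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.splitlines()
        if len(lines) < 2:
            continue  # skip empty or incomplete blocks
        start, _, end = lines[1].partition("-->")
        cues.append(
            Cue(
                index=int(lines[0].strip()),
                start=_to_seconds(start),
                end=_to_seconds(end),
                text=" ".join(line.strip() for line in lines[2:]),
            )
        )
    return cues


if __name__ == "__main__":
    sample = "1\n00:00:01,000 --> 00:00:03,500\nHello there.\n"
    for cue in parse_srt(sample):
        print(cue.index, cue.start, cue.end, cue.text)
```

The difference `cue.end - cue.start` gives the time slot that the speech generated for that cue would need to fit.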

ComfyUI_ChatterBox_SRT_Voice Features

ChatterBox TTS: Generate speech from text with optional voice cloning for personalized voice outputs.
SRT Timing Node: Aligns audio with subtitle timings using SRT files, ensuring precise synchronization.
Character & Narrator Switching: Seamlessly switch between different characters or narrators using tags like [CharacterName].
Language Switching: Use bracket syntax [language:character] to switch languages and models automatically.
Iterative Voice Conversion: Refine voice conversion outputs through multiple iterations for improved quality.
Pause Tags System: Insert pauses in speech using tags like [pause:1s] for natural timing control.
Multi-language Support: Supports multiple languages, including English, German, Spanish, French, and more.
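The tag syntax in the features above can be pictured with a small sketch that splits a script into speech and pause events. It covers only the three forms quoted on this page: `[CharacterName]`, `[language:character]`, and `[pause:1s]`. The function `split_script` is hypothetical, and two behaviors are assumptions rather than documented facts: a tag stays in effect until the next tag, and a plain `[CharacterName]` tag resets the language to the default.

```python
import re

# Any bracketed tag, for example [Alice], [de:Bob], or [pause:1s].
_TAG = re.compile(r"\[([^\[\]]+)\]")
# Pause body in whole or fractional seconds, for example pause:1s or pause:0.5s.
_PAUSE = re.compile(r"pause:(\d+(?:\.\d+)?)s")


def split_script(text, narrator="narrator"):
    """Split tagged text into events (illustrative, not the extension's parser).

    Returns a list of tuples:
      ("say", speaker, language, text)   language is None for the default
      ("pause", seconds)
    """
    events = []
    speaker, language = narrator, None
    position = 0
    for match in _TAG.finditer(text):
        spoken = text[position:match.start()].strip()
        if spoken:
            events.append(("say", speaker, language, spoken))
        position = match.end()

        body = match.group(1).strip()
        pause = _PAUSE.fullmatch(body)
        if pause:
            events.append(("pause", float(pause.group(1))))
        elif ":" in body:
            # Assumed form: [language:character]
            language, speaker = (part.strip() for part in body.split(":", 1))
        else:
            # Assumed form: [CharacterName], which resets the language.
            speaker, language = body, None

    tail = text[position:].strip()
    if tail:
        events.append(("say", speaker, language, tail))
    return events


if __name__ == "__main__":
    for event in split_script("Welcome. [pause:1s] [Alice] Hi! [de:Bob] Hallo."):
        print(event)
```

Each "say" event could then be routed to the voice and model matching its speaker and language, with silence inserted for each "pause" event.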

ComfyUI_ChatterBox_SRT_Voice Models

The extension supports various models tailored for different languages and purposes. For instance, the F5-TTS model offers high-quality voice synthesis with support for multiple languages, while the ChatterBox model provides robust TTS capabilities with language and character switching features. Choosing the right model depends on your specific needs, such as the language of your text or the desired voice characteristics.

What's New with ComfyUI_ChatterBox_SRT_Voice

Recent updates have introduced several new features and improvements:

F5-TTS Integration: Enhanced voice synthesis with reference audio and text.
Audio Analyzer: Visualize audio waveforms for precise timing extraction.
Character & Language Switching: Improved syntax for seamless transitions between characters and languages.
Iterative Voice Conversion: Enhanced caching for faster experimentation with voice refinement.

Troubleshooting ComfyUI_ChatterBox_SRT_Voice

If you encounter issues while using the extension, consider the following solutions:

Audio Misalignment: Ensure your SRT files are correctly formatted and match the audio content.
Voice Model Errors: Verify that the required models are downloaded and placed in the correct directory.
Language Switching Issues: Double-check the syntax of your language tags and ensure the corresponding models are available.
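For the audio misalignment case, a quick format check on the SRT file can rule out malformed cues before looking elsewhere. The sketch below is a standalone check with a hypothetical function name, `check_srt`; it is not a tool shipped with the extension. It flags incomplete cues, malformed timing lines, cues that end before they start, and cues that overlap the previous one.

```python
import re

# Standard SRT timing line: HH:MM:SS,mmm --> HH:MM:SS,mmm
_TIMING = re.compile(
    r"^(\d{2}):(\d{2}):(\d{2}),(\d{3}) --> (\d{2}):(\d{2}):(\d{2}),(\d{3})"
)


def _millis(hours, minutes, seconds, millis):
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + millis


def check_srt(content):
    """Return a list of human-readable problems found in SRT text."""
    problems = []
    previous_end = 0
    blocks = re.split(r"\n\s*\n", content.replace("\r\n", "\n").strip())
    for number, block in enumerate(blocks, start=1):
        lines = block.splitlines()
        if len(lines) < 3:
            problems.append(f"cue {number}: expected index, timing, and text lines")
            continue
        if not lines[0].strip().isdigit():
            problems.append(f"cue {number}: index line is not a number")
        match = _TIMING.match(lines[1].strip())
        if match is None:
            problems.append(f"cue {number}: malformed timing line {lines[1]!r}")
            continue
        values = [int(group) for group in match.groups()]
        start = _millis(*values[:4])
        end = _millis(*values[4:])
        if end <= start:
            problems.append(f"cue {number}: end time is not after start time")
        if start < previous_end:
            problems.append(f"cue {number}: overlaps the previous cue")
        previous_end = max(previous_end, end)
    return problems


if __name__ == "__main__":
    sample = "1\n00:00:05,000 --> 00:00:04,000\nHi\n"
    for problem in check_srt(sample):
        print(problem)
```

An empty result means the file passed these basic checks; it does not confirm that the subtitle text matches the audio content.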

Learn More about ComfyUI_ChatterBox_SRT_Voice

To further explore the capabilities of ComfyUI_ChatterBox_SRT_Voice, you can access additional resources such as:

ChatterBox Demo
Model Downloads on Hugging Face
Community forums and tutorials available through the ComfyUI and ResembleAI websites

These resources provide further guidance and support for using the ComfyUI_ChatterBox_SRT_Voice extension in your creative projects.

ComfyUI_ChatterBox_SRT_Voice Related Nodes

🌊 Audio Wave Analyzer

🔧 Audio Wave Analyzer Options

🔧 F5-TTS Edit Options

👄 F5-TTS Speech Editor

🎤 F5-TTS Voice Generation

📺 F5-TTS SRT Voice Generation

📺 ChatterBox SRT Voice TTS

🎙️ ChatterBox Voice Capture (diogod)

🎤 ChatterBox Voice TTS (diogod)

🔄 ChatterBox Voice Conversion (diogod)
