RunComfy

Wan 2.2 Animate | Character Swap & Lip-Sync

Transforms any face to speak and move like the original with ease.

AnimateDiff + ControlNet + IPAdapter V1 | Japanese Anime Style

Transform your videos into mesmerizing Japanese anime.

PMRF Ultra Fast Upscaler | Low VRAM ComfyUI

Ultra fast PMRF upscaler! 3.79s on medium machine. 2x scale.

ReActor | Fast Face Swap

With ComfyUI ReActor, you can easily swap the faces of one or more characters in images or videos.

ComfyUI > Nodes > ComfyUI_ChatterBox_Voice > 🎤 ChatterBox Voice TTS

ComfyUI Node: 🎤 ChatterBox Voice TTS

Class Name

ChatterBoxVoiceTTS

Category
ChatterBox Voice

Author
ShmuelRonen (Account age: 1863days) Extension
ComfyUI_ChatterBox_Voice Latest Updated
2025-06-04 Github Stars
0.02K

Github Ask ShmuelRonen Current Questions Past Questions

Table of Content

Description
ChatterBoxVoiceTTS:
ChatterBoxVoiceTTS Input Parameters:
ChatterBoxVoiceTTS Output Parameters:
ChatterBoxVoiceTTS Usage Tips:
ChatterBoxVoiceTTS Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_ChatterBox_Voice

Install this extension via the ComfyUI Manager by searching for ComfyUI_ChatterBox_Voice

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI_ChatterBox_Voice in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

🎤 ChatterBox Voice TTS Description

ChatterBoxVoiceTTS converts text to natural speech using advanced models for dynamic applications.

🎤 ChatterBox Voice TTS:

ChatterBoxVoiceTTS is a sophisticated text-to-speech (TTS) node designed to convert written text into natural-sounding speech. It leverages advanced machine learning models to generate high-quality audio outputs that mimic human speech patterns. This node is particularly beneficial for applications requiring dynamic voice synthesis, such as virtual assistants, audiobooks, and interactive voice response systems. By utilizing a combination of text tokenization, voice encoding, and speech generation, ChatterBoxVoiceTTS ensures that the synthesized speech is both intelligible and expressive. The node also incorporates a watermarking feature to protect the generated audio content, making it a reliable choice for content creators and developers who need to maintain the integrity of their audio outputs.

🎤 ChatterBox Voice TTS Input Parameters:

text

The text parameter is the primary input for the ChatterBoxVoiceTTS node, representing the written content that you wish to convert into speech. This parameter accepts a string of text, which is then processed and tokenized by the node to facilitate speech synthesis. The quality and clarity of the generated speech are directly influenced by the input text, so it is important to ensure that the text is well-structured and free of errors. There are no explicit minimum or maximum length constraints, but longer texts may require more processing time.

exaggeration

The exaggeration parameter controls the emotional intensity of the synthesized speech. By adjusting this parameter, you can influence how expressive the generated voice sounds, ranging from a neutral tone to a more animated or emotional delivery. This parameter accepts a numerical value, typically between 0 and 1, where 0 represents no exaggeration and 1 represents maximum exaggeration. The default value is usually set to a moderate level to balance expressiveness and naturalness.

audio_prompt_path

The audio_prompt_path parameter is an optional input that allows you to specify a path to an audio file containing a voice prompt. This prompt is used to condition the speech synthesis process, enabling the node to mimic the style or characteristics of the provided voice sample. If this parameter is not provided, the node relies on pre-configured conditionals to generate speech. This feature is particularly useful for applications requiring voice cloning or personalized voice outputs.

🎤 ChatterBox Voice TTS Output Parameters:

waveform

The waveform output parameter represents the synthesized audio data in a format that can be easily processed or played back. This parameter provides the audio waveform as a tensor, which includes a batch dimension for compatibility with various audio processing frameworks. The waveform is the core output of the node, encapsulating the generated speech in a form that can be directly used in applications or further processed for enhancements.

sample_rate

The sample_rate output parameter indicates the sampling rate of the generated audio waveform. This parameter is crucial for ensuring that the audio is played back at the correct speed and quality. The sample rate is typically set to a standard value, such as 16,000 Hz or 22,050 Hz, which balances audio quality and processing efficiency. Understanding the sample rate is important for integrating the output with other audio systems or for performing additional audio processing tasks.

🎤 ChatterBox Voice TTS Usage Tips:

Ensure that the input text is clear and well-structured to achieve the best speech synthesis results. Avoid using overly complex sentences or ambiguous language.
Experiment with the exaggeration parameter to find the right balance of expressiveness for your application. A higher value can make the speech more engaging, while a lower value may be suitable for formal or informational content.
Utilize the audio_prompt_path feature to create personalized voice outputs by providing a sample of the desired voice style. This can enhance the user experience in applications requiring voice customization.

🎤 ChatterBox Voice TTS Common Errors and Solutions:

"Please `prepare_conditionals` first or specify `audio_prompt_path`"

Explanation: This error occurs when the node attempts to generate speech without having the necessary conditionals prepared or an audio prompt specified.
Solution: Ensure that you have either prepared the conditionals using the appropriate method or provided a valid path to an audio prompt file. This will allow the node to proceed with the speech synthesis process.

"Invalid text tokens"

Explanation: This error indicates that the input text could not be properly tokenized, possibly due to unsupported characters or formatting issues.
Solution: Review the input text for any unusual characters or formatting errors. Ensure that the text is compatible with the tokenizer and free of unsupported symbols.

🎤 ChatterBox Voice TTS Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_ChatterBox_Voice

Table of Content

Description
ChatterBoxVoiceTTS:
ChatterBoxVoiceTTS Input Parameters:
ChatterBoxVoiceTTS Output Parameters:
ChatterBoxVoiceTTS Usage Tips:
ChatterBoxVoiceTTS Common Errors and Solutions:
Related Nodes

DiffuEraser | Video Inpainting

Erase objects from videos with auto-masking and realistic reconstruction.

Flux Kontext Zoom Out ComfyUI Workflow | Seamless Outpainting

Zoom Out LoRA enlarges images seamlessly with natural continuation.

Consistent Character Creator 3.0 | Easy Consistency, Any Angle

Make characters stay the same, every angle, strong and perfect.

Flux TTP Upscale | 4K Face Restore

Repair distorted faces and upscale images to 4K resolution.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: 🎤 ChatterBox Voice TTS

ChatterBoxVoiceTTS

How to Install ComfyUI_ChatterBox_Voice

🎤 ChatterBox Voice TTS Description

🎤 ChatterBox Voice TTS:

🎤 ChatterBox Voice TTS Input Parameters:

text

exaggeration

audio_prompt_path

🎤 ChatterBox Voice TTS Output Parameters:

waveform

sample_rate

🎤 ChatterBox Voice TTS Usage Tips:

🎤 ChatterBox Voice TTS Common Errors and Solutions:

"Please prepare_conditionals first or specify audio_prompt_path"

"Invalid text tokens"

🎤 ChatterBox Voice TTS Related Nodes

"Please `prepare_conditionals` first or specify `audio_prompt_path`"