ComfyUI > Nodes > TTS Audio Suite > 🔧 F5-TTS Edit Options

ComfyUI Node: 🔧 F5-TTS Edit Options

Class Name

ChatterBoxF5TTSEditOptions

Category
TTS Audio Suite/👄 F5-TTS
Author
diogod (Account age: 667days)
Extension
TTS Audio Suite
Latest Updated
2025-12-13
Github Stars
0.46K

How to Install TTS Audio Suite

Install this extension via the ComfyUI Manager by searching for TTS Audio Suite
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter TTS Audio Suite in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

🔧 F5-TTS Edit Options Description

Advanced configuration options for fine-tuning text-to-speech outputs in the TTS-Audio-Suite.

🔧 F5-TTS Edit Options:

The ChatterBoxF5TTSEditOptions node is designed to provide advanced configuration options for the F5-TTS Speech Editor, enhancing your ability to fine-tune and customize text-to-speech outputs. This node is part of the TTS-Audio-Suite and follows the Audio Analyzer pattern, offering a separate options node to manage various settings. Its primary purpose is to give you greater control over the speech synthesis process, allowing for more natural and contextually appropriate audio outputs. By utilizing this node, you can adjust parameters that influence the timing, pacing, and overall quality of the generated speech, ensuring that the audio aligns with your specific artistic or functional requirements. This node is particularly beneficial for AI artists and developers who seek to create more engaging and realistic audio experiences.

🔧 F5-TTS Edit Options Input Parameters:

min_stretch_ratio

The min_stretch_ratio parameter is a floating-point value that determines the minimum factor by which audio can be sped up in the smart_natural mode. It allows you to control the pacing of the speech, with a lower value resulting in faster speech. The default value is 0.5, meaning the audio can be half as long, while the minimum and maximum values are 0.1 and 2.0, respectively. This parameter is useful for adjusting the speed of speech to match the desired tempo or to fit within specific time constraints.

timing_tolerance

The timing_tolerance parameter is a floating-point value that specifies the maximum allowed deviation in seconds for timing adjustments in the smart_natural mode. It provides flexibility in how closely the generated speech adheres to the original timing, with higher values allowing for more deviation. The default value is 2.0, with a range from 0.5 to 10.0. This parameter is essential for ensuring that the speech timing feels natural and is adaptable to different contexts or requirements.

crash_protection_template

The crash_protection_template is a string parameter that defines a custom padding template for short text segments to prevent crashes in ChatterBox. Due to a known bug, text segments shorter than approximately 21 characters can cause CUDA tensor errors. This template uses {seg} as a placeholder for the original text and can include various padding styles, such as hesitation or repetition. The default template is hmm ,, {seg} hmm ,,, but you can customize it to suit your needs or disable it by using an empty string. This parameter is crucial for maintaining stability and preventing errors during text-to-speech generation.

🔧 F5-TTS Edit Options Output Parameters:

The ChatterBoxF5TTSEditOptions node does not explicitly define output parameters in the provided context. Its primary function is to configure and adjust input parameters for the F5-TTS Speech Editor, which in turn affects the output of the text-to-speech process.

🔧 F5-TTS Edit Options Usage Tips:

  • Experiment with the min_stretch_ratio to find the optimal speech speed that matches your project's requirements, whether you need faster or slower speech.
  • Adjust the timing_tolerance to achieve a balance between natural-sounding speech and precise timing, especially when synchronizing audio with visual elements.
  • Customize the crash_protection_template to prevent errors with short text segments, ensuring smooth and uninterrupted text-to-speech generation.

🔧 F5-TTS Edit Options Common Errors and Solutions:

CUDA tensor error due to short text

  • Explanation: This error occurs when text segments shorter than approximately 21 characters are processed, leading to CUDA tensor errors.
  • Solution: Use the crash_protection_template parameter to add padding to short text segments, preventing the error. Customize the template to fit your needs or use the default setting.

Unexpected audio speed

  • Explanation: The audio may play faster or slower than expected if the min_stretch_ratio is not set correctly.
  • Solution: Adjust the min_stretch_ratio parameter to control the speed of the audio, ensuring it aligns with your desired pacing.

Timing mismatch in speech

  • Explanation: The generated speech may not match the expected timing if the timing_tolerance is set too high or too low.
  • Solution: Fine-tune the timing_tolerance parameter to achieve the right balance between natural timing and adherence to the original script.

🔧 F5-TTS Edit Options Related Nodes

Go back to the extension to check out more related nodes.
TTS Audio Suite
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.