Install this extension via the ComfyUI Manager by searching for TTS Audio Suite:
1. Click the Manager button in the main menu
2. Select the Custom Nodes Manager button
3. Enter TTS Audio Suite in the search bar
After installation, click the Restart button to restart ComfyUI. Then manually refresh your browser to clear the cache and load the updated list of nodes.
TTS Audio Suite is a universal multi-engine TTS extension for ComfyUI, featuring modular engine adapters, character voice management, SRT subtitle support, and advanced audio processing, supporting engines like ChatterBox and F5-TTS.
TTS-Audio-Suite Introduction
TTS-Audio-Suite is an extension for ComfyUI that covers both Text-to-Speech (TTS) and voice conversion. It integrates multiple engines, including ChatterBox, F5-TTS, Higgs Audio 2, and RVC (Retrieval-based Voice Conversion), behind a unified set of nodes for generating speech from text. It is particularly useful for AI artists who want to add realistic voiceovers to their projects without extensive technical knowledge. The suite's modular architecture is designed so that further engines and integrations can be added later.
How TTS-Audio-Suite Works
TTS-Audio-Suite converts written text into speech using machine learning models. It draws on several TTS engines, each with its own strengths, to produce natural-sounding speech. The extension can also convert one voice into another, which allows different characters to interact within the same audio project. Because each engine sits behind a modular adapter, you can switch between engines and models without rebuilding a workflow. For instance, you can use the ChatterBox engine for multilingual support or F5-TTS for high-quality voice cloning.
TTS-Audio-Suite Features
Multi-Engine TTS: Supports various engines like ChatterBox, F5-TTS, and Higgs Audio 2, each offering unique voice synthesis capabilities.
Voice Conversion: Allows real-time voice transformation using RVC models, enabling character voice changes within a project.
Voice Capture & Recording: Features smart silence detection for capturing voice inputs, which can be used for voice cloning.
Character & Language Switching: Easily switch between different characters and languages using simple tags, enhancing multilingual projects.
Emotion Control: Adjust the emotional tone of the speech with parameters for expressive and dynamic audio outputs.
Advanced Audio Processing: Includes options for noise reduction and echo removal, ensuring high-quality audio production.
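To make the idea of tag-based character and language switching concrete, here is a minimal sketch of how a tagged script could be split into per-speaker segments. It assumes tags written as [Name] or [lang:Name]; that tag form, the Segment type, and the split_by_tags function are illustrative assumptions, not the extension's actual parser, so check the project documentation for the exact syntax it accepts.

```python
import re
from typing import List, NamedTuple, Optional


class Segment(NamedTuple):
    character: str           # speaker active for this piece of text
    language: Optional[str]  # language code from the tag, or None if not given
    text: str


# Assumed tag form (hypothetical): [Name] or [lang:Name], e.g. [Alice] or [de:Bob].
TAG = re.compile(r"\[(?:(?P<lang>[A-Za-z-]{2,5}):)?(?P<name>[^\[\]:]+)\]")


def split_by_tags(script: str, default_character: str = "narrator") -> List[Segment]:
    """Split a script into segments, switching speaker at each tag."""
    segments: List[Segment] = []
    character, language = default_character, None
    pos = 0
    for match in TAG.finditer(script):
        # Text before this tag belongs to the previously active speaker.
        text = script[pos:match.start()].strip()
        if text:
            segments.append(Segment(character, language, text))
        character = match.group("name").strip()
        language = match.group("lang")
        pos = match.end()
    # Remaining text after the last tag.
    tail = script[pos:].strip()
    if tail:
        segments.append(Segment(character, language, tail))
    return segments


if __name__ == "__main__":
    for seg in split_by_tags("Hello there. [Alice] Hi! [de:Bob] Guten Tag."):
        print(seg)
```

Each resulting segment could then be routed to whichever voice and language model is configured for that character. In this sketch a tag without a language code resets the language to None rather than inheriting the previous one; that is a design choice of the example only.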
TTS-Audio-Suite Models
The suite includes several models tailored for different tasks:
ChatterBox Multilingual TTS: Ideal for projects requiring support for multiple languages, offering seamless language switching.
F5-TTS: Known for its high-quality voice cloning capabilities, suitable for projects needing precise voice replication.
Higgs Audio 2: Provides state-of-the-art voice cloning with advanced neural voice replication, perfect for creating distinct character voices.
RVC Models: Used for real-time voice conversion, allowing for dynamic voice changes in interactive media.
What's New with TTS-Audio-Suite
Recent updates have introduced several enhancements:
Improved Multilingual Support: Expanded language options and better model integration for smoother language transitions.
Enhanced Voice Conversion: Iterative refinement for more accurate voice transformations.
New Audio Processing Tools: Added features for noise reduction and echo removal, improving overall audio quality.
User-Friendly Interface: Simplified controls and settings for easier customization and use by non-technical users.
Troubleshooting TTS-Audio-Suite
If you encounter issues while using TTS-Audio-Suite, here are some common solutions:
Model Loading Errors: Ensure all required models are downloaded and placed in the correct directories. Check the console output for specific error messages.
Audio Quality Issues: Adjust the audio processing settings, such as noise reduction and echo removal, to improve output quality.
Voice Conversion Problems: Verify that the correct models are selected and that the input audio is clear and free of background noise.
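For model loading errors, a quick first check is whether the expected files are actually on disk. The sketch below reports which files are missing under a models folder. The function name find_missing_models, the folder names, and the file names in the example are all hypothetical placeholders; substitute the paths that your engine's documentation or the console error message refers to.

```python
from pathlib import Path
from typing import Dict, Iterable, List


def find_missing_models(models_root: Path, expected: Dict[str, Iterable[str]]) -> List[Path]:
    """Return the expected model files that do not exist under models_root.

    expected maps a subfolder name to the file names that should be in it.
    """
    missing: List[Path] = []
    for folder, filenames in expected.items():
        for name in filenames:
            candidate = models_root / folder / name
            if not candidate.is_file():
                missing.append(candidate)
    return missing


if __name__ == "__main__":
    # Hypothetical layout for illustration only; real folder and file names
    # depend on the engine and on your ComfyUI installation.
    expected_files = {
        "example_engine": ["model.safetensors", "config.json"],
    }
    for path in find_missing_models(Path("ComfyUI/models/TTS"), expected_files):
        print(f"missing: {path}")
```

If the script reports nothing missing and loading still fails, the console output is the next place to look, since the cause may be a corrupt download or a version mismatch rather than a missing file.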
Learn More about TTS-Audio-Suite
For further assistance and resources, consider exploring the following:
Tutorials and Documentation: Detailed guides and examples are available to help you get started and make the most of the suite's features.
Community Forums: Join discussions with other users and developers to share tips, ask questions, and get support.
Example Workflows: Download and experiment with pre-configured workflows to see how different features can be combined for creative projects.
By leveraging these resources, you can enhance your understanding and use of TTS-Audio-Suite, unlocking its full potential for your audio projects.