RunComfy

Flux Klein Face Swap | Realistic AI Face Editor

Swap faces perfectly. Natural, lifelike, and fast AI-powered editing.

SUPIR + Foolhardy Remacri | 8K Image/Video Upscaler

Upscale images to 8K with SUPIR and 4x Foolhardy Remacri model.

Flux Kontext Pulid | Consistent Character Generation

Create consistent characters using FLUX Kontext with a single face reference image.

Hunyuan Video | Video to Video

Combine text prompt and source video to generate new video.

ComfyUI > Nodes > ComfyUI-MegaTTS > MegaTTS Voice Maker

ComfyUI Node: MegaTTS Voice Maker

Class Name

MegaTTS_VoiceMaker

Category
🧪AILab/🔊Audio

Author
1038lab (Account age: 774days) Extension
ComfyUI-MegaTTS Latest Updated
2025-04-13 Github Stars
0.03K

Github Ask 1038lab Current Questions Past Questions

Table of Content

Description
MegaTTS_VoiceMaker:
MegaTTS_VoiceMaker Input Parameters:
MegaTTS_VoiceMaker Output Parameters:
MegaTTS_VoiceMaker Usage Tips:
MegaTTS_VoiceMaker Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-MegaTTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-MegaTTS

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-MegaTTS in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

MegaTTS Voice Maker Description

Sophisticated node for high-quality text-to-speech audio generation with advanced models for natural-sounding speech synthesis.

MegaTTS Voice Maker:

MegaTTS_VoiceMaker is a sophisticated node designed to facilitate the generation of high-quality text-to-speech (TTS) audio outputs. It leverages advanced TTS models to convert textual input into natural-sounding speech, making it an invaluable tool for AI artists and developers looking to integrate voice synthesis into their projects. The node is capable of processing input text and generating audio that closely mimics human speech, with options to adjust pronunciation strength and voice similarity to achieve the desired output. This flexibility allows users to create personalized and contextually appropriate audio content, enhancing the overall user experience in applications such as virtual assistants, audiobooks, and interactive media.

MegaTTS Voice Maker Input Parameters:

input_text

The input_text parameter is the primary input for the MegaTTS_VoiceMaker node, representing the text that you wish to convert into speech. This parameter is crucial as it directly influences the content of the generated audio. There are no specific minimum or maximum values for this parameter, but it is important to ensure that the text is clear and free of errors to achieve the best results. The input text should be concise and well-structured to facilitate accurate and natural-sounding speech synthesis.

language

The language parameter specifies the language in which the input text is written. This parameter is essential for ensuring that the TTS model applies the correct phonetic and linguistic rules during speech synthesis. The available options for this parameter depend on the languages supported by the TTS model being used. Selecting the appropriate language is crucial for achieving accurate pronunciation and intonation in the generated audio.

pronunciation_strength

The pronunciation_strength parameter allows you to adjust the emphasis placed on pronunciation during speech synthesis. This parameter can be used to fine-tune the clarity and articulation of the generated speech. A higher value will result in more pronounced and distinct speech, while a lower value will produce a more relaxed and natural-sounding output. The default value is typically set to a balanced level, but you can adjust it based on your specific needs and preferences.

voice_similarity

The voice_similarity parameter controls how closely the generated speech resembles a reference voice. This parameter is useful for creating consistent and recognizable voice outputs, especially when using a specific voice as a reference. A higher value will result in speech that closely matches the reference voice, while a lower value will allow for more variation. The default value is set to provide a good balance between similarity and naturalness, but you can adjust it to suit your requirements.

MegaTTS Voice Maker Output Parameters:

audio_output

The audio_output parameter is the primary output of the MegaTTS_VoiceMaker node, containing the synthesized speech audio. This output is crucial as it represents the final product of the TTS process, which can be used in various applications such as voiceovers, virtual assistants, and multimedia content. The audio output is typically provided in a standard format, such as a waveform, with a sample rate that ensures high-quality playback. The quality and characteristics of the audio output are influenced by the input parameters, allowing you to customize the speech synthesis to meet your specific needs.

status

The status parameter provides feedback on the success or failure of the TTS process. It is an important output that helps you understand whether the node executed successfully or encountered any issues. The status message can include information about successful processing, memory cleanup, or any errors that occurred during execution. This feedback is valuable for troubleshooting and ensuring that the TTS process runs smoothly.

MegaTTS Voice Maker Usage Tips:

Ensure that your input text is clear and free of grammatical errors to achieve the best speech synthesis results.
Experiment with the pronunciation_strength and voice_similarity parameters to find the right balance for your specific application, whether you need clear articulation or a more natural-sounding voice.
Use the language parameter to match the input text's language, ensuring accurate pronunciation and intonation.

MegaTTS Voice Maker Common Errors and Solutions:

Failed to initialize TTS inferencer: `<error_message>`

Explanation: This error occurs when the TTS inferencer fails to initialize, possibly due to missing model files or incorrect configuration.
Solution: Ensure that all necessary model files are present and correctly configured. Try re-downloading or reinstalling the required files and check the configuration settings.

No input audio provided

Explanation: This error indicates that the node did not receive any input audio data, which is necessary for processing.
Solution: Verify that the input parameters are correctly set and that the input text is provided. Ensure that the input data is in the correct format and try again.

Error: Waveform must be a tensor

Explanation: This error suggests that the input waveform is not in the expected tensor format, which is required for processing.
Solution: Ensure that the input waveform is correctly formatted as a tensor. Convert the waveform to the appropriate format if necessary and retry the process.

MegaTTS Voice Maker Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-MegaTTS

Table of Content

Description
MegaTTS_VoiceMaker:
MegaTTS_VoiceMaker Input Parameters:
MegaTTS_VoiceMaker Output Parameters:
MegaTTS_VoiceMaker Usage Tips:
MegaTTS_VoiceMaker Common Errors and Solutions:
Related Nodes

Wan 2.1 LoRA

Enhance Wan 2.1 video generation with LoRA models for improved style and customization.

ByteDance USO | Unified Style & Subject Generator

ByteDance USO makes subject and style fusion simple and powerful.

Hunyuan Image 2.1 | High-Res AI Image Generator

Next-gen 2.1 model for crisp, sharp, ultra-clear AI visuals fast.

Hunyuan3D 2.1 | Image to 3D Model

Big jump from 2.0: Turn photos into incredible 3D models instantly.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: MegaTTS Voice Maker

MegaTTS_VoiceMaker

How to Install ComfyUI-MegaTTS

MegaTTS Voice Maker Description

MegaTTS Voice Maker:

MegaTTS Voice Maker Input Parameters:

input_text

language

pronunciation_strength

voice_similarity

MegaTTS Voice Maker Output Parameters:

audio_output

status

MegaTTS Voice Maker Usage Tips:

MegaTTS Voice Maker Common Errors and Solutions:

Failed to initialize TTS inferencer: <error_message>

No input audio provided

Error: Waveform must be a tensor

MegaTTS Voice Maker Related Nodes

Failed to initialize TTS inferencer: `<error_message>`