RunComfy

Qwen Image Edit 2511 | Smart Image Edit Workflow

Edits your image exactly how you tell it to—fast and precise.

SDXL Turbo | Rapid Text to Image

Experience fast text-to-image synthesis with SDXL Turbo.

ComfyUI Trellis2 | Image-to-3D Mesh Generation Workflow

Convert images into structured, editable 3D meshes with precise geometry and topology control.

Flux Kontext Pulid | Consistent Character Generation

Create consistent characters using FLUX Kontext with a single face reference image.

ComfyUI > Nodes > ComfyUI > ElevenLabs Text to Speech

ComfyUI Node: ElevenLabs Text to Speech

Class Name

ElevenLabsTextToSpeech

Category
api node/audio/ElevenLabs

Author
ComfyAnonymous (Account age: 763days) Extension
ComfyUI Latest Updated
2026-05-13 Github Stars
112.77K

Github Ask ComfyAnonymous Current Questions Past Questions

Table of Content

Description
ElevenLabsTextToSpeech:
ElevenLabsTextToSpeech Input Parameters:
ElevenLabsTextToSpeech Output Parameters:
ElevenLabsTextToSpeech Usage Tips:
ElevenLabsTextToSpeech Common Errors and Solutions:
Related Nodes

How to Install ComfyUI

Install this extension via the ComfyUI Manager by searching for ComfyUI

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ElevenLabs Text to Speech Description

Converts text to speech with advanced technology for high-quality audio output, offering voice selection and parameter adjustments.

ElevenLabs Text to Speech:

The ElevenLabsTextToSpeech node is designed to convert written text into spoken words, providing a seamless way to generate audio content from textual input. This node leverages advanced text-to-speech technology to produce natural-sounding speech, making it an invaluable tool for creating voiceovers, audiobooks, and other audio-based applications. By allowing you to select different voices and adjust parameters such as stability and text normalization, this node offers flexibility and control over the speech synthesis process. Its primary goal is to transform text into high-quality audio output, enhancing the accessibility and engagement of your content.

ElevenLabs Text to Speech Input Parameters:

voice

This parameter specifies the voice to be used for speech synthesis. You can connect it from a Voice Selector or use an Instant Voice Clone. The choice of voice can significantly impact the tone and style of the generated speech, allowing you to tailor the audio output to suit your specific needs.

text

The text parameter is the core input for this node, representing the written content you wish to convert into speech. It supports multiline input, enabling you to provide extensive text passages. The clarity and coherence of the output speech are directly influenced by the quality and structure of the input text.

stability

This parameter controls the voice stability, with a default value of 0.5. It ranges from 0.0 to 1.0, where lower values offer a broader emotional range, and higher values result in more consistent but potentially monotonous speech. Adjusting this setting allows you to fine-tune the expressiveness of the synthesized voice.

apply_text_normalization

This option determines the mode of text normalization applied during speech synthesis. It offers three settings: "auto," "on," and "off." "Auto" lets the system decide the best approach, "on" always applies normalization, and "off" skips it. Text normalization can enhance the naturalness and intelligibility of the speech output.

model

The model parameter is a dictionary that includes settings such as model ID, similarity boost, speed, and optional features like speaker boost and style. These settings influence the characteristics and performance of the speech synthesis, allowing for customization based on your requirements.

language_code

This parameter specifies the language code for the text-to-speech conversion. If left blank, the system may default to a standard language setting. Providing a specific language code ensures that the speech synthesis aligns with the linguistic nuances of the input text.

seed

The seed parameter is used to initialize the random number generator for the speech synthesis process. It can help achieve consistent results across multiple runs by controlling the randomness in the voice generation.

output_format

This parameter defines the format of the audio output, such as MP3 or WAV. Choosing the appropriate format is crucial for compatibility with different playback systems and applications.

ElevenLabs Text to Speech Output Parameters:

audio

The primary output of this node is the audio parameter, which contains the synthesized speech in the specified format. This audio output can be used in various applications, from multimedia projects to accessibility tools, providing a versatile solution for converting text into engaging spoken content.

ElevenLabs Text to Speech Usage Tips:

Experiment with different voices to find the one that best matches the tone and style of your project.
Adjust the stability parameter to balance between emotional expressiveness and consistency in the speech output.
Use the apply_text_normalization setting to enhance the clarity and naturalness of the synthesized speech, especially for complex or technical text.

ElevenLabs Text to Speech Common Errors and Solutions:

Invalid voice selection

Explanation: The selected voice may not be available or properly configured.
Solution: Ensure that the voice is correctly connected from a Voice Selector or Instant Voice Clone and is supported by the system.

Text input too short

Explanation: The provided text does not meet the minimum length requirement.
Solution: Ensure that the text input is at least one character long and contains meaningful content for conversion.

Unsupported language code

Explanation: The specified language code is not recognized or supported by the system.
Solution: Verify that the language code is correct and supported by the ElevenLabs text-to-speech service.

Output format not recognized

Explanation: The chosen output format is not supported.
Solution: Select a valid output format, such as MP3 or WAV, that is compatible with the ElevenLabs text-to-speech service.

ElevenLabs Text to Speech Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI

Table of Content

Description
ElevenLabsTextToSpeech:
ElevenLabsTextToSpeech Input Parameters:
ElevenLabsTextToSpeech Output Parameters:
ElevenLabsTextToSpeech Usage Tips:
ElevenLabsTextToSpeech Common Errors and Solutions:
Related Nodes

Qwen Edit 2509 Light Restoration | Photo Relight Tool

Fix bad lighting fast for perfect, clean, balanced photos every time.

Wan 2.2 Image Generation | 2-in-1 Workflow Pack

MoE Mix + Low-Only with upscale. Pick one.

Wan2.2 Fun Camera | Cinematic Motion from Images

Turn still images into lively cinematic shots with smooth camera moves.

OmniGen | Image-To-Image

OmniGen: Modify Images Based on Reference Images and Prompts

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: ElevenLabs Text to Speech

ElevenLabsTextToSpeech

How to Install ComfyUI

ElevenLabs Text to Speech Description

ElevenLabs Text to Speech:

ElevenLabs Text to Speech Input Parameters:

voice

text

stability

apply_text_normalization

model

language_code

seed

output_format

ElevenLabs Text to Speech Output Parameters:

audio

ElevenLabs Text to Speech Usage Tips:

ElevenLabs Text to Speech Common Errors and Solutions:

Invalid voice selection

Text input too short

Unsupported language code

Output format not recognized

ElevenLabs Text to Speech Related Nodes