RunComfy

Pose Control LipSync S2V | Expressive Video Generator

Turn images into talking, moving characters with pose and audio control.

DreamO | Unified Multi-Task Image Customization Framework

Perform identity, style, try-on, and multi-condition image generation from 1–3 references

Instagirl v.20 | Wan 2.2 LoRA Demo

A Wan 2.2 workflow for demoing the Instagirl LoRA by Instara.

Qwen Image Edit 2509 | Multi-Image Editor

Turn 2–3 images into one seamless, edited masterpiece instantly.

ComfyUI > Nodes > ComfyUI > ElevenLabs Text to Dialogue

ComfyUI Node: ElevenLabs Text to Dialogue

Class Name

ElevenLabsTextToDialogue

Category
api node/audio/ElevenLabs

Author
ComfyAnonymous (Account age: 763days) Extension
ComfyUI Latest Updated
2026-05-13 Github Stars
112.77K

Github Ask ComfyAnonymous Current Questions Past Questions

Table of Content

Description
ElevenLabsTextToDialogue:
ElevenLabsTextToDialogue Input Parameters:
ElevenLabsTextToDialogue Output Parameters:
ElevenLabsTextToDialogue Usage Tips:
ElevenLabsTextToDialogue Common Errors and Solutions:
Related Nodes

How to Install ComfyUI

Install this extension via the ComfyUI Manager by searching for ComfyUI

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ElevenLabs Text to Dialogue Description

Transform written text into multi-speaker dialogue audio using AI for dynamic, interactive content creation with customizable voices.

ElevenLabs Text to Dialogue:

The ElevenLabsTextToDialogue node is designed to transform written text into a multi-speaker dialogue audio format, leveraging advanced AI models to generate realistic and engaging conversations. This node allows you to input multiple dialogue segments, each with its own text and designated voice, enabling the creation of dynamic and interactive audio content. It is particularly beneficial for projects that require diverse character interactions, such as audio dramas, podcasts, or interactive storytelling. By utilizing this node, you can achieve high-quality dialogue generation with customizable voice settings, ensuring that each character's voice is distinct and appropriate for the context. The node supports various languages and offers options for text normalization, providing flexibility and control over the final audio output.

ElevenLabs Text to Dialogue Input Parameters:

stability

This parameter controls the voice stability during dialogue generation. It ranges from 0.0 to 1.0, with a default value of 0.5. Lower values allow for a broader emotional range in the generated voices, making them more expressive and varied, while higher values produce more consistent and stable speech, which might be more suitable for formal or monotonous dialogue.

apply_text_normalization

This parameter determines how text normalization is applied to the input text. It offers three options: "auto," "on," and "off." "Auto" lets the system decide the best approach for normalization, "on" always applies normalization to ensure consistency in text processing, and "off" skips normalization, preserving the original text format.

model

This parameter specifies the model used for dialogue generation. The available option is "eleven_v3," which is optimized for creating high-quality, multi-speaker dialogues. Selecting the appropriate model ensures that the generated audio meets the desired quality and style.

inputs

This parameter is a list of dialogue segments, where each segment includes the text and the corresponding voice ID. It is crucial for defining the structure and content of the dialogue, allowing you to specify different voices for each part of the conversation, thus enhancing the realism and engagement of the audio output.

language_code

This optional parameter allows you to specify the language of the dialogue using ISO-639-1 language codes. Providing a language code ensures that the dialogue is generated in the correct language, which is essential for multilingual projects or when targeting specific language audiences.

seed

This optional parameter is used for deterministic sampling, allowing you to reproduce the same dialogue output across different runs. By setting a specific seed value, you can ensure consistency in the generated audio, which is useful for testing or when a specific output is desired.

output_format

This parameter defines the format of the generated audio output. It is essential for ensuring compatibility with different audio playback systems or further processing tools, allowing you to choose the format that best suits your project's needs.

ElevenLabs Text to Dialogue Output Parameters:

audio

The output of this node is an audio file containing the generated multi-speaker dialogue. This audio output is the culmination of the text-to-dialogue transformation process, providing a ready-to-use audio file that can be integrated into various multimedia projects. The quality and characteristics of the audio depend on the input parameters and settings, ensuring that the final product aligns with your creative vision.

ElevenLabs Text to Dialogue Usage Tips:

Experiment with different stability settings to find the right balance between emotional expressiveness and consistency for your project's needs.
Use the seed parameter to ensure reproducibility of the dialogue output, which is particularly useful for iterative testing or when specific audio results are required.
Consider the language_code parameter to generate dialogues in different languages, expanding the reach and applicability of your audio content.

ElevenLabs Text to Dialogue Common Errors and Solutions:

Invalid voice ID

Explanation: This error occurs when the specified voice ID does not match any available voices in the system.
Solution: Verify that the voice ID provided in the inputs matches one of the available voices. Check for typos or incorrect IDs.

Unsupported language code

Explanation: This error arises when the language code provided is not supported by the model.
Solution: Ensure that the language code is a valid ISO-639-1 code and is supported by the ElevenLabs model. Refer to the documentation for a list of supported languages.

Text normalization mode error

Explanation: This error occurs if an invalid option is selected for text normalization.
Solution: Choose from the available options: "auto," "on," or "off." Ensure that the input matches one of these options exactly.

ElevenLabs Text to Dialogue Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI

Table of Content

Description
ElevenLabsTextToDialogue:
ElevenLabsTextToDialogue Input Parameters:
ElevenLabsTextToDialogue Output Parameters:
ElevenLabsTextToDialogue Usage Tips:
ElevenLabsTextToDialogue Common Errors and Solutions:
Related Nodes

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

Multitalk | Realistic Talking Video Maker

One-click create multi-speaker lip-sync videos from portraits and voices!

Nunchaku Qwen Image | Multi-Image Editor

Blend and style multiple images with next-level control.

Wan 2.2 + Lightx2v V2 | Ultra Fast I2V & T2V

Dual Light LoRA setup, 4X faster.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy