RunComfy

ComfyUI Node: ElevenLabs Instant Voice Clone

Class Name: ElevenLabsInstantVoiceClone
Category: api node/audio/ElevenLabs
Author: ComfyAnonymous (Account age: 763 days)
Extension: ComfyUI
Latest Updated: 2026-05-13
Github Stars: 112.77K

Table of Contents

Description
ElevenLabsInstantVoiceClone:
ElevenLabsInstantVoiceClone Input Parameters:
ElevenLabsInstantVoiceClone Output Parameters:
ElevenLabsInstantVoiceClone Usage Tips:
ElevenLabsInstantVoiceClone Common Errors and Solutions:
Related Nodes

How to Install ComfyUI

Install this extension through the ComfyUI Manager by searching for ComfyUI:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI in the search bar.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

ElevenLabs Instant Voice Clone Description

Transform source audio into a target voice for instant voice cloning, keeping the result natural-sounding.

ElevenLabs Instant Voice Clone:

The ElevenLabsInstantVoiceClone node transforms a source audio clip into a target voice, so you can clone a voice instantly. It is most useful when you want to keep the original content and emotion of a recording while changing the speaker's voice. The node relies on speech-to-speech transformation models, which makes it a practical tool for AI artists and developers whose projects need voice modification with natural, consistent-sounding results.

ElevenLabs Instant Voice Clone Input Parameters:

voice

The voice parameter specifies the target voice that the source audio is transformed into. Connect it from either the Voice Selector or the Instant Voice Clone node. The chosen voice dictates the characteristics of the transformed audio, so it has a large effect on the final output.

audio

The audio parameter is the source audio to transform. It supplies the content and emotion that are preserved during the transformation. The quality and clarity of the source audio affect the output, so use high-quality audio files for the best results.

stability

The stability parameter controls voice stability during the transformation. It ranges from 0.0 to 1.0, with a default value of 0.5. Lower values allow a broader emotional range, making the transformed voice more expressive and varied. Higher values produce more consistent speech, which can sometimes sound monotonous. Adjust this parameter to match the emotional expression your project needs.

model

The model parameter selects the speech-to-speech transformation model. The available options are eleven_multilingual_sts_v2 and eleven_english_sts_v2. The choice affects the quality and characteristics of the output, so pick the model that fits the language and requirements of your project.
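As a rough illustration of how the four inputs above fit together, the sketch below builds one node entry in the layout ComfyUI uses for API-format workflows: a `class_type` plus an `inputs` mapping, where a link to another node is written as `[node_id, output_index]`. The helper name `build_voice_clone_node` and the upstream node ids `"1"` and `"2"` are hypothetical, and the range and model checks only mirror the values documented above; they are not taken from the node's source code.

```python
import json

# Model names as listed in this document.
STS_MODELS = ("eleven_multilingual_sts_v2", "eleven_english_sts_v2")


def build_voice_clone_node(voice_link, audio_link, model, stability=0.5):
    """Return an API-format node entry (hypothetical helper, not part of ComfyUI)."""
    if not 0.0 <= stability <= 1.0:
        raise ValueError(f"stability must be within 0.0 and 1.0, got {stability}")
    if model not in STS_MODELS:
        raise ValueError(f"model must be one of {STS_MODELS}, got {model!r}")
    return {
        "class_type": "ElevenLabsInstantVoiceClone",
        "inputs": {
            "voice": list(voice_link),  # [upstream node id, output index]
            "audio": list(audio_link),  # [upstream node id, output index]
            "stability": stability,
            "model": model,
        },
    }


if __name__ == "__main__":
    # "1" and "2" are placeholder ids for a voice node and an audio loader.
    node = build_voice_clone_node(["1", 0], ["2", 0], "eleven_multilingual_sts_v2")
    print(json.dumps({"3": node}, indent=2))
```

Rejecting out-of-range values before the workflow is queued is a design choice of this sketch; it surfaces a typo in `stability` or `model` locally instead of after a remote call.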

ElevenLabs Instant Voice Clone Output Parameters:

transformed_audio

The transformed_audio parameter is the node's output: the source audio transformed into the target voice. It retains the original content and emotion of the source audio while adopting the characteristics of the selected voice. Its quality depends on the input parameters (voice, stability, and model), so configure those carefully.

ElevenLabs Instant Voice Clone Usage Tips:

Experiment with different stability values to achieve the desired emotional expression in the transformed voice. Lower values can add more expressiveness, while higher values ensure consistency.
Choose the appropriate model based on the language and specific requirements of your project to enhance the quality of the voice transformation.
Ensure that the source audio is of high quality to achieve the best possible results in the transformed output.
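One way to act on the first tip is to try a handful of evenly spaced stability values and compare the results by ear. The sketch below only generates the values and a short reminder of the documented tendency for each; both function names are hypothetical, and treating the default of 0.5 as the dividing line between "expressive" and "consistent" is an assumption made for illustration.

```python
def stability_sweep(steps=5):
    """Return `steps` evenly spaced stability values from 0.0 to 1.0 inclusive."""
    if steps < 2:
        raise ValueError("steps must be at least 2 to cover both ends of the range")
    return [round(i / (steps - 1), 3) for i in range(steps)]


def describe(stability):
    """Summarize the documented tendency for a stability value (0.5 is the default)."""
    if stability < 0.5:
        return "more expressive, broader emotional range"
    if stability > 0.5:
        return "more consistent, may sound monotonous"
    return "default balance"


if __name__ == "__main__":
    for value in stability_sweep():
        print(f"stability={value:.2f}: {describe(value)}")
```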

ElevenLabs Instant Voice Clone Common Errors and Solutions:

Unknown voice: `<voice_name>`

Explanation: This error occurs when the specified voice is not recognized by the system, possibly due to a typo or an unsupported voice selection.
Solution: Verify that the voice name is correctly spelled and is available in the predefined ElevenLabs voices. Use the Voice Selector node to ensure the correct voice is chosen.

Invalid audio input

Explanation: This error indicates that the provided audio input is not valid, which could be due to an unsupported file format or corrupted audio data.
Solution: Check the audio file format and ensure it is supported by the node. Use a different audio file if necessary and ensure the file is not corrupted.

Model selection error

Explanation: This error arises when an invalid model is selected, which may not be compatible with the current configuration or input parameters.
Solution: Double-check the model selection and ensure it matches the requirements of your project. Use one of the available models: eleven_multilingual_sts_v2 or eleven_english_sts_v2.
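The three errors above can often be caught before the node runs. The following is a minimal pre-flight sketch under stated assumptions: `preflight` is a hypothetical helper, the set of known voices is supplied by the caller, and the audio check only confirms that the file exists and is not empty, because this page does not list the supported formats. The returned strings reuse the error names from this section for readability; they are not guaranteed to match the node's exact messages.

```python
import os
import tempfile

# Model names as listed in this document.
STS_MODELS = ("eleven_multilingual_sts_v2", "eleven_english_sts_v2")


def preflight(voice, known_voices, audio_path, model):
    """Collect likely problems before running the node (hypothetical helper)."""
    problems = []
    if voice not in known_voices:
        problems.append(f"Unknown voice: {voice}")
    # Only existence and size are checked; supported formats are not listed here.
    if not os.path.isfile(audio_path) or os.path.getsize(audio_path) == 0:
        problems.append("Invalid audio input")
    if model not in STS_MODELS:
        problems.append("Model selection error")
    return problems


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        clip = os.path.join(tmp, "clip.wav")
        with open(clip, "wb") as handle:
            handle.write(b"\x00" * 16)  # placeholder bytes, not real audio
        # "example_voice" is a made-up name used only for this demonstration.
        print(preflight("example_voice", {"example_voice"}, clip, "eleven_english_sts_v2"))
        print(preflight("typo_voice", {"example_voice"}, clip, "unknown_model"))
```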

ElevenLabs Instant Voice Clone Related Nodes

Go back to the extension to check out more related nodes.
