ComfyUI > Nodes > ComfyUI > ElevenLabs Instant Voice Clone

ComfyUI Node: ElevenLabs Instant Voice Clone

Class Name

ElevenLabsInstantVoiceClone

Category
api node/audio/ElevenLabs
Author
ComfyAnonymous (Account age: 763days)
Extension
ComfyUI
Latest Updated
2026-05-13
Github Stars
112.77K

How to Install ComfyUI

Install this extension via the ComfyUI Manager by searching for ComfyUI
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ElevenLabs Instant Voice Clone Description

Transform source audio into target voice for instant voice cloning with high-quality, natural results.

ElevenLabs Instant Voice Clone:

The ElevenLabsInstantVoiceClone node is designed to transform a source audio clip into a target voice, allowing you to clone voices instantly. This node is particularly useful for applications where you want to maintain the original content and emotion of the audio while changing the speaker's voice. It leverages advanced speech-to-speech transformation models to ensure high-quality voice cloning, making it an essential tool for AI artists and developers working on projects that require voice modification. By using this node, you can achieve seamless voice transformations that sound natural and consistent, enhancing the overall auditory experience of your projects.

ElevenLabs Instant Voice Clone Input Parameters:

voice

The voice parameter specifies the target voice for the transformation. It is crucial for determining which voice the source audio will be transformed into. This parameter should be connected from either the Voice Selector or Instant Voice Clone nodes. The choice of voice can significantly impact the final output, as it dictates the characteristics and qualities of the transformed audio.

audio

The audio parameter is the source audio that you wish to transform. This input is essential as it provides the original content and emotion that will be preserved during the transformation process. The quality and clarity of the source audio can affect the final output, so it is recommended to use high-quality audio files for the best results.

stability

The stability parameter controls the voice stability during the transformation process. It ranges from 0.0 to 1.0, with a default value of 0.5. Lower values allow for a broader emotional range in the transformed voice, making it more expressive and varied. In contrast, higher values produce more consistent speech, which can sometimes result in a monotonous tone. Adjusting this parameter allows you to fine-tune the emotional expression of the transformed voice to suit your project's needs.

model

The model parameter allows you to select the speech-to-speech transformation model to use. Available options include eleven_multilingual_sts_v2 and eleven_english_sts_v2. This choice determines the underlying technology used for the transformation, which can affect the quality and characteristics of the output. Selecting the appropriate model based on the language and specific requirements of your project can enhance the effectiveness of the voice cloning process.

ElevenLabs Instant Voice Clone Output Parameters:

transformed_audio

The transformed_audio parameter is the output of the node, representing the audio that has been transformed into the target voice. This output retains the original content and emotion of the source audio while adopting the characteristics of the selected target voice. The quality of the transformed audio is influenced by the input parameters, such as the choice of voice, stability, and model, making it essential to configure these settings appropriately for optimal results.

ElevenLabs Instant Voice Clone Usage Tips:

  • Experiment with different stability values to achieve the desired emotional expression in the transformed voice. Lower values can add more expressiveness, while higher values ensure consistency.
  • Choose the appropriate model based on the language and specific requirements of your project to enhance the quality of the voice transformation.
  • Ensure that the source audio is of high quality to achieve the best possible results in the transformed output.

ElevenLabs Instant Voice Clone Common Errors and Solutions:

Unknown voice: <voice_name>

  • Explanation: This error occurs when the specified voice is not recognized by the system, possibly due to a typo or an unsupported voice selection.
  • Solution: Verify that the voice name is correctly spelled and is available in the predefined ElevenLabs voices. Use the Voice Selector node to ensure the correct voice is chosen.

Invalid audio input

  • Explanation: This error indicates that the provided audio input is not valid, which could be due to an unsupported file format or corrupted audio data.
  • Solution: Check the audio file format and ensure it is supported by the node. Use a different audio file if necessary and ensure the file is not corrupted.

Model selection error

  • Explanation: This error arises when an invalid model is selected, which may not be compatible with the current configuration or input parameters.
  • Solution: Double-check the model selection and ensure it matches the requirements of your project. Use one of the available models: eleven_multilingual_sts_v2 or eleven_english_sts_v2.

ElevenLabs Instant Voice Clone Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

ElevenLabs Instant Voice Clone