ComfyUI > Nodes > ComfyUI_ParlerTTS > ParlerTTS_Sampler

ComfyUI Node: ParlerTTS_Sampler

Class Name

ParlerTTS_Sampler

Category
Parler_TTS
Author
smthemex (Account age: 1019days)
Extension
ComfyUI_ParlerTTS
Latest Updated
2025-07-02
Github Stars
0.05K

How to Install ComfyUI_ParlerTTS

Install this extension via the ComfyUI Manager by searching for ComfyUI_ParlerTTS
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_ParlerTTS in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ParlerTTS_Sampler Description

Specialized node for converting text to speech using ParlerTTS model, ensuring high-quality audio outputs.

ParlerTTS_Sampler:

The ParlerTTS_Sampler is a specialized node designed to facilitate the conversion of text into speech using the ParlerTTS model. This node is integral for generating high-quality audio outputs from textual inputs, leveraging advanced text-to-speech synthesis techniques. It is particularly beneficial for applications requiring natural and expressive speech generation, such as virtual assistants, audiobooks, and interactive voice response systems. The node operates by decoding audio codes into audio values, ensuring that the resulting speech is both coherent and natural-sounding. Its primary goal is to provide users with an efficient and reliable method for producing speech from text, enhancing the accessibility and engagement of digital content.

ParlerTTS_Sampler Input Parameters:

decode_sequentially

This parameter determines whether the audio decoding process should be performed sequentially or not. When set to True, the node processes audio codes one at a time, which can be beneficial for managing memory usage and ensuring smooth audio playback in real-time applications. Conversely, setting it to False allows for batch processing, which can speed up the decoding process but may require more memory. The choice between sequential and batch processing can significantly impact the node's performance and the quality of the audio output, depending on the specific use case and system capabilities.

use_4dim_audio_codes

This parameter indicates whether the node should utilize four-dimensional audio codes during the decoding process. Enabling this option can enhance the granularity and detail of the audio output, potentially leading to more nuanced and expressive speech synthesis. However, it may also increase the computational complexity and resource requirements of the node. Users should consider the trade-offs between audio quality and system performance when configuring this parameter, especially in resource-constrained environments.

use_audio_scales

This parameter allows the node to apply specific audio scales during the decoding process, which can be used to adjust the pitch, speed, or other characteristics of the generated speech. By fine-tuning these scales, users can customize the audio output to better suit their needs, whether for artistic expression or to match specific voice characteristics. This flexibility makes the node a powerful tool for creating personalized and dynamic speech outputs.

ParlerTTS_Sampler Output Parameters:

audio_values

The audio_values output parameter represents the final audio waveform generated by the node. This array of floating-point numbers corresponds to the sound wave of the synthesized speech, which can be played back using standard audio playback tools. The quality and characteristics of the audio output are influenced by the input parameters and the underlying model's capabilities. Understanding and interpreting these audio values is crucial for evaluating the effectiveness of the text-to-speech conversion and making any necessary adjustments to the input parameters.

ParlerTTS_Sampler Usage Tips:

  • To optimize performance, consider using batch processing by setting decode_sequentially to False when working with large datasets or when real-time processing is not required.
  • Experiment with use_audio_scales to achieve the desired voice characteristics, such as adjusting the pitch or speed of the generated speech to better fit your application's needs.
  • If memory usage is a concern, especially on devices with limited resources, enable decode_sequentially to process audio codes one at a time, reducing the overall memory footprint.

ParlerTTS_Sampler Common Errors and Solutions:

"Audio codes exceed codebook size"

  • Explanation: This error occurs when the audio codes provided to the node exceed the maximum size defined by the audio encoder's codebook.
  • Solution: Ensure that the input audio codes are within the valid range specified by the encoder's configuration. Adjust the input data or modify the encoder settings to accommodate larger audio codes if necessary.

"Invalid audio scales provided"

  • Explanation: This error indicates that the audio scales specified in the input parameters are not compatible with the node's requirements.
  • Solution: Verify that the audio scales are correctly formatted and match the expected input structure. Refer to the documentation for guidance on configuring audio scales appropriately.

ParlerTTS_Sampler Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI_ParlerTTS
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.