RunComfy

Flux Klein Face Swap | Realistic AI Face Editor

Swap faces perfectly. Natural, lifelike, and fast AI-powered editing.

FLUX LoRA Training

Guide you through the entire process of training FLUX LoRA models using your custom datasets.

SAM 3 | Advanced Object Segmentation Tool

Next-gen segmentation tool for precise object masking and tracking.

Dance Video Transform | Scene Customization & Face Swap

Transform dance videos with scene editing, face-swapping, and motion preservation.

ComfyUI > Nodes > ComfyUI-IndexTTS2 > IndexTTS2 Save Audio

ComfyUI Node: IndexTTS2 Save Audio

Class Name

IndexTTS2SaveAudio

Category
Audio/IndexTTS

Author
snicolast (Account age: 2913days) Extension
ComfyUI-IndexTTS2 Latest Updated
2025-10-13 Github Stars
0.14K

Github Ask snicolast Current Questions Past Questions

Table of Content

Description
IndexTTS2SaveAudio:
IndexTTS2SaveAudio Input Parameters:
IndexTTS2SaveAudio Output Parameters:
IndexTTS2SaveAudio Usage Tips:
IndexTTS2SaveAudio Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-IndexTTS2

Install this extension via the ComfyUI Manager by searching for ComfyUI-IndexTTS2

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-IndexTTS2 in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

IndexTTS2 Save Audio Description

Efficiently save TTS-generated audio with format options and quality control for AI projects.

IndexTTS2 Save Audio:

The IndexTTS2SaveAudio node is designed to facilitate the process of saving audio data generated by text-to-speech (TTS) systems. This node is particularly useful for AI artists and developers working with audio synthesis, as it provides a streamlined method to convert audio waveforms into various file formats, such as WAV or MP3, while ensuring the audio quality is maintained through optional normalization. The node's primary function is to handle audio data efficiently, allowing users to specify parameters like file format and normalization options, which can significantly impact the final audio output's quality and size. By leveraging this node, you can easily manage and store audio outputs from TTS systems, making it an essential tool for projects that require high-quality audio synthesis and storage.

IndexTTS2 Save Audio Input Parameters:

audio

The audio parameter represents the audio data to be saved. It is expected to be a dictionary containing the waveform and sample rate. The waveform should be a NumPy array or a PyTorch tensor, and the sample rate should be an integer. This parameter is crucial as it forms the basis of the audio file that will be saved. The waveform data should be in a format that the node can process, typically with a shape of (B, C, N), where B is the batch size, C is the number of channels, and N is the number of samples.

name

The name parameter specifies the prefix for the saved audio file's name. It is a string that allows you to identify the audio file easily. If not provided, a default prefix "tts2" is used. This parameter helps in organizing and managing multiple audio files by providing a meaningful naming convention.

format

The format parameter determines the file format in which the audio will be saved. Common options include "wav" and "mp3". This parameter is important as it affects the compatibility and quality of the audio file. Choosing the right format depends on the intended use of the audio file, with WAV being uncompressed and high-quality, while MP3 offers compression and smaller file sizes.

normalize_peak

The normalize_peak parameter is a boolean that indicates whether the audio should be peak-normalized. When set to true, the audio waveform is adjusted to ensure that its peak amplitude is at a specified level, typically just below the maximum possible value. This helps in preventing clipping and maintaining consistent audio levels across different files.

wav_pcm

The wav_pcm parameter specifies the PCM format for WAV files, with "pcm16" being a common option. This parameter is relevant when saving audio in WAV format, as it determines the bit depth of the audio file, affecting both its quality and size. PCM16 is a standard choice for high-quality audio.

mp3_bitrate

The mp3_bitrate parameter sets the bitrate for MP3 files, with "320k" being a typical value for high-quality audio. This parameter is crucial for balancing audio quality and file size when saving in MP3 format. Higher bitrates result in better audio quality but larger file sizes.

IndexTTS2 Save Audio Output Parameters:

paths

The paths parameter is a list of strings representing the file paths where the audio files have been saved. This output is important as it provides you with the exact locations of the saved audio files, allowing for easy access and further processing or sharing of the audio data.

IndexTTS2 Save Audio Usage Tips:

Ensure that the audio parameter contains valid waveform and sample rate data to avoid errors during processing.
Use the normalize_peak option to maintain consistent audio levels across different files, especially when dealing with varying input audio sources.
Choose the format parameter based on your needs for audio quality and file size; WAV for high quality and MP3 for smaller file sizes.

IndexTTS2 Save Audio Common Errors and Solutions:

ValueError: AUDIO input must be shaped (B, C, N)

Explanation: This error occurs when the input audio waveform does not have the expected shape of (B, C, N).
Solution: Ensure that your audio data is correctly formatted with the appropriate batch size, channels, and number of samples.

ValueError: Invalid AUDIO input. Expected (sample_rate:int, numpy_array)

Explanation: This error indicates that the audio input does not contain a valid sample rate and waveform data.
Solution: Verify that the audio parameter is a dictionary with a valid sample rate and waveform data, and that the waveform is a NumPy array or PyTorch tensor.

IndexTTS2 Save Audio Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-IndexTTS2

Table of Content

Description
IndexTTS2SaveAudio:
IndexTTS2SaveAudio Input Parameters:
IndexTTS2SaveAudio Output Parameters:
IndexTTS2SaveAudio Usage Tips:
IndexTTS2SaveAudio Common Errors and Solutions:
Related Nodes

FLUX Kontext Face Swap | Seamless Face Replacement

Photoreal face replacement with prompt-guided control and natural blending

Hunyuan Video 1.5 | Fast AI Video Generator

Turn text or images into smooth 1080p videos quickly and easily.

Qwen Image Edit Plus 2509 LoRA Inference | AI Toolkit ComfyUI

Apply AI Toolkit-trained Qwen Image Edit Plus 2509 LoRAs in ComfyUI with preview-aligned edits using a single RCQwenImageEditPlus custom node.

Wan 2.1 Fun | I2V + T2V

Empower your AI videos with Wan 2.1 Fun.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: IndexTTS2 Save Audio

IndexTTS2SaveAudio

How to Install ComfyUI-IndexTTS2

IndexTTS2 Save Audio Description

IndexTTS2 Save Audio:

IndexTTS2 Save Audio Input Parameters:

audio

name

format

normalize_peak

wav_pcm

mp3_bitrate

IndexTTS2 Save Audio Output Parameters:

paths

IndexTTS2 Save Audio Usage Tips:

IndexTTS2 Save Audio Common Errors and Solutions:

ValueError: AUDIO input must be shaped (B, C, N)

ValueError: Invalid AUDIO input. Expected (sample_rate:int, numpy_array)

IndexTTS2 Save Audio Related Nodes