ComfyUI Node: Audio Recorder

Class Name: SparkTTS_AudioRecorder
Category: 🧪AILab/🔊Audio
Author: 1038lab (Account age: 774 days)
Extension: Comfyui-Spark-TTS
Latest Updated: 2025-04-15
GitHub Stars: 0.09K

Table of Contents

Description
SparkTTS_AudioRecorder:
SparkTTS_AudioRecorder Input Parameters:
SparkTTS_AudioRecorder Output Parameters:
SparkTTS_AudioRecorder Usage Tips:
SparkTTS_AudioRecorder Common Errors and Solutions:
Related Nodes

How to Install Comfyui-Spark-TTS

Install this extension via the ComfyUI Manager by searching for Comfyui-Spark-TTS:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter Comfyui-Spark-TTS in the search bar.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Audio Recorder Description

Specialized node for audio recording and processing in ComfyUI, enhancing audio quality with noise reduction and echo cancellation.

Audio Recorder:

SparkTTS_AudioRecorder is a ComfyUI node that records audio and cleans up the recording. It applies spectral gating together with a noise profile to suppress background noise and echo, so the captured audio is clean enough for downstream tasks such as voice cloning or audio synthesis.

Audio Recorder Input Parameters:

recording

This boolean parameter controls whether the node records audio. True starts the recording; False skips it.

recording_duration

This parameter specifies how long to record, in seconds. No explicit minimum or maximum is documented, but the value should be a positive integer.

noise_threshold

The noise threshold parameter determines the sensitivity of the noise reduction process. It is used to set the level at which the noise is considered significant enough to be removed. A higher threshold may result in more aggressive noise reduction, potentially affecting the quality of the desired audio, while a lower threshold may leave some noise in the recording.

smoothing_kernel_size

This parameter defines the size of the kernel used for smoothing the mask applied during the spectral gating process. It affects how smooth the transitions are in the noise reduction process. A larger kernel size results in smoother transitions but may also blur some audio details.
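
To make the roles of noise_threshold and smoothing_kernel_size more concrete, here is a minimal NumPy sketch of spectral gating. It is not the node's actual implementation: the function name spectral_gate, the separate noise_clip input, the frame_len value, and the choice to smooth the mask along the time axis only are all assumptions made for illustration.

```python
import numpy as np

def spectral_gate(signal, noise_clip, noise_threshold=1.5,
                  smoothing_kernel_size=3, frame_len=512):
    """Illustrative spectral gate (hypothetical helper, not the node's code).

    signal and noise_clip are 1-D float arrays; noise_clip must hold at
    least frame_len samples of background noise only.
    """
    hop = frame_len // 2
    window = np.hanning(frame_len + 1)[:-1]  # periodic Hann window

    def stft(x):
        n_frames = 1 + (len(x) - frame_len) // hop
        frames = np.stack([x[i * hop:i * hop + frame_len] * window
                           for i in range(n_frames)])
        return np.fft.rfft(frames, axis=1)  # shape: (frames, frequency bins)

    # Noise profile: mean magnitude per frequency bin of the noise-only clip.
    noise_profile = np.abs(stft(noise_clip)).mean(axis=0)

    # Zero-pad so every original sample is covered by two overlapping frames.
    padded = np.pad(signal, (frame_len, frame_len + hop))
    spec = stft(padded)

    # Keep a bin only if it rises above noise_threshold times the noise level.
    mask = (np.abs(spec) > noise_threshold * noise_profile).astype(float)

    # Smooth the binary mask along time with a moving average so the gate
    # opens and closes gradually instead of switching abruptly.
    kernel = np.ones(smoothing_kernel_size) / smoothing_kernel_size
    mask = np.apply_along_axis(
        lambda col: np.convolve(col, kernel, mode="same"), 0, mask)

    # Weighted overlap-add resynthesis of the gated spectrum.
    frames = np.fft.irfft(spec * mask, n=frame_len, axis=1) * window
    out = np.zeros(len(padded))
    norm = np.zeros(len(padded))
    for i, frame in enumerate(frames):
        out[i * hop:i * hop + frame_len] += frame
        norm[i * hop:i * hop + frame_len] += window ** 2
    out /= np.maximum(norm, 1e-8)
    return out[frame_len:frame_len + len(signal)]
```

In this sketch, raising noise_threshold closes the gate on more bins and removes more noise at the risk of cutting quiet parts of the voice. Raising smoothing_kernel_size spreads each gate decision over more neighboring frames.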

sample_rate

The sample rate parameter specifies the number of samples per second used in the audio recording. It is an integer value that affects the quality and size of the audio file. Common sample rates include 44100 Hz for CD-quality audio. The sample rate should be chosen based on the desired audio quality and the capabilities of the recording device.
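
As a rough illustration of how sample_rate and recording_duration together determine the amount of data captured, the sketch below computes the sample count and the uncompressed size. The helper name recording_size and the defaults of one channel and 16-bit samples are assumptions; the node's internal format is not documented here.

```python
def recording_size(recording_duration, sample_rate, channels=1, bytes_per_sample=2):
    """Return (sample_count, size_in_bytes) for uncompressed PCM audio.

    channels=1 and bytes_per_sample=2 (16-bit mono) are assumed defaults.
    """
    samples = recording_duration * sample_rate
    return samples, samples * channels * bytes_per_sample

print(recording_size(5, 44100))  # (220500, 441000)
print(recording_size(5, 16000))  # (80000, 160000)
```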

Audio Recorder Output Parameters:

final_audio

The final_audio output is a dictionary containing the processed audio waveform and its sample rate. The waveform is a tensor representing the audio data, and the sample rate is an integer indicating the number of samples per second. This output is crucial as it provides the cleaned and enhanced audio ready for further use or analysis.
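
A downstream consumer might read the dictionary along these lines. This is a sketch under assumptions: the key names "waveform" and "sample_rate" and the [batch, channels, samples] layout follow the convention commonly used for ComfyUI audio data and are not confirmed for this node, and a NumPy array stands in for the tensor.

```python
import numpy as np

def describe_audio(final_audio):
    """Summarize an audio dictionary (key names are assumed, see lead-in)."""
    waveform = final_audio["waveform"]
    sample_rate = final_audio["sample_rate"]
    num_samples = waveform.shape[-1]  # samples are assumed to be the last axis
    return {
        "shape": tuple(waveform.shape),
        "sample_rate": sample_rate,
        "duration_s": num_samples / sample_rate,
    }

# Stand-in for the node output: 2 seconds of silence at 16000 Hz.
example = {"waveform": np.zeros((1, 1, 32000), dtype=np.float32),
           "sample_rate": 16000}
info = describe_audio(example)
print(info)
```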

Audio Recorder Usage Tips:

Ensure that the recording environment is as quiet as possible to minimize the initial noise level, which will improve the effectiveness of the noise reduction process.
Adjust the noise_threshold parameter carefully to balance between noise reduction and audio quality. Test different values to find the optimal setting for your specific recording conditions.
Use a sample rate that matches the quality requirements of your project. Higher sample rates provide better audio quality but result in larger file sizes.

Audio Recorder Common Errors and Solutions:

Audio processing failed: `<error_message>`

Explanation: This error occurs when there is an issue during the audio processing phase, which could be due to incorrect parameter values or unexpected input data.
Solution: Check all input parameters for correctness, ensure that the recording environment is suitable, and verify that the sample rate is supported by your hardware. If the problem persists, review the error message for specific details and adjust the parameters accordingly.
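
One way to check the input parameters before running the node is a small validation helper such as the sketch below. The function validate_recorder_params is hypothetical, and the rules it enforces (positive integers for duration, kernel size, and sample rate; a non-negative number for the threshold) are assumptions, since the node's accepted ranges are not documented.

```python
def validate_recorder_params(recording_duration, noise_threshold,
                             smoothing_kernel_size, sample_rate):
    """Return a list of human-readable problems; an empty list means OK."""

    def is_positive_int(value):
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, int) and not isinstance(value, bool) and value > 0

    problems = []
    if not is_positive_int(recording_duration):
        problems.append("recording_duration must be a positive integer (seconds)")
    if (isinstance(noise_threshold, bool)
            or not isinstance(noise_threshold, (int, float))
            or noise_threshold < 0):
        problems.append("noise_threshold must be a non-negative number")
    if not is_positive_int(smoothing_kernel_size):
        problems.append("smoothing_kernel_size must be a positive integer")
    if not is_positive_int(sample_rate):
        problems.append("sample_rate must be a positive integer (Hz)")
    return problems

print(validate_recorder_params(5, 0.1, 3, 16000))   # []
print(validate_recorder_params(0, 0.1, 3, -1))
```

This only catches malformed values. Whether a given sample rate is supported still depends on the recording hardware.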

Audio Recorder Related Nodes

Go back to the extension to check out more related nodes.

Comfyui-Spark-TTS
