Visit ComfyUI Online for ready-to-use ComfyUI environment
Enhances speech audio quality by reducing noise using high sample rate processing and adaptive VAD-driven denoising.
The Egregora_RNNoise_Denoise node is designed to enhance audio quality by reducing noise, specifically focusing on speech. It operates at a high sample rate of 48 kHz, processing audio in 10 ms chunks, which equates to 480 samples per chunk. This node can handle both mono and stereo audio, either processing each channel separately or downmixing to mono for a unified approach. It leverages the pyrnnoise library, version 0.3.x or higher, utilizing the denoise_chunk API to effectively reduce noise. The node incorporates a static strength parameter, an adaptive mix driven by per-frame Voice Activity Detection (VAD), and a post-gain feature with a ceiling to ensure optimal audio output. This combination of features makes it particularly effective for improving the clarity of speech in audio recordings, making it a valuable tool for AI artists working with audio content.
This parameter represents the input audio data that the node will process. It should be a NumPy array containing audio samples at a 48 kHz sample rate. The audio data is expected to be in a normalized floating-point format, where the values range between -1.0 and 1.0. The node will convert these values to 16-bit integers for processing. The quality of the input audio can significantly impact the effectiveness of the denoising process, so it is recommended to provide clean and well-recorded audio samples for optimal results.
This output parameter provides the Voice Activity Detection (VAD) probabilities for each frame of the processed audio. It is a NumPy array of floating-point values, where each value represents the likelihood of speech presence in the corresponding audio frame. These probabilities can be used to understand the speech dynamics within the audio, offering insights into which parts of the audio contain speech and which are likely to be noise. This information can be particularly useful for further audio processing or analysis tasks.
pyrnnoise library is not installed in your environment, which is necessary for the node to function.pyrnnoise library by running pip install pyrnnoise in your command line or terminal.pyrnnoise does not support the denoise_chunk API, which is required by the node.pyrnnoise version 0.3.x or higher installed. You can upgrade the library by running pip install --upgrade pyrnnoise.numpy and scipy to handle audio data conversion if necessary.RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.