(SP) Normalizer:
The SignalProcessingNormalizer node is designed to enhance audio processing by applying various normalization techniques to audio signals. Its primary purpose is to adjust the loudness of audio to a desired level, ensuring consistency and clarity across different audio tracks. This node is particularly beneficial for audio projects where maintaining a uniform loudness level is crucial, such as in music production, podcasting, or broadcasting. By offering multiple normalization modes, including RMS, LUFS, peak, and automatic gain control, it provides flexibility to achieve the desired audio quality. Each mode targets specific aspects of the audio signal, allowing you to tailor the normalization process to your specific needs. This node is an essential tool for anyone looking to improve the audio quality and maintain a professional sound standard in their projects.
(SP) Normalizer Input Parameters:
audio_input
This parameter represents the audio data that you want to process. It is expected to be in a format compatible with the node's processing functions, typically a dictionary containing a torch.Tensor for the waveform and an integer for the sample rate. The audio input serves as the foundation for the normalization process, and its quality and format can significantly impact the results.
mode
The mode parameter determines the normalization technique to be applied to the audio signal. It offers four options: "rms", "lufs", "peak", and "auto". Each mode targets a different aspect of audio normalization. "RMS" focuses on the root mean square level, "LUFS" targets loudness units relative to full scale, "peak" adjusts the maximum amplitude, and "auto" applies automatic gain control. Choosing the right mode is crucial for achieving the desired audio quality.
target_rms
This parameter specifies the target RMS (Root Mean Square) level for normalization when the "rms" mode is selected. It is a float value with a default of 0.1, a minimum of 0, and a maximum of 10.0, adjustable in steps of 0.1. The RMS level is a measure of the average power of the audio signal, and setting an appropriate target can help achieve a balanced and consistent sound.
target_lufs_db
The target_lufs_db parameter sets the target LUFS (Loudness Units Full Scale) level for normalization in "lufs" mode. It is a float value with a default of -14.0, ranging from -100 to 100.0, with a step size of 0.1. LUFS is a standard for measuring perceived loudness, and adjusting this parameter helps ensure that the audio meets industry loudness standards.
target_peak
This parameter defines the target peak level for normalization when using the "peak" mode. It is a float value with a default of 0.9, a minimum of 0.0, and a maximum of 1.0, adjustable in steps of 0.1. The peak level represents the maximum amplitude of the audio signal, and setting an appropriate target can prevent distortion and clipping.
target_auto
The target_auto parameter is used in "auto" mode to set the target level for automatic gain control. It is a float value with a default of 0.7, a minimum of 0.0, and a maximum of 1.0, adjustable in steps of 0.1. This parameter helps maintain a consistent volume level by automatically adjusting the gain based on the audio signal's characteristics.
target_auto_alpha
This parameter is used in conjunction with target_auto to fine-tune the automatic gain control process. It is a float value with a default of 0.1, a minimum of 0.0, and a maximum of 10.0, adjustable in steps of 0.1. The target_auto_alpha parameter influences the responsiveness of the gain control, allowing for smoother or more aggressive adjustments.
(SP) Normalizer Output Parameters:
processed_audio
The processed_audio output parameter provides the normalized audio signal as a result of the node's processing. This output is crucial for evaluating the effectiveness of the normalization process and ensuring that the audio meets the desired loudness and quality standards. The processed audio can be used in further audio processing or directly in your projects.
(SP) Normalizer Usage Tips:
- Experiment with different normalization modes to find the one that best suits your audio project's needs. For example, use "lufs" mode for broadcast standards and "peak" mode to prevent clipping.
- Adjust the target parameters incrementally to fine-tune the audio output, ensuring that the changes align with your desired audio quality and loudness levels.
(SP) Normalizer Common Errors and Solutions:
Output file <output_filepath> was not created.
- Explanation: This error indicates that the processed audio file was not successfully saved to the specified location.
- Solution: Ensure that the output directory exists and that you have write permissions. Check the file path for any typos or incorrect directory structures.
Processed audio output is None for mode <mode>.
- Explanation: This error suggests that the audio processing did not produce a valid output for the specified mode.
- Solution: Verify that the input audio data is correctly formatted and that the mode parameter is set to a valid option. Ensure that all target parameters are within their specified ranges.
