Visit ComfyUI Online for ready-to-use ComfyUI environment
Integrates audio with muted video for synchronized multimedia experience, beneficial for AI artists, auto-syncs audio duration to video, offers flexibility for perfect alignment without manual editing.
The AudioXVideoAudioCombiner node is designed to seamlessly integrate audio with a muted video, creating a synchronized multimedia experience. This node is particularly beneficial for AI artists who wish to enhance their visual content with audio that matches the video's duration and mood. By automatically syncing the audio duration to the video, it ensures that the final output is cohesive and professional. The node also offers flexibility by allowing users to loop audio if it is shorter than the video, or pad it with silence, ensuring that the audio perfectly aligns with the video length. This functionality is crucial for creating engaging and immersive video content without the need for extensive manual editing.
The video parameter accepts a batch of images in the ComfyUI video format, representing the video content to which audio will be added. This input is crucial as it determines the visual component of the final output. The video is expected to be a torch.Tensor, and any deviation from this format will result in an error. The number of frames in the video will influence the duration of the final output, as the node aims to synchronize the audio to match this duration.
The audio parameter requires a dictionary containing the audio waveform and sample rate, which represents the audio content to be combined with the video. This input is essential for providing the sound that will accompany the video. The audio waveform should be a torch.Tensor, and the dictionary must include keys for 'waveform' and 'sample_rate'. The duration of the audio will be adjusted to match the video's duration, either by looping or padding, depending on the optional parameters.
The sync_duration parameter is a boolean option that, when set to True, automatically synchronizes the audio duration to match the video duration. This ensures that the audio and video are perfectly aligned, providing a seamless viewing experience. The default value is True, and it is recommended to keep this setting enabled unless specific desynchronization is desired.
The loop_audio parameter is a boolean option that, when enabled, allows the audio to loop if it is shorter than the video. This is useful for maintaining audio presence throughout the entire video duration without abrupt stops. The default value is False, meaning that by default, the audio will not loop and will instead be padded with silence if it is shorter than the video.
The video output parameter returns the original video content, now combined with the synchronized audio. This output is in the same format as the input video, ensuring compatibility with further processing or playback.
The synced_audio output parameter provides the audio that has been adjusted to match the video's duration. This audio is either looped or padded with silence, depending on the input parameters, ensuring that it aligns perfectly with the video.
The info output parameter is a string that contains information about the processing that was performed, such as whether the audio was looped or padded. This can be useful for debugging or understanding the adjustments made during the combination process.
torch.Tensor to avoid input format errors.sync_duration parameter to automatically align audio and video durations for a seamless output.loop_audio if your audio is shorter than the video and you want continuous sound throughout the video.torch.Tensor format.torch.Tensor before passing it to the node.torch.Tensor.RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.