RunComfy

ComfyUI Node: AudioX Video Audio Combiner

Class Name

AudioXVideoAudioCombiner

Category
AudioX/Utils

Author
lum3on (Account age: 314days) Extension
ComfyUI-AudioX Latest Updated
2025-06-24 Github Stars
0.04K

Table of Contents

Description
AudioXVideoAudioCombiner:
AudioXVideoAudioCombiner Input Parameters:
AudioXVideoAudioCombiner Output Parameters:
AudioXVideoAudioCombiner Usage Tips:
AudioXVideoAudioCombiner Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-AudioX

Install this extension via the ComfyUI Manager by searching for ComfyUI-AudioX:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI-AudioX in the search bar.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

AudioX Video Audio Combiner Description

Combines an audio track with a muted video. The node can automatically sync the audio duration to the video, looping the audio or padding it with silence so that the two align without manual editing.

AudioX Video Audio Combiner:

The AudioXVideoAudioCombiner node combines audio with a muted video so that the two play in sync. It is aimed at AI artists who want to add audio that matches a video's duration and mood. By default, the node syncs the audio duration to the video duration. When the audio is shorter than the video, the node can either loop the audio or pad it with silence so that the audio covers the full video length, which removes the need for manual editing.

AudioX Video Audio Combiner Input Parameters:

video

The video parameter accepts a batch of images in the ComfyUI video format, representing the video content to which audio will be added. This input is crucial as it determines the visual component of the final output. The video is expected to be a torch.Tensor, and any deviation from this format will result in an error. The number of frames in the video will influence the duration of the final output, as the node aims to synchronize the audio to match this duration.
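The relationship between frame count and duration can be sketched as plain arithmetic. This is a minimal illustration, not the node's actual code: the frame rate is an assumption (this page does not list an fps input), and the helper names `video_duration_seconds` and `target_audio_samples` are hypothetical.

```python
def video_duration_seconds(num_frames: int, fps: float) -> float:
    """Duration of a clip with num_frames frames played at fps frames per second."""
    if num_frames <= 0 or fps <= 0:
        raise ValueError("num_frames and fps must be positive")
    return num_frames / fps


def target_audio_samples(num_frames: int, fps: float, sample_rate: int) -> int:
    """Number of audio samples that cover the same duration as the video."""
    return round(video_duration_seconds(num_frames, fps) * sample_rate)


# 48 frames at an assumed 24 fps is a 2-second clip.
duration = video_duration_seconds(48, 24.0)
samples = target_audio_samples(48, 24.0, 44100)
print(duration, samples)  # 2.0 88200
```

The sample count is what the audio would need to be stretched or cut to so that it spans the same time as the frames.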

audio

The audio parameter requires a dictionary containing the audio waveform and sample rate, which represents the audio content to be combined with the video. This input is essential for providing the sound that will accompany the video. The audio waveform should be a torch.Tensor, and the dictionary must include keys for 'waveform' and 'sample_rate'. The duration of the audio will be adjusted to match the video's duration, either by looping or padding, depending on the optional parameters.
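As a rough sketch, an audio input of this shape could be built and measured as follows. The `[batch, channels, samples]` layout of the waveform is an assumption based on the usual ComfyUI audio convention, and `audio_duration_seconds` is a hypothetical helper, not part of the node.

```python
import torch

sample_rate = 44100
# One second of stereo silence; assumed layout is [batch, channels, samples].
waveform = torch.zeros(1, 2, sample_rate)
audio = {"waveform": waveform, "sample_rate": sample_rate}


def audio_duration_seconds(audio: dict) -> float:
    """Duration in seconds: samples along the last dimension over the sample rate."""
    return audio["waveform"].shape[-1] / audio["sample_rate"]


audio_seconds = audio_duration_seconds(audio)
print(audio_seconds)  # 1.0
```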

sync_duration

The sync_duration parameter is a boolean option that, when set to True, automatically synchronizes the audio duration to match the video duration. This ensures that the audio and video are perfectly aligned, providing a seamless viewing experience. The default value is True, and it is recommended to keep this setting enabled unless specific desynchronization is desired.

loop_audio

The loop_audio parameter is a boolean option that, when enabled, allows the audio to loop if it is shorter than the video. This is useful for maintaining audio presence throughout the entire video duration without abrupt stops. The default value is False, meaning that by default, the audio will not loop and will instead be padded with silence if it is shorter than the video.
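The looping and padding behavior described above can be sketched in a few lines of PyTorch. This is an illustration under stated assumptions, not the extension's implementation: `fit_audio_length` is a hypothetical name, and trimming audio that is longer than the video is assumed rather than documented on this page.

```python
import torch


def fit_audio_length(waveform: torch.Tensor, target_samples: int,
                     loop_audio: bool = False) -> torch.Tensor:
    """Return a waveform with exactly target_samples along the last dimension."""
    current = waveform.shape[-1]
    if current == 0:
        raise ValueError("waveform has no samples")
    if current >= target_samples:
        # Assumed behavior: trim audio that is longer than the video.
        return waveform[..., :target_samples]
    if loop_audio:
        # Repeat the clip enough times to cover the target, then trim the excess.
        repeats = -(-target_samples // current)  # ceiling division
        tiled = torch.cat([waveform] * repeats, dim=-1)
        return tiled[..., :target_samples]
    # Default: pad the tail with silence (zeros).
    return torch.nn.functional.pad(waveform, (0, target_samples - current))


clip = torch.tensor([[[1.0, 2.0, 3.0]]])  # [batch=1, channels=1, samples=3]
looped = fit_audio_length(clip, 8, loop_audio=True)
padded = fit_audio_length(clip, 8, loop_audio=False)
print(looped.flatten().tolist())  # [1.0, 2.0, 3.0, 1.0, 2.0, 3.0, 1.0, 2.0]
print(padded.flatten().tolist())  # [1.0, 2.0, 3.0, 0.0, 0.0, 0.0, 0.0, 0.0]
```

Looping restarts the clip from its first sample each time, so a loop point may be audible unless the source audio was made to repeat cleanly.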

AudioX Video Audio Combiner Output Parameters:

video

The video output parameter returns the original video content, now combined with the synchronized audio. This output is in the same format as the input video, ensuring compatibility with further processing or playback.

synced_audio

The synced_audio output parameter provides the audio that has been adjusted to match the video's duration. This audio is either looped or padded with silence, depending on the input parameters, ensuring that it aligns perfectly with the video.

info

The info output parameter is a string that contains information about the processing that was performed, such as whether the audio was looped or padded. This can be useful for debugging or understanding the adjustments made during the combination process.

AudioX Video Audio Combiner Usage Tips:

Ensure that your video input is a torch.Tensor to avoid input format errors.
Use the sync_duration parameter to automatically align audio and video durations for a seamless output.
Enable loop_audio if your audio is shorter than the video and you want continuous sound throughout the video.

AudioX Video Audio Combiner Common Errors and Solutions:

Video input must be a torch.Tensor

Explanation: This error occurs when the video input is not in the expected torch.Tensor format.
Solution: Ensure that the video input is correctly formatted as a torch.Tensor before passing it to the node.

Audio input must be a dictionary with 'waveform' and 'sample_rate' keys

Explanation: This error indicates that the audio input is not a dictionary or is missing required keys.
Solution: Verify that the audio input is a dictionary containing both 'waveform' and 'sample_rate' keys, with the waveform being a torch.Tensor.
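Both errors can be caught before the node runs with a small pre-check. The sketch below is hypothetical: `validate_inputs` is not part of ComfyUI-AudioX, the exception types are assumptions, and only the two message strings are taken from the errors listed above.

```python
import torch


def validate_inputs(video, audio) -> None:
    """Raise early with messages modeled on the errors listed above."""
    if not isinstance(video, torch.Tensor):
        raise TypeError("Video input must be a torch.Tensor")
    if (not isinstance(audio, dict)
            or "waveform" not in audio
            or "sample_rate" not in audio):
        raise ValueError(
            "Audio input must be a dictionary with 'waveform' and 'sample_rate' keys"
        )
    if not isinstance(audio["waveform"], torch.Tensor):
        raise TypeError("Audio waveform must be a torch.Tensor")


# Assumed layouts: video as [frames, height, width, channels],
# waveform as [batch, channels, samples].
video = torch.zeros(4, 8, 8, 3)
audio = {"waveform": torch.zeros(1, 2, 100), "sample_rate": 44100}
validate_inputs(video, audio)  # valid inputs pass without raising
```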

AudioX Video Audio Combiner Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-AudioX
