ComfyUI > Nodes > wavespeed-comfyui > WaveSpeedAI MMAudio V2

ComfyUI Node: WaveSpeedAI MMAudio V2

Class Name

WaveSpeedAI MMAudioV2Node

Category
WaveSpeedAI
Author
WaveSpeedAI (Account age: 447days)
Extension
wavespeed-comfyui
Latest Updated
2026-03-18
Github Stars
0.06K

How to Install wavespeed-comfyui

Install this extension via the ComfyUI Manager by searching for wavespeed-comfyui
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter wavespeed-comfyui in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

WaveSpeedAI MMAudio V2 Description

Generates synchronized, high-quality audio for videos using WaveSpeed AI's MMAudio V2 model.

WaveSpeedAI MMAudio V2:

The WaveSpeedAI MMAudioV2Node is a powerful tool designed to generate synchronized audio for video content using WaveSpeed AI's advanced MMAudio V2 model. This node is particularly beneficial for creators looking to enhance their video projects with audio that is not only high-quality but also perfectly aligned with the visual elements. By leveraging sophisticated AI algorithms, the node ensures that the audio complements the video seamlessly, providing an immersive experience for viewers. This capability is especially useful for AI artists and content creators who wish to add a professional touch to their videos without the need for extensive technical expertise in audio production.

WaveSpeedAI MMAudio V2 Input Parameters:

client

The client parameter is essential for authenticating and authorizing requests to the WaveSpeed AI API. It typically contains the API key required to access the service. This parameter ensures that the node can communicate with the WaveSpeed AI servers to process the video and generate the corresponding audio.

video

The video parameter is a mandatory input that specifies the video content for which synchronized audio is to be generated. It is crucial that this parameter is provided, as the node uses the visual cues from the video to create audio that aligns perfectly with the imagery. The video should be in a format supported by the node and accessible via a valid URL or file path.

prompt

The prompt parameter is a text input that guides the audio generation process. It allows you to specify the mood, theme, or specific elements you want the audio to reflect. This parameter is vital for tailoring the audio output to match the desired artistic vision or narrative of the video.

negative_prompt

The negative_prompt parameter is an optional input that helps refine the audio generation by specifying elements or themes to avoid. This can be useful for ensuring that the generated audio does not include unwanted sounds or moods that might detract from the video's intended impact.

duration

The duration parameter defines the length of the audio to be generated, measured in seconds. It is important to set this parameter to match the length of the video to ensure synchronization. The default value is 8 seconds, but it can be adjusted to fit the specific needs of your project.

seed

The seed parameter is used to initialize the random number generator for the audio generation process. By setting a specific seed value, you can ensure that the audio output is consistent across multiple runs with the same input parameters. This is useful for reproducibility and fine-tuning the results.

num_inference_steps

The num_inference_steps parameter determines the number of steps the model takes during the inference process. A higher number of steps can lead to more refined audio output but may also increase the processing time. The default value is 25 steps, providing a balance between quality and efficiency.

guidance_scale

The guidance_scale parameter influences the strength of the prompt in guiding the audio generation. A higher value places more emphasis on the prompt, potentially leading to audio that more closely aligns with the specified themes or moods. The default value is 4.5, which offers a moderate level of guidance.

mask_away_clip

The mask_away_clip parameter is a boolean option that, when enabled, masks certain parts of the video to prevent them from influencing the audio generation. This can be useful for focusing the audio on specific elements of the video, ensuring that the generated sound complements the most important visual aspects.

WaveSpeedAI MMAudio V2 Output Parameters:

video_url

The video_url output parameter provides the URL of the video with the newly generated synchronized audio. This URL allows you to access and download the final video product, which combines the original visual content with the AI-generated audio, ready for sharing or further editing.

WaveSpeedAI MMAudio V2 Usage Tips:

  • Ensure that the video parameter is correctly specified and accessible to avoid errors during processing.
  • Use the prompt and negative_prompt parameters creatively to guide the audio generation towards your desired artistic outcome.
  • Experiment with the guidance_scale and num_inference_steps to find the optimal balance between processing time and audio quality.

WaveSpeedAI MMAudio V2 Common Errors and Solutions:

Video is required

  • Explanation: This error occurs when the video parameter is not provided or is empty.
  • Solution: Ensure that you specify a valid video URL or file path in the video parameter before executing the node.

Prompt is required

  • Explanation: This error is raised when the prompt parameter is missing or empty.
  • Solution: Provide a meaningful text prompt to guide the audio generation process.

No video URL in the generated result

  • Explanation: This error indicates that the node was unable to generate a video URL, possibly due to an issue with the input parameters or the API request.
  • Solution: Verify that all required parameters are correctly set and that the API client is properly configured with a valid API key. Retry the request after checking these settings.

WaveSpeedAI MMAudio V2 Related Nodes

Go back to the extension to check out more related nodes.
wavespeed-comfyui
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

WaveSpeedAI MMAudio V2