logo
RunComfy
  • Models
  • ComfyUI
  • TrainerNew
  • API
  • Pricing
discord logo
ComfyUI>Workflows>MMAudio | Video-to-Audio

MMAudio | Video-to-Audio

Workflow Name: RunComfy/MMAudio
Workflow ID: 0000...1180
MMAudio generates synchronized audio from video and text inputs with unmatched precision. Using multimodal joint training, it adapts to diverse audio-visual and audio-text datasets seamlessly. Its advanced synchronization module ensures perfect alignment, transforming audio creation for modern content needs.

The ComfyUI-MMAudio nodes and its associated workflow are fully developed by Kijai. We give all due credit to Kijai for this innovative work. On the RunComfy platform, we are simply presenting Kijai’s contributions to the community. It is important to note that there is currently no formal connection or partnership between RunComfy and Kijai. We deeply appreciate Kijai’s work!

MMAudio

MMAudio is a powerful tool for creating synchronized audio from video and text inputs. It utilizes multimodal joint training to learn from diverse audio-visual and audio-text datasets, ensuring exceptional adaptability. With its advanced synchronization module, it perfectly aligns audio to video frames. MMAudio revolutionizes audio generation, streamlining the process for creators and innovators alike.

1.1 How to Use MMAudio Workflow?

MMAudio

This is the MMAudio workflow, Left Side nodes are inputs for uploading video, Middle is processing MMAudio nodes, and right is the outputs node.

  • Upload your Video in input nodes.
  • Write your audio generation prompts.
  • Click Render !!!

1.2 Video Input

MMAudio

  • Click and Upload your Reference Video.

The video is set to downscale the video to ?*512 resolution as processing HD Video or longer video may run of out memory.

1.3 MMAudio Processing

MMAudio

  • Positive: Enter the video generation prompts for the audio.
  • Negative: Enter what you don't want to hear.
  • Steps : More steps may improve audio quality.

1.4 MMAudio Models

MMAudio

These are the model downloader nodes, it will automatically download models in your comfyui in 2-3 mins.

  • MMAudio Models : https://github.com/hkchengrex/MMAudio

With its innovative multimodal training and precise synchronization, MMAudio sets a new standard in audio generation. Whether you're crafting videos, animations, or immersive experiences, MMAudio empowers creators with seamless, high-quality audio. Elevate your projects and bring your ideas to life with MMAudio.

Want More ComfyUI Workflows?

FLUX ControlNet Depth-V3 & Canny-V3

Achieve better control with FLUX-ControlNet-Depth & FLUX-ControlNet-Canny for FLUX.1 [dev].

Wan 2.2 Animate | Character Swap & Lip-Sync

Transforms any face to speak and move like the original with ease.

Segment Anything V2 (SAM2) | Video Segmentation

Object segmentation of videos with unrivaled accuracy.

SUPIR + Foolhardy Remacri | 8K Image/Video Upscaler

SUPIR + Foolhardy Remacri | 8K Image/Video Upscaler

Upscale images to 8K with SUPIR and 4x Foolhardy Remacri model.

CogVideoX-5B | Advanced Text-to-Video Model

CogVideoX-5B: Advanced text-to-video model for high-quality video generation.

ByteDance USO | Unified Style & Subject Generator

ByteDance USO makes subject and style fusion simple and powerful.

ComfyUI Phantom | Subjects to Video

Reference-driven video generation using Wan2.1 14B

Wan2.2 Fun Inp | Cinematic Video Generator

From 2 images to stunning videos with smooth, controllable transitions.

Follow us
  • LinkedIn
  • Facebook
  • Instagram
  • Twitter
Support
  • Discord
  • Email
  • System Status
  • Affiliate
Resources
  • Free ComfyUI Online
  • ComfyUI Guides
  • RunComfy API
  • ComfyUI Tutorials
  • ComfyUI Nodes
  • Learn More
Legal
  • Terms of Service
  • Privacy Policy
  • Cookie Policy
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.