CatVTON for easy and accurate virtual try-on.

Self Forcing | Autoregressive Keyframe-to-Video Generation

SUPER FAST! 5-second video in 45 seconds!

ACE-Step Music Generation | AI Audio Creation

Generate studio-quality music 15× faster with breakthrough diffusion technology.

BAGEL AI | T2I + I2T + I2I

Multimodal understanding and generation with open-source AI.

ComfyUI > Nodes > Comfyui_Comfly_v2 > Comfly_lip_sync

ComfyUI Node: Comfly_lip_sync

Class Name

Comfly_lip_sync

Category
Comfly-v2/Comfly_kling

Author
ainewsto (Account age: 1002days) Extension
Comfyui_Comfly_v2 Latest Updated
2025-06-06 Github Stars
0.04K

Github Ask ainewsto Current Questions Past Questions

Table of Content

Description
Comfly_lip_sync:
Comfly_lip_sync Input Parameters:
Comfly_lip_sync Output Parameters:
Comfly_lip_sync Usage Tips:
Comfly_lip_sync Common Errors and Solutions:
Related Nodes

How to Install Comfyui_Comfly_v2

Install this extension via the ComfyUI Manager by searching for Comfyui_Comfly_v2

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter Comfyui_Comfly_v2 in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Comfly_lip_sync Description

Synchronize lip movements in videos with audio or text for realistic lip-syncing in multimedia projects.

Comfly_lip_sync:

The Comfly_lip_sync node is designed to synchronize lip movements in a video with either audio content or a text prompt, providing a seamless and realistic lip-syncing experience. This node is particularly beneficial for AI artists and content creators who wish to enhance their video projects by ensuring that the visual representation of speech matches the audio or text input. By leveraging advanced algorithms, the node analyzes the audio or text and adjusts the mouth movements in the video accordingly, creating a natural and convincing synchronization. This capability is essential for applications such as animated films, virtual avatars, and other multimedia projects where accurate lip-syncing is crucial for maintaining viewer engagement and realism.

Comfly_lip_sync Input Parameters:

video

The video parameter is the input video file that will be processed for lip-syncing. It is crucial that the video contains a distinct face to ensure accurate synchronization. The video file should not exceed 100MB in size, with dimensions between 720px and 1920px in both height and width, and a duration between 2 and 10 seconds. These constraints ensure optimal processing and output quality.

audio

The audio parameter is the input audio file that provides the speech content for lip-syncing. The audio should contain clearly distinguishable vocals to facilitate accurate synchronization with the video. The file size should not exceed 5MB to ensure efficient processing. This parameter is essential when syncing lip movements to an audio track.

voice_language

The voice_language parameter specifies the language of the voice in the audio or text input. This selection helps the node to accurately interpret and synchronize the lip movements with the corresponding language nuances. The available options are determined by the KlingLipSyncVoiceLanguage enumeration, ensuring compatibility with various languages.

Comfly_lip_sync Output Parameters:

VIDEO

The VIDEO output is the processed video file with synchronized lip movements. This output is the primary result of the node's operation, providing a video where the mouth movements align with the audio or text input, enhancing the realism and engagement of the content.

video_id

The video_id is a unique identifier for the processed video. This identifier can be used for tracking and referencing the video within larger workflows or systems, ensuring easy management and retrieval of the processed content.

duration

The duration output indicates the length of the processed video. This information is useful for verifying that the video meets the expected duration constraints and for planning further processing or integration steps in multimedia projects.

Comfly_lip_sync Usage Tips:

Ensure that the input video contains a clear and distinct face to achieve the best lip-syncing results.
Use high-quality audio with clearly distinguishable vocals to facilitate accurate synchronization.
Keep the video and audio file sizes within the specified limits to ensure efficient processing and optimal output quality.

Comfly_lip_sync Common Errors and Solutions:

Video length does not meet requirements

Explanation: The input video does not adhere to the required duration constraints of 2 to 10 seconds.
Solution: Adjust the video length to fall within the specified range before processing.

Video dimensions are out of bounds

Explanation: The input video dimensions do not meet the required range of 720px to 1920px.
Solution: Resize the video to ensure both height and width are within the specified limits.

Audio file is too large

Explanation: The input audio file exceeds the maximum size limit of 5MB.
Solution: Compress or trim the audio file to reduce its size before processing.

Missing or invalid voice language

Explanation: The voice_language parameter is not set or contains an invalid value.
Solution: Select a valid language option from the KlingLipSyncVoiceLanguage enumeration.

Comfly_lip_sync Related Nodes

Go back to the extension to check out more related nodes.

Comfyui_Comfly_v2

Table of Content

Description
Comfly_lip_sync:
Comfly_lip_sync Input Parameters:
Comfly_lip_sync Output Parameters:
Comfly_lip_sync Usage Tips:
Comfly_lip_sync Common Errors and Solutions:
Related Nodes

Hallo2 | Lip-Sync Portrait Animation

Audio-driven lip-sync for portrait animation in 4K.

Flux Depth and Canny

Official Flux Tools - Flux Depth and Canny ControlNet Model

Wan 2.1 FLF2V | First-Last Frame Video

Generate smooth videos from a start and end frame using Wan 2.1 FLF2V.

ReActor | Fast Face Swap

Professional face swapping toolkit for ComfyUI that enables natural face replacement and enhancement.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.