ComfyUI > Nodes > ComfyUI > Kling 2.6 Image(First Frame) to Video with Audio

ComfyUI Node: Kling 2.6 Image(First Frame) to Video with Audio

Class Name

KlingImageToVideoWithAudio

Category
api node/video/Kling
Author
ComfyAnonymous (Account age: 763days)
Extension
ComfyUI
Latest Updated
2026-05-13
Github Stars
112.77K

How to Install ComfyUI

Install this extension via the ComfyUI Manager by searching for ComfyUI
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Kling 2.6 Image(First Frame) to Video with Audio Description

Transform static images into dynamic videos with audio for immersive storytelling and engaging visual content.

Kling 2.6 Image(First Frame) to Video with Audio:

The KlingImageToVideoWithAudio node is designed to transform a static image into a dynamic video with accompanying audio, leveraging advanced video generation techniques. This node is particularly beneficial for AI artists and creators who wish to bring still images to life by adding motion and sound, creating a more immersive and engaging experience. By using this node, you can convert the first frame of an image into a video sequence, enriched with audio, which can be particularly useful for storytelling, presentations, or enhancing visual content with auditory elements. The node's capabilities allow for the seamless integration of visual and audio components, making it a powerful tool for creative projects that require a blend of static and dynamic media.

Kling 2.6 Image(First Frame) to Video with Audio Input Parameters:

model_name

This parameter specifies the model to be used for video generation. It determines the underlying algorithm and style applied to the video creation process. The default model is KlingVideoGenModelName.kling_v1, which is optimized for standard video generation tasks.

cfg_scale

The cfg_scale parameter controls the configuration scale, influencing the degree of adherence to the input prompts. A higher value results in a video that closely follows the specified prompts, while a lower value allows for more creative freedom and variation. The exact range and default value are not specified in the context, but it typically requires careful adjustment to balance creativity and prompt fidelity.

model_mode

This parameter defines the mode of the video generation model. The default mode is KlingVideoGenMode.std, which stands for standard mode, suitable for general video creation tasks. Different modes may offer specialized features or effects, depending on the model's capabilities.

aspect_ratio

The aspect_ratio parameter sets the width-to-height ratio of the generated video. It is crucial for ensuring that the video fits the desired display format or platform requirements. The parameter is represented by KlingVideoGenAspectRatio, which likely includes predefined options to choose from.

duration

This parameter specifies the length of the generated video. The default duration is KlingVideoGenDuration.field_5, which indicates a predefined time setting. Adjusting this parameter allows you to control how long the video will play, which is essential for matching the video's length to the accompanying audio or the intended use case.

prompt

The prompt parameter is a textual input that guides the video generation process. It provides the creative direction or theme for the video, influencing the visual elements and narrative. Crafting a clear and descriptive prompt is key to achieving the desired video output.

negative_prompt

This parameter allows you to specify elements or themes to avoid in the video generation process. By providing a negative_prompt, you can steer the model away from certain styles or content, ensuring the final video aligns more closely with your vision.

camera_control

The camera_control parameter manages the virtual camera movements within the generated video. It affects how the scene is captured and can add dynamic perspectives or focus shifts, enhancing the video's visual appeal.

Kling 2.6 Image(First Frame) to Video with Audio Output Parameters:

video

The video output is the primary result of the node, providing the generated video file that combines the visual transformation of the input image with the integrated audio. This output is crucial for reviewing and utilizing the final animated content in various applications.

duration

This output parameter indicates the actual duration of the generated video. It is important for verifying that the video length matches the intended specifications and for synchronizing with other media elements if necessary.

Kling 2.6 Image(First Frame) to Video with Audio Usage Tips:

  • Experiment with different prompt and negative_prompt combinations to fine-tune the creative direction of your video. This can help achieve a more personalized and unique output.
  • Adjust the cfg_scale to balance between strict adherence to the prompt and allowing for creative variations. This can be particularly useful when exploring different artistic styles or effects.
  • Ensure that the aspect_ratio matches the intended display format to avoid cropping or distortion when the video is viewed on different devices or platforms.

Kling 2.6 Image(First Frame) to Video with Audio Common Errors and Solutions:

Kling request failed. Code: <code>, Message: <message>, Data: <data>

  • Explanation: This error indicates that the request to the Kling API failed, possibly due to incorrect parameters or server issues.
  • Solution: Verify that all input parameters are correctly set and within the allowed ranges. Check your network connection and try again. If the problem persists, consult the API documentation or support for further assistance.

Video or audio file size exceeds limit

  • Explanation: The input video or audio file exceeds the maximum allowed size, which can prevent successful processing.
  • Solution: Ensure that the audio file is no larger than 5MB and the video file is no larger than 100MB. Resize or compress the files if necessary before retrying.

Invalid aspect ratio

  • Explanation: The specified aspect ratio is not supported by the node, leading to an error in video generation.
  • Solution: Choose an aspect ratio from the predefined options in KlingVideoGenAspectRatio to ensure compatibility with the node's requirements.

Kling 2.6 Image(First Frame) to Video with Audio Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Kling 2.6 Image(First Frame) to Video with Audio