logo
RunComfy
  • ComfyUI
  • TrainerNew
  • Models
  • API
  • Pricing
discord logo
MODELS
Explore
All Models
LIBRARY
Generations
MODEL APIS
API Docs
API Keys
ACCOUNT
Usage

Kling Video O3 Pro Reference To Video: Cinematic Identity-Locked Video Generation on Models and API | RunComfy

kling/kling-video-o3/pro/reference-to-video

Combine reference images, an optional reference video, and a prompt into a 3-15s Pro-grade clip with identity consistency using Kling Video O3 Pro Reference To Video, on RunComfy models and HTTP API.

Describe the scene, action, and camera. Refer to references by position, e.g. 'The woman in Figure 1 walks with the man in Figure 2 through a neon-lit alley.'
Optional reference video for motion guidance, style transfer, or scene continuity. When provided, billing switches to the with-reference-video tier and overrides the sound multiplier.
Image 1
Image 2
Reference images of characters, props, or styles. Up to 7 without a reference video, or up to 4 when a reference video is provided.
When a reference video is provided, retain its original audio track in the generated output. Default: enabled.
When enabled (and no reference video is provided), synthesize matching ambient audio and effects. Adds ~20% to the per-second cost. Default: disabled.
Output frame ratio. 16:9 for landscape, 9:16 for vertical social, 1:1 for square. Default: 16:9.
Length of the generated clip in seconds (3-15). Default: 5.
Editing scope. Use intelligent for auto-decided pacing and cuts, or customize for prompt-driven manual control. Default: intelligent.
Additional prompt segments to guide scene transitions and progressions. The sum of durations in multi_prompt must equal to total video duration.
Idle
The rate is $0.112 per second without sound, $0.135 per second with synthesized sound, and $0.168 per second when a reference video is provided.

Introduction To Kling Video O3 Pro Reference To Video

Kuaishou's Kling Video O3 Pro Reference To Video turns reference images, an optional reference video, and a prompt into 3 to 15 second cinematic clips at $0.112 per second without sound, $0.135 per second with synthesized sound, or $0.168 per second when a reference video is supplied.

Trading reshoots, casting days, and frame-by-frame compositing for one guided generation, the model gives film teams, ad agencies, brand studios, and product groups identity-locked characters and styles in broadcast-grade footage.

For developers, Kling Video O3 Pro Reference To Video on RunComfy can be used both in the browser and via an HTTP API, so you don't need to host or scale the model yourself.

Ideal for: Hero Character Films | Premium Spokesperson Spots | Style-Consistent Cinematic Reels

Kuaishou / Kling Video O3 Pro Reference To Video#


This is the Pro-tier reference-driven entry in Kuaishou's O3 generation, tuned for final-render fidelity. It generates new clips while preserving the identity of the characters, props, or styles you pass in as references, and accepts an optional reference video for motion or style guidance.


It fits teams that need cinematic short-form video featuring specific people, products, or art directions — without a shoot, manual rotoscoping, or self-hosting the model.


Highlights#


  • Identity preservation: Lock facial features, wardrobe, and props from your references so subjects stay consistent across every frame of Kling Video O3 Pro Reference To Video output.
  • Multi-reference composition: Combine up to 7 reference images, or up to 4 alongside a reference video, to stage multi-character or multi-prop scenes in one pass.
  • Optional reference video: Drop in a clip for motion guidance, style transfer, or scene continuity instead of starting from stills alone.
  • Sound paths: Keep the original audio from the reference clip, or synthesize new synchronized sound when no reference video is provided.
  • Pro-grade fidelity: Targets the top of the O3 family for lighting, composition, and motion realism suitable for hero and broadcast cuts.
  • Multi-format output: 16:9, 9:16, and 1:1 cover landscape, vertical, and square placements from a single model.
  • Flexible duration: Any whole-second length from 3 to 15 seconds works for hooks, beats, or full short-form posts.

Related Models

pika-2-2/text-to-video

Create high quality videos from text prompts using Pika 2.2.

luma-ray-2/text-to-video

Generate high quality videos from text prompts using Luma Ray 2.

runway-gen-3-alpha/turbo/image-to-video

Lightning-fast video creation with lifelike and smooth kinetics.

sora-2/text-to-video

Generate realistic videos with synced audio from text using OpenAI Sora 2.

kling-video-o3/pro/text-to-video

Cinematic Pro-tier text-to-video at $0.112 per second of output.

pikadditions

Add a person or object into an existing video with smart compositing.

Frequently Asked Questions

What is Kling Video O3 Pro Reference To Video and what does it do?

Kling Video O3 Pro Reference To Video is Kuaishou's Pro-tier reference-driven entry in the O3 family. It generates a 3 to 15 second cinematic clip from a prompt while preserving the identity of the people, props, or styles you supply as reference images, with an optional reference video for motion or style guidance. The result keeps consistent characters and looks across frames at the top of the O3 fidelity range.

What kinds of references can I use with Kling Video O3 Pro Reference To Video?

You can attach reference images of characters, props, or art styles, plus an optional reference video for motion or style transfer. Without a reference video, up to 7 image references are supported; with a reference video, image references are capped at 4. In the prompt, you bind subjects to specific images by position — for example, "Figure 1 stands next to Figure 2" — so Kling Video O3 Pro Reference To Video knows which subject to place where.

How does Kling Video O3 Pro Reference To Video compare to the O3 Standard reference tier?

Compared with the Standard reference-to-video tier, Kling Video O3 Pro Reference To Video targets a higher fidelity ceiling on motion, lighting, and identity stability, which suits hero and broadcast cuts based on available provider information. Standard is positioned for iteration and high-volume drafts at a lower per-second rate. The control surface — prompt, references, aspect ratio, duration, sound, shot type — is the same, so prompts and references move between tiers without rework.

Which teams and use cases benefit most from Kling Video O3 Pro Reference To Video?

Film teams, ad agencies, brand studios, e-commerce video producers, and product groups use Kling Video O3 Pro Reference To Video to produce identity-locked spots from existing stills — different scenes, props, or moods without reshoots. It also fits spokesperson content, premium concept reels, multi-character storytelling beats, and style-consistent campaign cutdowns. Developers integrate it into automated pipelines that turn a brief plus a few references into a finished short clip.

What input limits should I know before using Kling Video O3 Pro Reference To Video?

The model requires a prompt; references and the reference video are optional but recommended. Image references are capped at 7 (or 4 alongside a reference video), aspect_ratio accepts 16:9, 9:16, or 1:1, duration is an integer between 3 and 15 seconds, and shot_type accepts customize or intelligent. Sound and keep_original_sound are booleans, and sound generation only applies when no reference video is supplied. For other constraints such as resolution or file format, check the current RunComfy parameter panel for the exact limits, since they may vary by provider settings.

Can developers use Kling Video O3 Pro Reference To Video through the RunComfy API?

Yes. You can prototype Kling Video O3 Pro Reference To Video in the RunComfy AI Playground Web UI — dialing in references, prompt phrasing, aspect ratio, duration, and audio toggles — and then call the same model via the RunComfy API with identical parameters. This keeps creative iteration in the browser while production runs in code, without changing how the model behaves.

How much does it cost to generate with Kling Video O3 Pro Reference To Video on RunComfy?

Generations consume usd / credits from your RunComfy balance. Kling Video O3 Pro Reference To Video bills $0.112 per second without sound, $0.135 per second when synthesized sound is enabled, and $0.168 per second when a reference video is supplied (the reference-video rate overrides the sound multiplier). As examples, 5 seconds without sound is around $0.560, 10 seconds with sound is around $1.350, and 15 seconds with a reference video is around $2.520. New users typically get a free trial usd amount; refer to the Generation section of the model page for the latest rates.

What prompting style works best with Kling Video O3 Pro Reference To Video?

Kling Video O3 Pro Reference To Video responds best to clear, specific prompts that bind references to roles using "Figure 1", "Figure 2", and so on, then describe the action, environment, and camera move. Concrete cues like "50mm slow dolly-in", "golden hour rim light", or "neon practicals" anchor look and motion better than vague mood words. For complex scenes, use multi-prompt segments to separate beats so transitions stay clean within a single clip.

Follow us
  • LinkedIn
  • Facebook
  • Instagram
  • Twitter
Support
  • Discord
  • Email
  • System Status
  • Affiliate
Video Models
  • Wan 2.6 Flash
  • Seedance 1.0 Pro Fast
  • Wan 2.6
  • Seedance 2.0 Pro
  • Wan 2.7
  • Seedance 1.0
  • View All Models →
Image Models
  • seedream 4.0
  • Flux 2 Dev
  • Nano Banana Pro
  • Nano Banana 2 Edit
  • GPT Image 2 Image Edit
  • Flux 2 Flash Edit
  • View All Models →
Legal
  • Terms of Service
  • Privacy Policy
  • Cookie Policy
RunComfy
Copyright 2026 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Examples Of Kling Video O3 Pro Reference To Video

Video thumbnail
Loading...
Video thumbnail
Loading...
Video thumbnail
Loading...
Video thumbnail
Loading...
Video thumbnail
Loading...
Video thumbnail
Loading...