Kling O1 Reference Image to Video: Image-to-Video with Motion Fidelity on playground and API | RunComfy

kling/kling-video-o1/image-to-video/reference

Generate cinematic videos from images or text using reference footage, preserving motion style, camera angles, and scene continuity for unified, high-fidelity visual storytelling.

1. Get started

Use RunComfy's API to run kling/kling-video-o1/image-to-video/reference. For accepted inputs and outputs, see the model's schema.

curl --request POST \
  --url https://model-api.runcomfy.net/v1/models/kling/kling-video-o1/image-to-video/reference \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer <token>" \
  --data '{
    "prompt": "Replace the character in the video with @Element1, maintaining the same movements and camera angles. Transform the landscape into @Image1",
    "elements": [
      {
        "frontal_image_url": "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-2.png",
        "reference_image_urls": [
          "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-3.jpeg"
        ]
      }
    ],
    "image_urls": [
      "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-4.png"
    ]
  }'

2. Authentication

Store your API key in the YOUR_API_TOKEN environment variable (manage keys in your Profile). Send it on every request as a Bearer token in the Authorization header: Authorization: Bearer $YOUR_API_TOKEN.
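As a minimal Python sketch of the same setup, the helper below reads the token from the environment and builds the headers used by the curl examples. The name auth_headers is hypothetical and not part of any RunComfy SDK.

```python
import os


def auth_headers(env=os.environ):
    """Build request headers from the YOUR_API_TOKEN environment variable."""
    token = env.get("YOUR_API_TOKEN", "").strip()
    if not token:
        # Fail early rather than sending an unauthenticated request.
        raise RuntimeError("YOUR_API_TOKEN is not set")
    return {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {token}",
    }
```

Passing a plain dict as env makes the helper easy to exercise without touching the real environment.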

3. API reference

Submit a request

Submit an asynchronous generation job and immediately receive a request_id plus URLs to check status, fetch results, and cancel.

curl --request POST \
  --url https://model-api.runcomfy.net/v1/models/kling/kling-video-o1/image-to-video/reference \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer <token>" \
  --data '{
    "prompt": "Replace the character in the video with @Element1, maintaining the same movements and camera angles. Transform the landscape into @Image1",
    "elements": [
      {
        "frontal_image_url": "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-2.png",
        "reference_image_urls": [
          "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-3.jpeg"
        ]
      }
    ],
    "image_urls": [
      "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-4.png"
    ]
  }'
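The same submission could be sketched in Python with only the standard library. The endpoint URL and headers are taken from the curl command above. The function names build_submit_request and submit are illustrative, and apart from request_id the shape of the response body is not specified here, so the sketch simply returns the decoded JSON.

```python
import json
import urllib.request

API_URL = "https://model-api.runcomfy.net/v1/models/kling/kling-video-o1/image-to-video/reference"


def build_submit_request(token, payload):
    """Prepare (but do not send) the POST request for a generation job."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def submit(token, payload, timeout=30):
    """Send the job and return the decoded JSON response body."""
    request = build_submit_request(token, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Since the call is asynchronous, the returned body should be kept (at least its request_id) for the status, result, and cancel calls.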

Monitor request status

Fetch the current state for a request_id ("in_queue", "in_progress", "completed", or "cancelled").

curl --request GET \
  --url https://model-api.runcomfy.net/v1/requests/{request_id}/status \
  --header "Authorization: Bearer <token>"
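Polling on these states could look like the sketch below. The four state names come from the description above. The JSON field that carries the state is not specified here, so the caller supplies a fetch_status callable that returns the state string. The function name, the 5-second interval, and the 10-minute cap are all assumptions.

```python
import time

# States after which no further change is expected.
TERMINAL_STATES = {"completed", "cancelled"}


def wait_for_completion(fetch_status, interval=5.0, max_wait=600.0, sleep=time.sleep):
    """Poll fetch_status() until a terminal state or until max_wait elapses.

    fetch_status is any zero-argument callable returning the current state string.
    """
    waited = 0.0
    while True:
        state = fetch_status()
        if state in TERMINAL_STATES:
            return state
        if waited >= max_wait:
            raise TimeoutError(f"still {state!r} after {waited:.0f}s")
        sleep(interval)
        waited += interval
```

Injecting sleep keeps the loop testable and lets a caller swap in a backoff strategy later.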

Retrieve request results

Retrieve the final outputs and metadata for the given request_id; if the job is not complete, the response returns the current state so you can continue polling.

curl --request GET \
  --url https://model-api.runcomfy.net/v1/requests/{request_id}/result \
  --header "Authorization: Bearer <token>"

Cancel a request

Cancel a queued job by request_id. In-progress jobs cannot be cancelled.

curl --request POST \
  --url https://model-api.runcomfy.net/v1/requests/{request_id}/cancel \
  --header "Authorization: Bearer <token>"
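The three per-request endpoints (status, result, cancel) share one URL pattern, so a small helper can build them. This is a sketch: the paths and HTTP methods mirror the curl examples, while the name request_endpoint and the percent-encoding of the ID are choices made here.

```python
from urllib.parse import quote

REQUESTS_BASE = "https://model-api.runcomfy.net/v1/requests"
# HTTP method used by each per-request endpoint in the curl examples.
ACTIONS = {"status": "GET", "result": "GET", "cancel": "POST"}


def request_endpoint(request_id, action):
    """Return (method, url) for a status, result, or cancel call."""
    if action not in ACTIONS:
        raise ValueError(f"unknown action: {action!r}")
    # Percent-encode the ID so unexpected characters cannot alter the path.
    return ACTIONS[action], f"{REQUESTS_BASE}/{quote(request_id, safe='')}/{action}"
```

Because only queued jobs can be cancelled, a caller would typically check that the state is still "in_queue" before issuing the cancel call.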

4. File inputs

Hosted file (URL)

Provide a publicly reachable HTTPS URL. Ensure the host allows server‑side fetches (no login/cookies required) and does not rate‑limit or block automated clients. Recommended limits: images ≤ 50 MB (~4K), videos ≤ 100 MB (~2–5 min @ 720p). Prefer stable or pre‑signed URLs for private assets.
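A local pre-flight check along these lines may catch obvious problems before a job is submitted. It is only a sketch: the limits are the recommended values quoted above, MB is assumed to mean 1024 × 1024 bytes, and check_hosted_file is a hypothetical helper that takes a file size the caller already knows.

```python
from urllib.parse import urlparse

MB = 1024 * 1024  # assumption: binary megabytes
# Recommended limits from the text; treated as advisory here.
RECOMMENDED_MAX_BYTES = {"image": 50 * MB, "video": 100 * MB}


def check_hosted_file(url, kind, size_bytes=None):
    """Return a list of warnings for a hosted input file; empty means none found."""
    warnings = []
    if urlparse(url).scheme != "https":
        warnings.append("URL is not HTTPS")
    limit = RECOMMENDED_MAX_BYTES[kind]
    if size_bytes is not None and size_bytes > limit:
        warnings.append(f"{kind} is larger than the recommended {limit // MB} MB")
    return warnings
```

The check does not confirm that the host allows server-side fetches; that still has to be verified against the host itself.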

5. Schema

Input schema

{
  "type": "object",
  "title": "Input",
  "required": [
    "prompt",
    "elements",
    "image_urls"
  ],
  "properties": {
    "prompt": {
      "title": "Prompt",
      "description": "Reference elements as @Element1, @Element2 and images as @Image1, @Image2, in order. Spell these tokens exactly as shown.",
      "type": "string",
      "default": "Replace the character in the video with @Element1, maintaining the same movements and camera angles. Transform the landscape into @Image1"
    },
    "image_urls": {
      "title": "Reference Images",
      "description": "Additional reference images for style/appearance. Reference in prompt as @Image1, @Image2, etc. Maximum 7 total across elements + reference images + start image.",
      "type": "array",
      "items": {
        "type": "string",
        "format": "image_uri"
      },
      "default": [
        "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-4.png"
      ],
      "maxItems": 7,
      "minItems": 0
    },
    "elements": {
      "title": "Elements",
      "description": "Provide characters/objects to include. Reference in prompt as @Element1, @Element2, etc. Maximum 7 total across elements + reference images + start image.",
      "type": "array",
      "default": [
        {
          "frontal_image_url": "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-2.png",
          "reference_image_urls": [
            "https://playgrounds-storage-public.runcomfy.net/tools/7244/media-files/input-1-3.jpeg"
          ]
        }
      ],
      "items": {
        "type": "object",
        "properties": {
          "frontal_image_url": {
            "title": "Frontal Image Url",
            "description": "The frontal image of the element (main view). Max file size: 10.0MB, Min width: 300px, Min height: 300px, Min aspect ratio: 0.40, Max aspect ratio: 2.50, Timeout: 20.0s",
            "type": "string",
            "format": "image_uri",
            "default": ""
          },
          "reference_image_urls": {
            "title": "Reference Image Urls",
            "description": "Additional reference images from different angles. 1-4 images supported. At least one image is required.",
            "type": "array",
            "format": "image_uris",
            "items": {
              "type": "string",
              "format": "image_uri"
            },
            "default": [],
            "maxItems": 4,
            "minItems": 1
          }
        }
      },
      "maxItems": 7,
      "minItems": 0
    },
    "duration": {
      "title": "Duration (seconds)",
      "description": "Video duration in seconds. Only 5 and 10 are supported.",
      "type": "string",
      "enum": [
        "5",
        "10"
      ],
      "default": "5"
    },
    "aspect_ratio": {
      "title": "Aspect Ratio (W:H)",
      "description": "The aspect ratio of the generated video frame.",
      "type": "string",
      "enum": [
        "16:9",
        "9:16",
        "1:1"
      ],
      "default": "16:9"
    }
  }
}
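Several of these constraints can be checked locally before submitting. The sketch below is not an official validator. It assumes the 7-image cap counts each element once plus each entry in image_urls (the schema wording is ambiguous on this point), and validate_input is a hypothetical name.

```python
import re

MAX_TOTAL_IMAGES = 7


def validate_input(payload):
    """Return a list of problems found in a request payload; empty means none found."""
    problems = []
    prompt = payload.get("prompt", "")
    elements = payload.get("elements", [])
    image_urls = payload.get("image_urls", [])

    if not prompt:
        problems.append("prompt is required")

    for i, element in enumerate(elements, start=1):
        refs = element.get("reference_image_urls", [])
        if not element.get("frontal_image_url"):
            problems.append(f"element {i}: frontal_image_url is missing")
        if not 1 <= len(refs) <= 4:
            problems.append(f"element {i}: needs 1-4 reference images, got {len(refs)}")

    # Assumption: one slot per element plus one per standalone reference image.
    total = len(elements) + len(image_urls)
    if total > MAX_TOTAL_IMAGES:
        problems.append(f"{total} elements and images exceed the cap of {MAX_TOTAL_IMAGES}")

    # Every @ElementN / @ImageN in the prompt must point at a supplied item.
    for kind, count in (("Element", len(elements)), ("Image", len(image_urls))):
        for number in re.findall(rf"@{kind}(\d+)", prompt):
            if not 1 <= int(number) <= count:
                problems.append(f"prompt mentions @{kind}{number} but only {count} provided")

    # duration is a string enum in the schema, so the integer 5 is rejected.
    if payload.get("duration", "5") not in ("5", "10"):
        problems.append("duration must be the string '5' or '10'")
    if payload.get("aspect_ratio", "16:9") not in ("16:9", "9:16", "1:1"):
        problems.append("aspect_ratio must be one of 16:9, 9:16, 1:1")
    return problems
```

Image dimension and file-size limits on frontal_image_url are not checked here, since they require fetching the image.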

Output schema

{
  "output": {
    "type": "object",
    "properties": {
      "image": {
        "type": "string",
        "format": "uri",
        "description": "single image URL"
      },
      "video": {
        "type": "string",
        "format": "uri",
        "description": "single video URL"
      },
      "images": {
        "type": "array",
        "description": "multiple image URLs",
        "items": { "type": "string", "format": "uri" }
      },
      "videos": {
        "type": "array",
        "description": "multiple video URLs",
        "items": { "type": "string", "format": "uri" }
      }
    }
  }
}
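Because the output may carry a single URL or a list under any of these four keys, client code can normalize it into one list. This sketch assumes the result response contains an "output" object shaped like the schema above; collect_output_urls is an illustrative name.

```python
def collect_output_urls(output):
    """Gather every URL from an object shaped like the output schema."""
    urls = []
    # Single-value fields first, then the list-valued ones.
    for key in ("image", "video"):
        if output.get(key):
            urls.append(output[key])
    for key in ("images", "videos"):
        # Tolerate a missing key or an explicit null.
        urls.extend(output.get(key) or [])
    return urls
```

For this model a caller would usually expect a single entry under "video", but the helper does not rely on that.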