wan-ai/wan-2-2/speech-to-video
Animate a single photo with synced speech, singing, or performance, delivering expressive motion, natural lip sync, and high-resolution outputs.
Table of contents
1. Get started
Use RunComfy's API to run wan-ai/wan-2-2/speech-to-video. For accepted inputs and outputs, see the model's schema.
curl --request POST \
--url https://model-api.runcomfy.net/v1/models/wan-ai/wan-2-2/speech-to-video \
--header "Content-Type: application/json" \
--header "Authorization: Bearer <token>" \
--data '{
"image_url": "https://playgrounds-storage-public.runcomfy.net/tools/7078/media-files/input.png",
"audio_url": "https://playgrounds-storage-public.runcomfy.net/tools/7078/media-files/input.mp3"
}'2. Authentication
Set the YOUR_API_TOKEN environment variable with your API key (manage keys in your Profile) and include it on every request as a Bearer token via the Authorization header: Authorization: Bearer $YOUR_API_TOKEN.
3. API reference
Submit a request
Submit an asynchronous generation job and immediately receive a request_id plus URLs to check status, fetch results, and cancel.
curl --request POST \
--url https://model-api.runcomfy.net/v1/models/wan-ai/wan-2-2/speech-to-video \
--header "Content-Type: application/json" \
--header "Authorization: Bearer <token>" \
--data '{
"image_url": "https://playgrounds-storage-public.runcomfy.net/tools/7078/media-files/input.png",
"audio_url": "https://playgrounds-storage-public.runcomfy.net/tools/7078/media-files/input.mp3"
}'Monitor request status
Fetch the current state for a request_id ("in_queue", "in_progress", "completed", or "cancelled").
curl --request GET \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/status \
--header "Authorization: Bearer <token>"Retrieve request results
Retrieve the final outputs and metadata for the given request_id; if the job is not complete, the response returns the current state so you can continue polling.
curl --request GET \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/result \
--header "Authorization: Bearer <token>"Cancel a request
Cancel a queued job by request_id, in-progress jobs cannot be cancelled.
curl --request POST \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/cancel \
--header "Authorization: Bearer <token>"4. File inputs
Hosted file (URL)
Provide a publicly reachable HTTPS URL. Ensure the host allows server‑side fetches (no login/cookies required) and isn't rate‑limited or blocking bots. Recommended limits: images ≤ 50 MB (~4K), videos ≤ 100 MB (~2–5 min @ 720p). Prefer stable or pre‑signed URLs for private assets.
5. Schema
Input schema
{
"type": "object",
"title": "Input",
"required": [
"image_url",
"audio_url"
],
"properties": {
"image_url": {
"title": "Image",
"description": "Image format must be: jpg, jpeg, png, bmp, webp.",
"type": "string",
"validations": [
{
"validation_rule": "width_pixels>",
"validation_value": 400,
"validation_error": "The uploaded image width and height must exceed 400 pixels."
},
{
"validation_rule": "height_pixels>",
"validation_value": 400,
"validation_error": "The uploaded image width and height must exceed 400 pixels."
},
{
"validation_rule": "width_pixels<",
"validation_value": 7000,
"validation_error": "The uploaded image width and height must not exceed 7000 pixels."
},
{
"validation_rule": "height_pixels<",
"validation_value": 7000,
"validation_error": "The uploaded image width and height must not exceed 7000 pixels."
}
],
"default": "https://playgrounds-storage-public.runcomfy.net/tools/7078/media-files/input.png"
},
"audio_url": {
"title": "Audio",
"description": "Audio format must be: wav, mp3. The duration of this audio must be less than 20s",
"type": "string",
"validations": [
{
"validation_rule": "file_size_mb<",
"validation_value": 15,
"validation_error": "File size must be less than 15 MB."
}
],
"default": "https://playgrounds-storage-public.runcomfy.net/tools/7078/media-files/input.mp3"
},
"style": {
"title": "Style",
"description": "The style of your character.",
"type": "string",
"enum": [
"speech",
"sing",
"perform"
],
"default": "speech"
},
"resolution": {
"title": "Resolution",
"description": "",
"type": "string",
"enum": [
"480P",
"720P"
],
"default": "480P"
}
}
}Output schema
{
"output": {
"type": "object",
"properties": {
"image": {
"type": "string",
"format": "uri",
"description": "single image URL"
},
"video": {
"type": "string",
"format": "uri",
"description": "single video URL"
},
"images": {
"type": "array",
"description": "multiple image URLs",
"items": { "type": "string", "format": "uri" }
},
"videos": {
"type": "array",
"description": "multiple video URLs",
"items": { "type": "string", "format": "uri" }
}
}
}
}