Infinite Talk Multi-Person: Audio-to-Video Generation with Multi-Person Support
Transform speech into realistic talking videos with precise lip-sync, expressive motion, and stable identity for lifelike avatars, dubbing, and long-form visual storytelling.
1. Get started
Use RunComfy's API to run community/infinite-talk/fast/multi. For accepted inputs and outputs, see the model's schema.
curl --request POST \
--url https://model-api.runcomfy.net/v1/models/community/infinite-talk/fast/multi \
--header "Content-Type: application/json" \
--header "Authorization: Bearer <token>" \
--data '{
"left_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.mp3",
"right_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-2.mp3",
"image": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.png"
}'
2. Authentication
Set the YOUR_API_TOKEN environment variable to your API key (manage keys in your Profile) and send it on every request as a Bearer token in the Authorization header: Authorization: Bearer $YOUR_API_TOKEN. In the curl examples on this page, replace <token> with that value.
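As a minimal sketch of the same idea in Python, the helper below reads the key from the YOUR_API_TOKEN environment variable and builds the headers used by the requests on this page. The function name auth_headers is hypothetical and not part of RunComfy's API.

```python
import os


def auth_headers(env_var: str = "YOUR_API_TOKEN") -> dict:
    """Build request headers, reading the API key from the environment.

    auth_headers is an illustrative helper name, not a RunComfy function.
    """
    token = os.environ.get(env_var, "").strip()
    if not token:
        # Fail early instead of sending a request with an empty Bearer token.
        raise RuntimeError(f"Set the {env_var} environment variable to your API key")
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
```

Reading the key from the environment keeps it out of source files and shell history.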
3. API reference
Submit a request
Submit an asynchronous generation job and immediately receive a request_id plus URLs to check status, fetch results, and cancel.
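For callers working in Python rather than curl, the submission might be sketched as follows using only the standard library. The helper names (build_payload, build_submit_request, submit) are hypothetical. The endpoint, field names, order values, and seed range come from this page. The shape of the response beyond request_id is not specified here, so the sketch simply returns the decoded JSON.

```python
import json
import urllib.request

SUBMIT_URL = "https://model-api.runcomfy.net/v1/models/community/infinite-talk/fast/multi"
ORDERS = ("meanwhile", "left_right", "right_left")  # enum from the input schema


def build_payload(left_audio, right_audio, image, prompt="", order="left_right", seed=-1):
    """Assemble the JSON body and apply the constraints listed in the input schema."""
    if order not in ORDERS:
        raise ValueError(f"order must be one of {ORDERS}")
    if not -1 <= seed <= 2147483647:
        raise ValueError("seed must be between -1 and 2147483647")
    return {
        "left_audio": left_audio,
        "right_audio": right_audio,
        "image": image,
        "prompt": prompt,
        "order": order,
        "seed": seed,
    }


def build_submit_request(payload, token):
    """Create the POST request without sending it (useful for inspection)."""
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def submit(payload, token):
    """Send the job and return the decoded JSON response (includes request_id)."""
    with urllib.request.urlopen(build_submit_request(payload, token)) as resp:
        return json.load(resp)
```

Validating order and seed locally catches schema violations before a job is queued.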
curl --request POST \
--url https://model-api.runcomfy.net/v1/models/community/infinite-talk/fast/multi \
--header "Content-Type: application/json" \
--header "Authorization: Bearer <token>" \
--data '{
"left_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.mp3",
"right_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-2.mp3",
"image": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.png"
}'
Monitor request status
Fetch the current state for a request_id ("in_queue", "in_progress", "completed", or "cancelled").
curl --request GET \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/status \
--header "Authorization: Bearer <token>"
Retrieve request results
Retrieve the final outputs and metadata for the given request_id; if the job is not complete, the response returns the current state so you can continue polling.
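Taken together, the status and result endpoints support a simple polling loop. The sketch below is one possible shape for it, under some assumptions. It assumes the status response is a JSON object with a "status" field holding one of the four documented states. The names get_json and wait_for_result, the 5-second interval, and the 30-minute timeout are all illustrative choices.

```python
import json
import time
import urllib.request

BASE_URL = "https://model-api.runcomfy.net/v1/requests"


def get_json(url, token):
    """GET a URL with the Bearer token and decode the JSON body."""
    req = urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_result(request_id, token, fetch=get_json, interval=5.0,
                    timeout=1800.0, sleep=time.sleep):
    """Poll the status endpoint until the job completes, then fetch the result.

    Assumption: the status JSON carries the state under a "status" key.
    """
    deadline = time.monotonic() + timeout
    while True:
        state = fetch(f"{BASE_URL}/{request_id}/status", token).get("status")
        if state == "completed":
            return fetch(f"{BASE_URL}/{request_id}/result", token)
        if state == "cancelled":
            raise RuntimeError(f"request {request_id} was cancelled")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"request {request_id} still {state!r} after {timeout}s")
        # "in_queue", "in_progress", or an unrecognised state: wait and retry.
        sleep(interval)
```

Passing fetch and sleep as parameters lets the loop be exercised with stubs, without calling the network or waiting in real time.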
curl --request GET \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/result \
--header "Authorization: Bearer <token>"
Cancel a request
Cancel a queued job by request_id. Jobs that are already in progress cannot be cancelled.
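Because only queued jobs can be cancelled, a client may want to check the last known state before calling the cancel endpoint. The short sketch below shows one way to do that. The names can_cancel and build_cancel_request are hypothetical, and the state strings are the ones listed for the status endpoint.

```python
import urllib.request


def can_cancel(state):
    """Only jobs still waiting in the queue are eligible for cancellation."""
    return state == "in_queue"


def build_cancel_request(request_id, token):
    """Create the POST request for the cancel endpoint without sending it."""
    url = f"https://model-api.runcomfy.net/v1/requests/{request_id}/cancel"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {token}"},
        method="POST",
    )
```

A job can leave the queue between the status check and the cancel call, so the server's response to the cancel request remains the authoritative answer.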
curl --request POST \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/cancel \
--header "Authorization: Bearer <token>"
4. File inputs
Hosted file (URL)
Provide a publicly reachable HTTPS URL. Ensure the host allows server‑side fetches (no login/cookies required) and isn't rate‑limited or blocking bots. Recommended limits: images ≤ 50 MB (~4K), videos ≤ 100 MB (~2–5 min @ 720p). Prefer stable or pre‑signed URLs for private assets.
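Some of these requirements can be checked locally before a job is submitted. The sketch below is a hypothetical pre-flight helper (check_input_url is not part of the API). It only inspects the URL string. It cannot confirm that the host allows server-side fetches or that the file is within the recommended size limits.

```python
from urllib.parse import urlparse


def check_input_url(url):
    """Return a list of problems found in a file-input URL (empty if none)."""
    problems = []
    parts = urlparse(url)
    if parts.scheme != "https":
        problems.append("URL must use https")
    if not parts.netloc:
        problems.append("URL has no host")
    elif parts.hostname in ("localhost", "127.0.0.1"):
        # A loopback address cannot be fetched by a remote server.
        problems.append("host is not publicly reachable")
    return problems
```

For example, check_input_url("http://example.com/a.mp3") reports that the URL must use https, while the sample asset URLs on this page pass with no problems.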
5. Schema
Input schema
{
"type": "object",
"title": "Input",
"required": [
"left_audio",
"right_audio",
"image"
],
"properties": {
"left_audio": {
"title": "Left Audio",
"description": "The audio of the person on the left for generating the output. The duration of this audio should be less than 10 minutes.",
"type": "string",
"default": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.mp3"
},
"right_audio": {
"title": "Right Audio",
"description": "The audio of the person on the right for generating the output. The duration of this audio should be less than 10 minutes.",
"type": "string",
"default": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-2.mp3"
},
"image": {
"title": "Image",
"description": "The image for generating the output.",
"type": "string",
"default": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.png"
},
"prompt": {
"title": "Prompt",
"description": "The positive prompt for the generation.",
"type": "string",
"default": ""
},
"order": {
"title": "Order",
"description": "The order of the two audio sources in the output video. \"meanwhile\" means both audio sources will play at the same time, \"left_right\" means the left audio will play first then the right audio will play, and \"right_left\" means the right audio will play first then the left audio will play.",
"type": "string",
"enum": [
"meanwhile",
"left_right",
"right_left"
],
"default": "left_right"
},
"seed": {
"title": "Seed",
"description": "The random seed to use for the generation. -1 means a random seed will be used.",
"type": "integer",
"minimum": -1,
"maximum": 2147483647,
"default": -1
}
}
}
Output schema
{
"output": {
"type": "object",
"properties": {
"image": {
"type": "string",
"format": "uri",
"description": "single image URL"
},
"video": {
"type": "string",
"format": "uri",
"description": "single video URL"
},
"images": {
"type": "array",
"description": "multiple image URLs",
"items": { "type": "string", "format": "uri" }
},
"videos": {
"type": "array",
"description": "multiple video URLs",
"items": { "type": "string", "format": "uri" }
}
}
}
}
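The output schema allows a single image or video URL as well as lists of each, so client code usually has to handle all four fields. The sketch below gathers whatever URLs are present. It assumes the result JSON has a top-level "output" object with the fields named in the schema. The helper name collect_output_urls is hypothetical.

```python
def collect_output_urls(result):
    """Gather every URL found under the "output" object of a result response.

    Assumption: the result JSON mirrors the output schema, with optional
    "image", "video", "images", and "videos" fields under "output".
    """
    output = result.get("output") or {}
    urls = []
    for key in ("image", "video"):  # single-URL fields
        value = output.get(key)
        if isinstance(value, str):
            urls.append(value)
    for key in ("images", "videos"):  # list-of-URL fields
        urls.extend(u for u in (output.get(key) or []) if isinstance(u, str))
    return urls
```

Missing fields are skipped, so the helper returns an empty list for a response that has no outputs yet.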
