Community Infinite Talk API | Pricing & Docs

community/infinite-talk/fast/multi

Transform speech into realistic talking videos with precise lip-sync, expressive motion, and stable identity for lifelike avatars, dubbing, and long-form visual storytelling.

1. Get started

Use RunComfy's API to run community/infinite-talk/fast/multi. For accepted inputs and outputs, see the model's schema.

curl --request POST \
  --url https://model-api.runcomfy.net/v1/models/community/infinite-talk/fast/multi \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer <token>" \
  --data '{
    "left_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.mp3",
    "right_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-2.mp3",
    "image": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.png"
  }'

2. Authentication

Store your API key in the YOUR_API_TOKEN environment variable (manage keys in your Profile) and send it on every request as a Bearer token in the Authorization header: Authorization: Bearer $YOUR_API_TOKEN. In the curl examples on this page, <token> stands for that key.
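
As a rough Python sketch of the same step, the helper below reads the key from YOUR_API_TOKEN and builds the headers. The name auth_headers is illustrative only and is not part of any RunComfy SDK.

```python
import os


def auth_headers(env=None):
    """Build request headers carrying the API key as a Bearer token.

    `env` defaults to os.environ; passing a plain dict makes the helper
    easy to exercise without touching the real environment.
    """
    env = os.environ if env is None else env
    token = env.get("YOUR_API_TOKEN", "").strip()
    if not token:
        raise RuntimeError("YOUR_API_TOKEN is not set")
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
```

Failing early when the variable is missing tends to be easier to debug than an authentication error returned by the server.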

3. API reference

Submit a request

Submit an asynchronous generation job and immediately receive a request_id plus URLs to check status, fetch results, and cancel.

curl --request POST \
  --url https://model-api.runcomfy.net/v1/models/community/infinite-talk/fast/multi \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer <token>" \
  --data '{
    "left_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.mp3",
    "right_audio": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-2.mp3",
    "image": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.png"
  }'
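
The curl call above could be mirrored in Python roughly as follows, using only the standard library. The function name build_submit_request is hypothetical; the sketch prepares the request but does not send it, so the response handling is left to the caller.

```python
import json
import urllib.request

SUBMIT_URL = "https://model-api.runcomfy.net/v1/models/community/infinite-talk/fast/multi"


def build_submit_request(token, left_audio, right_audio, image):
    """Prepare (but do not send) the POST request shown in the curl example."""
    body = json.dumps(
        {"left_audio": left_audio, "right_audio": right_audio, "image": image}
    ).encode("utf-8")
    return urllib.request.Request(
        SUBMIT_URL,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


# Sending is left to the caller, for example:
#   with urllib.request.urlopen(build_submit_request(...)) as resp:
#       job = json.load(resp)  # expected to include the request_id
```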

Monitor request status

Fetch the current state for a request_id ("in_queue", "in_progress", "completed", or "cancelled").

curl --request GET \
  --url https://model-api.runcomfy.net/v1/requests/{request_id}/status \
  --header "Authorization: Bearer <token>"
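
A polling loop around this endpoint might look like the sketch below. It treats "completed" and "cancelled" as terminal because those are the only finished states listed here; the function name, the poll interval, and the poll limit are assumptions. The HTTP call itself is injected as fetch_state, since this page does not show how the state appears in the response body.

```python
import time

TERMINAL_STATES = {"completed", "cancelled"}


def wait_until_done(fetch_state, interval=2.0, max_polls=300, sleep=time.sleep):
    """Call fetch_state() until it returns a terminal state, then return it.

    fetch_state is a caller-supplied function that performs the GET request
    and returns the state string; how the state is read from the response
    body is left to that function.
    """
    for _ in range(max_polls):
        state = fetch_state()
        if state in TERMINAL_STATES:
            return state
        sleep(interval)
    raise TimeoutError(f"job still not finished after {max_polls} polls")
```

Capping the number of polls keeps a stuck job from blocking the caller indefinitely.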

Retrieve request results

Retrieve the final outputs and metadata for the given request_id; if the job is not complete, the response returns the current state so you can continue polling.

curl --request GET \
  --url https://model-api.runcomfy.net/v1/requests/{request_id}/result \
  --header "Authorization: Bearer <token>"
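
Because this endpoint may return either the finished outputs or the current state, client code usually has to branch on the response. The sketch below shows one way to do that. It assumes the JSON body carries the state under a "status" key and the outputs under an "output" key; the "status" name in particular is a guess and should be checked against a real response.

```python
def read_result(payload):
    """Split a decoded /result response into (state, output).

    Assumption: the JSON body has a "status" key holding the state and, once
    the job has completed, an "output" object. The "status" key name is not
    confirmed by this page, so adjust it to the real response.
    """
    state = payload.get("status")
    if state == "completed":
        return state, payload.get("output", {})
    # Not finished yet: report the state so the caller can keep polling.
    return state, None
```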

Cancel a request

Cancel a queued job by request_id; in-progress jobs cannot be cancelled.

curl --request POST \
  --url https://model-api.runcomfy.net/v1/requests/{request_id}/cancel \
  --header "Authorization: Bearer <token>"
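
Since only queued jobs can be cancelled, a client can skip the call when the last known state is anything else. A minimal sketch of that check, with the hypothetical helper name build_cancel_request:

```python
import urllib.request

API_ROOT = "https://model-api.runcomfy.net/v1"


def build_cancel_request(token, request_id, state):
    """Return a POST request for the cancel endpoint, or None if not queued."""
    if state != "in_queue":
        return None  # in-progress (or already finished) jobs cannot be cancelled
    return urllib.request.Request(
        f"{API_ROOT}/requests/{request_id}/cancel",
        method="POST",
        headers={"Authorization": f"Bearer {token}"},
    )
```

The state can change between the status check and the cancel call, so the server's answer remains authoritative.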

4. File inputs

Hosted file (URL)

Provide a publicly reachable HTTPS URL. Ensure the host allows server-side fetches (no login/cookies required) and isn't rate-limited or blocking bots. Recommended limits: images ≤ 50 MB (~4K), videos ≤ 100 MB (~2–5 min @ 720p). Prefer stable or pre-signed URLs for private assets.
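
Some of these requirements can be checked locally before submitting. The sketch below verifies the HTTPS scheme and, when the file size is known, compares it with the recommended limits. The helper name is hypothetical, 1 MB is read as 1,000,000 bytes (the stricter interpretation), and reachability from the server side cannot be verified this way.

```python
from urllib.parse import urlparse

MB = 1_000_000  # assumption: decimal megabytes, the stricter reading
RECOMMENDED_MAX_BYTES = {"image": 50 * MB, "video": 100 * MB}


def check_hosted_file(url, kind=None, size_bytes=None):
    """Return a list of problems found with a hosted input file (empty if none)."""
    problems = []
    parts = urlparse(url)
    if parts.scheme != "https" or not parts.netloc:
        problems.append("URL must be an absolute HTTPS URL")
    # Only kinds with a recommended limit on this page are size-checked.
    limit = RECOMMENDED_MAX_BYTES.get(kind)
    if limit is not None and size_bytes is not None and size_bytes > limit:
        problems.append(f"{kind} exceeds the recommended {limit // MB} MB")
    return problems
```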

5. Schema

Input schema

{
  "type": "object",
  "title": "Input schema",
  "required": [
    "left_audio",
    "right_audio",
    "image"
  ],
  "properties": {
    "left_audio": {
      "title": "Left Audio",
      "description": "The audio of the person on the left for generating the output. The duration of this audio should be less than 10 minutes.",
      "type": "string",
      "default": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.mp3"
    },
    "right_audio": {
      "title": "Right Audio",
      "description": "The audio of the person on the right for generating the output. The duration of this audio should be less than 10 minutes.",
      "type": "string",
      "default": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-2.mp3"
    },
    "image": {
      "title": "Image",
      "description": "The image for generating the output.",
      "type": "string",
      "default": "https://playgrounds-storage-public.runcomfy.net/tools/7230/media-files/input-1-1.png"
    },
    "prompt": {
      "title": "Prompt",
      "description": "The positive prompt for the generation.",
      "type": "string",
      "default": ""
    },
    "order": {
      "title": "Order",
      "description": "The order of the two audio sources in the output video. \"meanwhile\" means both audio sources will play at the same time, \"left_right\" means the left audio will play first then the right audio will play, and \"right_left\" means the right audio will play first then the left audio will play.",
      "type": "string",
      "enum": [
        "meanwhile",
        "left_right",
        "right_left"
      ],
      "default": "left_right"
    },
    "seed": {
      "title": "Seed",
      "description": "The random seed to use for the generation. -1 means a random seed will be used.",
      "type": "integer",
      "minimum": -1,
      "maximum": 2147483647,
      "default": -1
    }
  }
}
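
The constraints in this schema can be checked on the client before a request is sent. The following sketch hand-codes the required fields, the order enum, and the seed range rather than using a JSON Schema library; validate_input is an illustrative name.

```python
REQUIRED = ("left_audio", "right_audio", "image")
ORDERS = ("meanwhile", "left_right", "right_left")
SEED_MIN, SEED_MAX = -1, 2147483647


def validate_input(payload):
    """Return a list of schema violations for a request body (empty if valid)."""
    errors = []
    for key in REQUIRED:
        if not isinstance(payload.get(key), str):
            errors.append(f"{key} is required and must be a string")
    if not isinstance(payload.get("prompt", ""), str):
        errors.append("prompt must be a string")
    if payload.get("order", "left_right") not in ORDERS:
        errors.append(f"order must be one of {', '.join(ORDERS)}")
    seed = payload.get("seed", -1)
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(seed, bool) or not isinstance(seed, int):
        errors.append("seed must be an integer")
    elif not SEED_MIN <= seed <= SEED_MAX:
        errors.append(f"seed must be between {SEED_MIN} and {SEED_MAX}")
    return errors
```

Optional fields fall back to the schema defaults ("left_right" for order, -1 for seed) when they are absent.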

Output schema

{
  "output": {
    "type": "object",
    "properties": {
      "image": {
        "type": "string",
        "format": "uri",
        "description": "single image URL"
      },
      "video": {
        "type": "string",
        "format": "uri",
        "description": "single video URL"
      },
      "images": {
        "type": "array",
        "description": "multiple image URLs",
        "items": {
          "type": "string",
          "format": "uri"
        }
      },
      "videos": {
        "type": "array",
        "description": "multiple video URLs",
        "items": {
          "type": "string",
          "format": "uri"
        }
      }
    }
  }
}
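
None of the four output fields is marked as required, so a consumer probably should not assume any particular one is present. As a sketch, the hypothetical helper below folds the single-value and list fields into one structure:

```python
def collect_media_urls(output):
    """Flatten an output object into {"images": [...], "videos": [...]}."""
    images = list(output.get("images") or [])
    videos = list(output.get("videos") or [])
    # Put the single-value fields first so they are easy to pick out.
    if output.get("image"):
        images.insert(0, output["image"])
    if output.get("video"):
        videos.insert(0, output["video"])
    return {"images": images, "videos": videos}
```

For this model the generated talking video would be expected under the video fields, though this page does not say which of them is populated.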

Infinite Talk Multi-Person: Audio-to-Video Generation with Multi-Person Support