FusionX: シネマ品質の画像から動画生成 | Image to Video
FusionX は、テキストプロンプトまたは参照画像から映画品質のビデオを生成するツールで、Wan2.1-14B-Fusionx_Image2Video と NAG ガイダンスによる融合により、滑らかで高精細な映像を実現します。
Table of contents
1. Get started
Use RunComfy's API to run community/wan-2-1/fusionx/image-to-video. For accepted inputs and outputs, see the model's schema.
curl --request POST \
--url https://model-api.runcomfy.net/v1/models/community/wan-2-1/fusionx/image-to-video \
--header "Content-Type: application/json" \
--header "Authorization: Bearer <token>" \
--data '{
"600_prompt": "A close-up shot of a young woman standing against a graffiti-covered corridor wall. She wears a grey hoodie with a faded bear graphic, her hair tied in a messy bun. The camera slowly dolly zooms out as she raises her chin slightly and smiles with subtle confidence, eyes fixed on the lens. Warm bokeh lights blur behind her down the hallway, adding depth and intimacy. Her expression shifts from surprise to playful defiance, as if teasing the viewer with a secret. The mood is cinematic, cozy, and spontaneous — a slice of street-life charm.",
"608_image": "RunComfy_examples_1235_1.png"
}'2. Authentication
Set the YOUR_API_TOKEN environment variable with your API key (manage keys in your Profile) and include it on every request as a Bearer token via the Authorization header: Authorization: Bearer $YOUR_API_TOKEN.
3. API reference
Submit a request
Submit an asynchronous generation job and immediately receive a request_id plus URLs to check status, fetch results, and cancel.
curl --request POST \
--url https://model-api.runcomfy.net/v1/models/community/wan-2-1/fusionx/image-to-video \
--header "Content-Type: application/json" \
--header "Authorization: Bearer <token>" \
--data '{
"600_prompt": "A close-up shot of a young woman standing against a graffiti-covered corridor wall. She wears a grey hoodie with a faded bear graphic, her hair tied in a messy bun. The camera slowly dolly zooms out as she raises her chin slightly and smiles with subtle confidence, eyes fixed on the lens. Warm bokeh lights blur behind her down the hallway, adding depth and intimacy. Her expression shifts from surprise to playful defiance, as if teasing the viewer with a secret. The mood is cinematic, cozy, and spontaneous — a slice of street-life charm.",
"608_image": "RunComfy_examples_1235_1.png"
}'Monitor request status
Fetch the current state for a request_id ("in_queue", "in_progress", "completed", or "cancelled").
curl --request GET \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/status \
--header "Authorization: Bearer <token>"Retrieve request results
Retrieve the final outputs and metadata for the given request_id; if the job is not complete, the response returns the current state so you can continue polling.
curl --request GET \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/result \
--header "Authorization: Bearer <token>"Cancel a request
Cancel a queued job by request_id, in-progress jobs cannot be cancelled.
curl --request POST \
--url https://model-api.runcomfy.net/v1/requests/{request_id}/cancel \
--header "Authorization: Bearer <token>"4. File inputs
Hosted file (URL)
Provide a publicly reachable HTTPS URL. Ensure the host allows server‑side fetches (no login/cookies required) and isn't rate‑limited or blocking bots. Recommended limits: images ≤ 50 MB (~4K), videos ≤ 100 MB (~2–5 min @ 720p). Prefer stable or pre‑signed URLs for private assets.
5. Schema
Input schema
{
"type": "object",
"title": "Input",
"required": [
"600_prompt",
"608_image"
],
"properties": {
"600_prompt": {
"title": "Prompt",
"description": "",
"type": "string",
"default": "A close-up shot of a young woman standing against a graffiti-covered corridor wall. She wears a grey hoodie with a faded bear graphic, her hair tied in a messy bun. The camera slowly dolly zooms out as she raises her chin slightly and smiles with subtle confidence, eyes fixed on the lens. Warm bokeh lights blur behind her down the hallway, adding depth and intimacy. Her expression shifts from surprise to playful defiance, as if teasing the viewer with a secret. The mood is cinematic, cozy, and spontaneous — a slice of street-life charm."
},
"608_image": {
"title": "Image",
"description": "",
"type": "string",
"default": "RunComfy_examples_1235_1.png"
},
"598_seed": {
"title": "Seed",
"description": "",
"type": "integer",
"default": 96860978
},
"602_widthx602_height": {
"title": "Resolution (W:H)",
"description": "",
"type": "string",
"enum": [
"480x480 (1:1)",
"720x720 (1:1)",
"480x720 (2:3)",
"720x480 (3:2)",
"540x960 (9:16)",
"576x1024 (9:16)",
"720x1280 (9:16)",
"960x540 (16:9)",
"1024x576 (16:9)",
"1280x720 (16:9)"
],
"default": "720x480 (3:2)"
},
"598_steps": {
"title": "Steps",
"description": "Number of denoising iterations; more steps refine detail and stability but take longer.",
"type": "integer",
"default": 10,
"minimum": 6,
"maximum": 20
},
"598_cfg": {
"title": "Guidance Scale",
"description": "Controls how strongly the output adheres to the prompt versus allowing creative variation.",
"type": "float",
"default": 1,
"minimum": 0.6,
"maximum": 2
},
"598_shift": {
"title": "Shift",
"description": "Offsets the diffusion sampling schedule, trading stability for stronger motion/style as the value increases.",
"type": "float",
"default": 5,
"minimum": 1,
"maximum": 15
},
"602_num_frames": {
"title": "Number of Frames",
"description": "",
"type": "integer",
"enum": [
81,
141
],
"default": 81
},
"609_frame_rate": {
"title": "Frames Per Second",
"description": "",
"type": "integer",
"default": 16,
"minimum": 12,
"maximum": 24
}
}
}Output schema
{
"output": {
"type": "object",
"properties": {
"image": {
"type": "string",
"format": "uri",
"description": "single image URL"
},
"video": {
"type": "string",
"format": "uri",
"description": "single video URL"
},
"images": {
"type": "array",
"description": "multiple image URLs",
"items": { "type": "string", "format": "uri" }
},
"videos": {
"type": "array",
"description": "multiple video URLs",
"items": { "type": "string", "format": "uri" }
}
}
}
}