Wan2.2 Fun Inp turns two still images into a coherent video by guiding the model from a first frame to a last frame with natural interpolation in between. It is designed for artists, animators, and filmmakers who want cinematic consistency while retaining prompt control. The workflow ships with two parallel presets so you can prioritize either ultra-fast 4-step synthesis or more general fp8-scaled generation, both powered by Wan 2.2 Fun Inpaint.
Wan 2.2 Fun Inpaint 14B (fp8 scaled)
The main diffusion backbone specialized for “Fun Inpaint” video generation. Two variants are included: high noise for larger motion and creative transitions, and low noise when you need tighter fidelity to your start/end frames.
• High noise:
• Low noise:
Lightning 4-Step LoRA for I2V
An optional LoRA that compresses the sampling schedule to just four steps for rapid iteration, ideal for previews and quick drafts.
• Low noise LoRA:
• High noise LoRA:
Wan VAE
The VAE handles latent–pixel conversions used by Wan models; it preserves detail and tone during decode/encode. See the Wan 2.2 package on .
CLIP text encoder
Encodes your positive and negative prompts into conditioning vectors that steer the visual narrative. Reference implementation: .
ComfyUI Video Helper Suite (export)
Combines generated frames into an MP4 at your chosen frame rate. Repo: .
The graph contains two parallel groups you can toggle depending on speed vs generality. Enable only one at a time for clean runs.
Use the fp8_scaled + 4 steps LoRA group for very fast previews. The group loads the Wan 2.2 backbone plus a Lightning 4-Step LoRA and routes your prompts through the short sampler path. Provide your start and end images, then adjust the high-level parameters as needed. Internally, WanFunInpaintToVideo (#111) seeds the trajectory from first to last frame, while a short sampler refines motion and structure in a handful of steps.
Choose the fp8_scaled group when you want a broader operating range without the 4-step constraint. This path uses the fp8-scaled Wan 2.2 model directly, keeping the same first-to-last frame guidance but with a standard sampler budget for more nuanced detail recovery and motion shaping. The node WanFunInpaintToVideo (#148) anchors the trajectory and hands off to the downstream sampler for refinement.
Both groups include an Upload start and end images section. Plug in a start image that sets the opening composition and an end image that defines the final pose or scene. The workflow interpolates motion and appearance between them while respecting your text prompts. For best results, keep the aspect ratio consistent across both images.
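One way to keep the aspect ratio consistent is to center-crop both images to the target ratio before uploading them. The sketch below computes such a crop box with plain integer arithmetic. `center_crop_box` is a hypothetical helper, not part of the workflow, and 576×1024 is used as the target only because it is the default resolution mentioned in this guide.

```python
def center_crop_box(src_w, src_h, target_w=576, target_h=1024):
    """Return (left, top, right, bottom) for the largest centered crop
    of a src_w x src_h image that matches the target aspect ratio."""
    # Compare aspect ratios by cross-multiplying to stay in integers.
    if src_w * target_h > src_h * target_w:
        # Source is wider than the target: keep full height, trim the sides.
        crop_h = src_h
        crop_w = src_h * target_w // target_h
    else:
        # Source is taller than (or equal to) the target: keep full width.
        crop_w = src_w
        crop_h = src_w * target_h // target_w
    left = (src_w - crop_w) // 2
    top = (src_h - crop_h) // 2
    return (left, top, left + crop_w, top + crop_h)


# A 1920x1080 landscape image cropped toward a 576x1024 portrait ratio.
print(center_crop_box(1920, 1080))  # (656, 0, 1263, 1080)
```

The returned tuple uses the (left, upper, right, lower) order that Pillow's `Image.crop` expects, so it can be applied to both the start and end image before resizing them to the workflow resolution.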
Write what you want to see in the Positive Prompt and what to avoid in the Negative Prompt. The nodes CLIP Text Encode (Positive Prompt) and CLIP Text Encode (Negative Prompt) transform your text into conditioning that steers content, style, and dynamics. Use concise, scene-oriented phrases (actions, camera cues, materials, mood) rather than long lists.
Set width, height, and length in the WanFunInpaintToVideo node to define spatial resolution and frame count. Defaults are tuned for a tall 576×1024 video with about 3–4 seconds of motion at 24 fps. Longer sequences generally benefit from the fp8-scaled path; short previews are great with the 4-step LoRA group.
VHS_VideoCombine assembles frames into an MP4 with a default 24 fps and a quality-friendly CRF. The file names are prefixed for each branch (for example, Fun_Inp and Fun_Inp_4_Step) so you can compare outputs easily. Adjust the frame rate if you need slower or faster playback.
Box-select a group and use Ctrl+B to enable or disable it. If you enable the fp8_scaled group, disable the fp8_scaled + 4 steps LoRA group, and vice versa. You can also use ComfyUI’s partial execution features to run just the sections you are tweaking.
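If you save the workflow as JSON, a quick script can confirm that only one branch is active before a long run. This is a sketch under stated assumptions: it assumes the exported file has a top-level `nodes` list whose entries carry `id`, `type`, and `mode`, and that mode 0 means active while 2 and 4 mean muted and bypassed. The `sample` dictionary is a hypothetical, heavily trimmed stand-in for a real export.

```python
# Assumed mode values: 0 = active, 2 = muted, 4 = bypassed.
INACTIVE_MODES = {2, 4}


def active_nodes(workflow, node_type):
    """Return ids of nodes of `node_type` that are neither muted nor bypassed."""
    return [
        node["id"]
        for node in workflow.get("nodes", [])
        if node.get("type") == node_type
        and node.get("mode", 0) not in INACTIVE_MODES
    ]


# Hypothetical trimmed export: the 4-step branch (#111) is bypassed.
sample = {
    "nodes": [
        {"id": 111, "type": "WanFunInpaintToVideo", "mode": 4},
        {"id": 148, "type": "WanFunInpaintToVideo", "mode": 0},
    ]
}

active = active_nodes(sample, "WanFunInpaintToVideo")
if len(active) == 1:
    print(f"Active branch node: #{active[0]}")
else:
    print(f"Expected exactly one active branch, found: {active}")
```

With a real export, load the file with `json.load` and pass the resulting dictionary in place of `sample`.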
WanFunInpaintToVideo (#111 and #148)
The core engine that blends your start_image and end_image into a continuous latent trajectory. It accepts width, height, and length to set video size and duration, then emits a latent sequence plus updated positive/negative conditioning. Start here when tuning continuity, pacing, or composition across the shot.
UNETLoader (#101, #102)
Chooses the Wan 2.2 Fun Inpaint model variant. Use high noise for bolder motion and more transformative interpolations. Use low noise when preserving the start and end frame identity and texture is the priority. Pair either with or without the 4-step LoRA depending on speed needs.
ModelSamplingSD3 (#93)
Configures the sampler schedule used downstream. Keep it aligned with the chosen LoRA or fp8 path. If you see temporal flicker, modest adjustments to the sampler mode or steps can smooth transitions without over-sharpening details.
KSamplerAdvanced (#150)
Applies a refinement pass to the latent sequence. Increase steps slightly if you need crisper micro-detail on faces, hands, or thin structures; reduce steps for softer, dreamier motion. Avoid extreme CFG or step counts that can destabilize temporal consistency.
VHS_VideoCombine (#159)
Merges rendered frames to MP4. Adjust frame_rate for motion feel and playback speed, and keep the default pix_fmt for broad player compatibility. Lower CRF yields larger files with finer gradients; higher CRF compresses more aggressively.
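To see how frame rate, CRF, and pixel format interact outside ComfyUI, the sketch below builds an ffmpeg command for a comparable H.264 encode from a numbered PNG sequence. It is an illustration only and does not claim to reproduce what VHS_VideoCombine runs internally. `build_ffmpeg_cmd`, the file names, the CRF of 19, and the yuv420p pixel format are all assumptions chosen for the example.

```python
def build_ffmpeg_cmd(frame_pattern, output_path, fps=24, crf=19, pix_fmt="yuv420p"):
    """Build an ffmpeg argument list that encodes numbered frames to MP4."""
    return [
        "ffmpeg", "-y",           # overwrite the output file if it exists
        "-framerate", str(fps),   # input frame rate sets playback speed
        "-i", frame_pattern,      # e.g. frames/frame_%05d.png
        "-c:v", "libx264",
        "-crf", str(crf),         # lower = larger file, finer gradients
        "-pix_fmt", pix_fmt,      # yuv420p plays in most players
        output_path,
    ]


cmd = build_ffmpeg_cmd("frames/frame_%05d.png", "Fun_Inp_preview.mp4")
print(" ".join(cmd))
# To run the encode (requires ffmpeg on PATH):
#   import subprocess; subprocess.run(cmd, check=True)
```

Re-encoding the same frames at two CRF values is a quick way to judge the trade-off between file size and gradient quality for a particular shot.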
Match both input images to the target width×height to reduce unwanted cropping or warping.
The Wan2.2 Fun Inp workflow expands the creative possibilities of AI video generation by bridging start-to-end frame control with natural interpolation. It’s a versatile tool for artists, animators, and filmmakers who want cinematic consistency in their AI-driven projects.
Special thanks to the ComfyUI and Wan teams for enabling seamless integration of the Fun Inp workflow into next-gen creative pipelines.