ltx/ltx-2/retake-video

Retake selected segments of an existing video with unified audio-video generation, 4K motion support, fast modes, and open-source access for cinematic, high-fidelity creative production.

video_url: The URL of the video to retake.
start_time: The start time of the segment to retake, in seconds.
duration: The duration of the segment to retake, in seconds.
retake_mode: The retake mode to use (replace_audio, replace_video, or replace_audio_and_video).
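
The four inputs above (named video_url, start_time, duration, and retake_mode in the prompting guide further down this page, which also mentions a prompt) can be collected and checked before submission. The following is a minimal sketch only: the RetakeRequest class, its validation rules, and the JSON layout are hypothetical and not an official request schema, and no network call is made.

```python
import json
from dataclasses import dataclass, asdict

# Mode names as listed in the prompting guide on this page.
RETAKE_MODES = ("replace_audio", "replace_video", "replace_audio_and_video")


@dataclass
class RetakeRequest:
    """Hypothetical container for the retake inputs; not an official schema."""
    video_url: str
    prompt: str
    start_time: float  # seconds from the start of the source clip
    duration: float    # length of the retaken segment, in seconds
    retake_mode: str   # one of RETAKE_MODES

    def validate(self) -> None:
        # These checks are assumptions about sensible inputs, not documented rules.
        if not self.video_url.startswith(("http://", "https://")):
            raise ValueError("video_url must be an http(s) URL")
        if self.start_time < 0:
            raise ValueError("start_time must be >= 0")
        if self.duration <= 0:
            raise ValueError("duration must be > 0")
        if self.retake_mode not in RETAKE_MODES:
            raise ValueError(f"retake_mode must be one of {RETAKE_MODES}")

    def to_json(self) -> str:
        # Validate first so an invalid request never gets serialized.
        self.validate()
        return json.dumps(asdict(self), indent=2)


if __name__ == "__main__":
    request = RetakeRequest(
        video_url="https://example.com/source.mp4",  # placeholder URL
        prompt="Preserve subject and camera path; restyle background to overcast city.",
        start_time=0.0,
        duration=5.0,
        retake_mode="replace_video",
    )
    print(request.to_json())
```

Validating locally before sending a request is a design choice intended to catch a mistyped mode or a negative start time without spending credits on a failed run.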

Introduction to LTX 2 Retake Video Generator

LTX 2 is an open-source AI video foundation model from Lightricks Ltd, announced on October 23, 2025. Retake video is its mode for regenerating a chosen segment of an existing clip. LTX 2 generates audio and video together in a single pass and produces native 4K clips at up to 50 frames per second. It accepts several input types (text, image, depth map, or short video), and three performance modes (Fast, Pro, Ultra) let you trade off quality, speed, and cost. Open-source access to weights and tooling makes high-end video generation practical even on consumer-grade GPUs for creators, studios, and enterprises. As a generation tool, LTX 2 retake video converts prompts or reference clips into 4K video content ready for production. It is built for creators who need precise control, cinematic fidelity, and cost-efficient rendering for storytelling, marketing, gaming, and design.

What makes LTX 2 Video Retake stand out

LTX 2 video retake is a high-fidelity video-to-video model built for precise retakes that preserve spatial layout, motion continuity, and scene identity. Given a source clip, LTX 2 performs targeted transformations rather than full-frame regeneration, maintaining believable structure while updating style, content, or pacing. With unified audio-video generation, LTX 2 can replace visuals, audio, or both in sync, minimizing drift across edits. Support for 4K motion and fast modes allows LTX 2 to deliver cinematic detail with production-friendly turnaround times. Granular control over start time and duration helps LTX 2 operate surgically on segments, keeping the surrounding footage intact for seamless joins. Open-source access enables teams to adapt LTX 2 to varied pipelines while retaining consistent realism and temporal stability. In complex scenes, LTX 2 balances adherence to the base camera motion with prompt-driven changes.

Key capabilities:

  • Structure-preserving edits that maintain pose, layout, depth, and motion cues to avoid warping or re-synthesis artifacts.
  • Unified audio-video retakes: replace audio, replace video, or both with tight temporal synchronization.
  • Segment control via start_time and duration for localized changes while protecting the rest of the timeline.
  • Realistic continuity across frames with stable lighting, shadows, reflections, and minimized flicker.
  • Style and content restyling for cinematic grades, texture updates, and object swaps without breaking composition.
  • Production-ready performance with 4K motion support and fast modes for iterative turnarounds.
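
The segment control described in the list above can be pictured as splitting the source timeline into spans that are kept and a span that is retaken. The sketch below is illustrative only: split_timeline is a hypothetical helper, and rejecting a segment that runs past the end of the clip is an assumption, since the page does not say how such input is handled.

```python
from typing import List, Tuple

Span = Tuple[str, float, float]  # (label, start_seconds, end_seconds)


def split_timeline(clip_length: float, start_time: float, duration: float) -> List[Span]:
    """Return the untouched and retaken spans of a clip, in playback order."""
    if clip_length <= 0:
        raise ValueError("clip_length must be > 0")
    if start_time < 0 or duration <= 0:
        raise ValueError("start_time must be >= 0 and duration must be > 0")
    end_time = start_time + duration
    if end_time > clip_length:
        # Assumption: treat an out-of-range segment as an error.
        raise ValueError("segment extends past the end of the clip")

    spans: List[Span] = []
    if start_time > 0:
        spans.append(("keep", 0.0, start_time))
    spans.append(("retake", start_time, end_time))
    if end_time < clip_length:
        spans.append(("keep", end_time, clip_length))
    return spans


if __name__ == "__main__":
    # A 10 s clip with a retake from 2 s to 7 s leaves two untouched spans.
    for label, begin, end in split_timeline(clip_length=10.0, start_time=2.0, duration=5.0):
        print(f"{label:6s} {begin:4.1f}s -> {end:4.1f}s")
```

Listing the kept spans explicitly makes it easy to check that the join points (here 2 s and 7 s) fall on sensible edit boundaries before running a retake.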

Prompting guide for LTX 2

Begin by supplying a clear video_url and a concise prompt describing the intended retake; specify what to change and what to preserve so LTX 2 video retake targets the correct region. Use start_time and duration to define the exact segment, allowing LTX 2 to work nondestructively within in-out boundaries. Set retake_mode to replace_audio, replace_video, or replace_audio_and_video; LTX 2 will synchronize timing and maintain alignment with the source. For visual edits, describe motion, style, and camera cues; for audio edits, provide timing or dialogue intent so LTX 2 video retake can align delivery and ambience. Iterate with concise refinements and, when needed, provide references to guide color, texture, or mood while LTX 2 preserves the original scene geometry. When upscaling or finishing, keep shot cadence and edit points consistent so LTX 2 video retake can render transitions cleanly.

Example prompts and cases:

  • replace_video, 0-5 s: preserve subject and camera path; restyle background to overcast city, subtle cinematic grade.
  • replace_audio, 2-7 s: keep all visuals; swap ambient crowd sound with quiet interior tone, maintain natural reverberation.
  • replace_audio_and_video, 5-10 s: convert midday to golden hour, add soft wind ambience; preserve actor pose and pacing.
  • Localized background change only: keep skin tones, wardrobe, and lens bokeh; replace skyline with foggy harbor.
  • Object removal: remove a moving boom mic in the top-right; maintain reflections and consistent grain.

Pro tips:

  • State hard constraints first so LTX 2 prioritizes them.
  • Use precise spatial and temporal language such as "background only", "lower third", or "0-3 s region".
  • Avoid conflicting adjectives; prefer a few strong descriptors that match the footage.
  • Feed clean, stable sources; trim or stabilize shaky handles before processing.
  • Iterate with short updates, compare versions side by side, and lock what works before expanding scope.
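
The tips above about stating hard constraints first and using precise spatial language can be applied mechanically when prompts are assembled in code. The sketch below is one possible way to do that: compose_prompt and its output wording are hypothetical, and the page does not document how LTX 2 weights prompt order, so treat the ordering as a convention rather than a guarantee.

```python
from typing import Optional, Sequence


def compose_prompt(
    preserve: Sequence[str],
    change: Sequence[str],
    region: Optional[str] = None,
) -> str:
    """Build a retake prompt with hard constraints first, then the requested changes."""
    if not change:
        raise ValueError("describe at least one change")
    parts = []
    if preserve:
        # Hard constraints go first, following the tip above.
        parts.append("preserve " + ", ".join(preserve))
    scope = f" ({region})" if region else ""
    parts.append("change" + scope + ": " + ", ".join(change))
    text = "; ".join(parts) + "."
    return text[0].upper() + text[1:]


if __name__ == "__main__":
    print(
        compose_prompt(
            preserve=["skin tones", "wardrobe", "lens bokeh"],
            change=["replace skyline with foggy harbor"],
            region="background only",
        )
    )
    # Preserve skin tones, wardrobe, lens bokeh; change (background only): replace skyline with foggy harbor.
```

Keeping the preserve and change lists separate also makes it easier to lock what works between iterations and vary only the change list.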

Frequently Asked Questions

What is LTX 2 and what does its video-to-video feature do?

LTX 2 is an open-source AI video foundation model developed by Lightricks that enables users to generate and edit high-resolution clips. Its video-to-video function allows creators to transform existing footage into new output styles or sequences while maintaining motion consistency and synchronized audio.

How does LTX 2’s video-to-video mode differ from earlier versions like LTX Video?

LTX 2 introduces unified audio-video generation in one pass, unlike older models that required separate audio stitching. The video-to-video mode in LTX 2 also supports multi-keyframe conditioning, achieving both higher fidelity and smoother transitions compared to earlier releases.

Is LTX 2 free to use, and how are costs structured for video-to-video generation?

LTX 2 itself is open source, but access through RunComfy's playground platform requires user credits. Each video-to-video generation consumes credits based on duration and the chosen performance mode (Fast, Pro, or Ultra), with free trial credits available for new users.

What kind of input files can LTX 2 handle for its video-to-video workflows?

LTX 2 supports a range of input modalities: text prompts, images, depth maps, or short videos. The video-to-video mode is particularly flexible, letting creators refine existing footage with custom styles, camera movement, and sound synchronization.

Who benefits most from using LTX 2 in video-to-video production?

LTX 2 is ideal for creators, studios, and designers in fields like marketing, VFX, film, or gaming. Its video-to-video capabilities help these users efficiently craft storyboards, branded content, and cinematic sequences with production-level output and reduced overhead.

How does the quality of LTX 2’s video-to-video output compare to competitors?

With native 4K generation at up to 50 fps, LTX 2 is built for high-fidelity output. Its video-to-video engine is designed to reproduce fine motion details, synchronized sound, and consistent color tone, with results intended to rival or exceed most closed-source alternatives.

What makes LTX 2’s video-to-video engine cost-efficient?

LTX 2 is designed for compute efficiency, which lowers the cost of generation by up to 50% relative to comparable models. Users can choose between Fast, Pro, or Ultra modes in the video-to-video pipeline depending on how they balance cost and fidelity.

Can LTX 2’s video-to-video feature be used on consumer hardware?

Yes, LTX 2 is built for accessibility, running efficiently on consumer-grade GPUs. Even in video-to-video tasks, the lightweight architecture ensures smooth processing without the need for server-grade setups, making it practical for independent creators.

Are there any limitations to the LTX 2 video-to-video generation process?

LTX 2 can currently produce clips of around 10 seconds with synchronized audio and video. In the video-to-video mode, longer sequences may require stitching or batching, and ultra-high-detail renders could need extended generation times or stronger GPUs.
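
For footage longer than a single generation, the batching mentioned above can be planned as a list of consecutive (start_time, duration) windows. This is a rough sketch: plan_segments is a hypothetical helper, and the 10-second default simply mirrors the approximate limit stated in the answer above rather than a documented hard cap.

```python
from typing import List, Tuple


def plan_segments(total_length: float, max_segment: float = 10.0) -> List[Tuple[float, float]]:
    """Split [0, total_length] into consecutive (start_time, duration) windows."""
    if total_length <= 0 or max_segment <= 0:
        raise ValueError("total_length and max_segment must be > 0")
    windows: List[Tuple[float, float]] = []
    start = 0.0
    while start < total_length:
        # The last window is shortened so it ends exactly at total_length.
        duration = min(max_segment, total_length - start)
        windows.append((start, duration))
        start += duration
    return windows


if __name__ == "__main__":
    print(plan_segments(25.0))
    # [(0.0, 10.0), (10.0, 10.0), (20.0, 5.0)]
```

Each window maps directly onto the start_time and duration inputs, so the windows can be processed one at a time and the results stitched back together in order.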