Features smooth scene transitions, natural cuts, and consistent motion.
Where most music tools force you to regenerate an entire track to make it longer, Ace Step Audio Outpaint grows a finished song outward — writing fresh bars at the head, the tail, or both — while leaving the source audio bit-for-bit untouched. Feed it a track URL, point it at how many seconds to add before and after, and steer the new material with comma-separated style tags plus (optionally) lyrics. The continuations inherit the source's tempo feel and instrumentation, then crossfade into the original so the seam reads as part of the arrangement rather than an edit point.
Output format: Audio only / source up to 60 minutes / extension range 0–240 seconds per direction / provider-defined sample rate.
| Parameter | Required | Type | Default | Range / Options | Description |
|---|---|---|---|---|---|
| audio* | Yes (*) | string | — | HTTPS URL to MP3 / WAV / FLAC, up to 60 min | Source audio file to extend. |
| tags* | Yes (*) | string | — | Free text | Comma-separated genre, mood, and instrument tags steering the style of the extensions. |
| extend_before_duration | No | number | 0 | 0 – 240 | Seconds of new audio generated before the original track. |
| extend_after_duration | No | number | 30 | 0 – 240 | Seconds of new audio generated after the original track. |
| lyrics | No | string | — | Free text or [inst] / [instrumental] | Optional lyrics for vocals in the extended sections. |
| seed | No | integer | -1 | -1 – 2147483647 | Random seed for reproducibility; -1 randomizes. |
Ace Step Audio Outpaint on RunComfy uses time-based billing tied to the total output duration (original audio + extend_before_duration + extend_after_duration).
| Billing unit | Rate |
|---|---|
| Per second of total output | $0.0002 |
Estimated cost examples
| Original | Extend Before | Extend After | Total | Approx. cost |
|---|---|---|---|---|
| 60 s | 0 s | 30 s | 90 s | ~$0.018 |
| 90 s | 10 s | 30 s | 130 s | ~$0.026 |
| 120 s | 0 s | 60 s | 180 s | ~$0.036 |
| 180 s | 30 s | 30 s | 240 s | ~$0.048 |
Features smooth scene transitions, natural cuts, and consistent motion.
Transform static visuals into cinematic motion with Kling O1's precise scene control and lifelike generation.
Turn photos into expressive videos with synced voice motion.
Generates up to 4-minute songs with vocals from style tags and lyrics
Animate stills into native 4K cinematic clips with start-end frame guidance and synchronized sound.
Create lifelike talking visuals with AI that matches voice and motion seamlessly.
Ace Step Audio Outpaint is an audio extension model from acestep-ai that generates new content before, after, or on both sides of an existing track. In an audio-to-audio workflow on RunComfy, you provide a source URL, choose how many seconds to extend in each direction, and supply style tags or optional lyrics, and Ace Step Audio Outpaint produces continuations that blend with the original. It is built for lengthening tracks without re-rendering the source.
Ace Step Audio Outpaint is best suited for adding intros and outros, lengthening background music for video and podcasts, building extended remixes, and producing adaptive game or media audio that needs to keep going past its original cut. It also works well for songwriting, where you want to extend an existing idea in a matching style. Because it is bidirectional, it covers both opening and closing extensions in one pass.
Compared to manual loop-and-stitch workflows, Ace Step Audio Outpaint generates style-matched bars and blends them into the source instead of relying on repeating segments. Compared to full text-to-music models, it preserves the existing track and only adds new content at the requested ends, which keeps the original arrangement intact. This typically gives technical artists tighter control over how a track grows in length.
Music producers, video editors, game audio teams, and ad creatives benefit from Ace Step Audio Outpaint when they need to extend an existing track to fit a longer scene, episode, or campaign. Developers can wrap it into editing tools that let users mark a track and request seconds of intro or outro through an audio-to-audio interface. Content teams can also use it for last-mile fixes when a backing track is just shy of the final cut length.
Source audio is typically supplied as a public HTTPS URL to MP3, WAV, or FLAC, and based on available provider information may be up to about 60 minutes long. Each extension direction in Ace Step Audio Outpaint accepts a duration in seconds within the 0–240 range, with extend_before_duration controlling the start side and extend_after_duration the end side. Other constraints such as sample rate and exact format support depend on provider settings, so check the RunComfy parameter panel for the live limits.
You can prototype Ace Step Audio Outpaint in the RunComfy AI Playground Web UI by adjusting the source URL, extension durations, tags, lyrics, and seed until the audio-to-audio result matches your target. Once the configuration is stable, call the same Ace Step Audio Outpaint model through the RunComfy API with identical parameters to automate extensions from your backend or content pipeline. This keeps creative iteration in the browser and production runs in code, without changing model behavior.
Ace Step Audio Outpaint generations consume usd / credits from your RunComfy balance, and based on available provider information the model is billed at $0.0002 per second of total output (original audio plus both extension durations). New users typically get a free trial usd amount to experiment, after which usage follows the Generation rules shown on the model page. For current rates and any mode-specific differences, refer to the Generation section of the Ace Step Audio Outpaint page on RunComfy.
RunComfy provides access to the Ace Step Audio Outpaint model and the audio-to-audio workflow, but commercial usage rights for the extended audio depend on the license from the original model author and provider (acestep-ai), as well as any rights you hold over the source track you upload. Before releasing extended audio in commercial products, ads, films, or games, review the official ACE-Step license and your source-audio rights. For platform-side questions you can reach out to hi@runcomfy.com.
RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.