Ace Step Audio Outpaint: Extend Tracks With Matching Style and Vocals on Models and API

acestep-ai/ace-step/audio-outpaint

Extend an audio track at the start, end, or both with style-matched continuations and optional lyrics, accessible on RunComfy models and HTTP API.

The rate is $0.0002 per second of total output audio.

Introduction To Ace Step Audio Outpaint

ACE Studio's Ace Step Audio Outpaint extends an existing track at the start, the end, or both, generating new bars that match the source's style, rhythm, and mood, at $0.0002 per second of total output. It replaces manual tail recording, intro re-edits, and stitched-loop workarounds with tag-driven, prompt-controlled passes, which speeds up music lengthening for producers, video editors, game audio teams, and content creators. Developers can use Ace Step Audio Outpaint on RunComfy in the browser or through an HTTP API, so there is no need to host or scale the model yourself.
Ideal for: Intro And Outro Generation | Background Music Lengthening | Loop-Free Track Extensions

ACE Studio / Ace Step Audio Outpaint

Where most music tools force you to regenerate an entire track to make it longer, Ace Step Audio Outpaint grows a finished song outward, writing fresh bars at the head, the tail, or both, while leaving the source audio bit-for-bit untouched. Give it a track URL, specify how many seconds to add before and after, and steer the new material with comma-separated style tags plus, optionally, lyrics. The continuations inherit the source's tempo feel and instrumentation, then crossfade into the original so the seam reads as part of the arrangement rather than an edit point.

Output format: Audio only / source up to 60 minutes / extension range 0–240 seconds per direction / provider-defined sample rate.

Parameters

Parameter	Required	Type	Default	Range / Options	Description
audio*	Yes (*)	string	—	HTTPS URL to MP3 / WAV / FLAC, up to 60 min	Source audio file to extend.
tags*	Yes (*)	string	—	Free text	Comma-separated genre, mood, and instrument tags steering the style of the extensions.
extend_before_duration	No	number	0	0 – 240	Seconds of new audio generated before the original track.
extend_after_duration	No	number	30	0 – 240	Seconds of new audio generated after the original track.
lyrics	No	string	—	Free text or [inst] / [instrumental]	Optional lyrics for vocals in the extended sections.
seed	No	integer	-1	-1 – 2147483647	Random seed for reproducibility; -1 randomizes.
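
The table above can be turned into a small client-side check before a request is sent. The following is a minimal sketch, not RunComfy's own validation: the function name `build_outpaint_payload` is hypothetical, and only the parameter names, defaults, and ranges come from the table.

```python
from typing import Optional

# Limits copied from the parameter table above.
MAX_EXTENSION_SECONDS = 240
MAX_SEED = 2147483647


def build_outpaint_payload(
    audio: str,
    tags: str,
    extend_before_duration: float = 0,
    extend_after_duration: float = 30,
    lyrics: Optional[str] = None,
    seed: int = -1,
) -> dict:
    """Hypothetical helper: validate inputs against the documented ranges
    and return a dict keyed by the documented parameter names."""
    if not audio.lower().startswith("https://"):
        raise ValueError("audio must be an HTTPS URL")
    if not tags.strip():
        raise ValueError("tags is required and must not be empty")
    durations = {
        "extend_before_duration": extend_before_duration,
        "extend_after_duration": extend_after_duration,
    }
    for name, value in durations.items():
        if not 0 <= value <= MAX_EXTENSION_SECONDS:
            raise ValueError(f"{name} must be within 0-{MAX_EXTENSION_SECONDS} seconds")
    if not -1 <= seed <= MAX_SEED:
        raise ValueError(f"seed must be within -1-{MAX_SEED}")

    payload = {"audio": audio, "tags": tags, "seed": seed, **durations}
    if lyrics is not None:
        # lyrics is optional; "[inst]" or "[instrumental]" are also accepted values.
        payload["lyrics"] = lyrics
    return payload
```

Calling `build_outpaint_payload("https://example.com/track.mp3", "lofi, piano")` would return a payload with the table's defaults filled in (0 seconds before, 30 seconds after, seed -1). The example URL is a placeholder.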

Pricing

Ace Step Audio Outpaint on RunComfy uses time-based billing tied to the total output duration (original audio + extend_before_duration + extend_after_duration).

Billing unit	Rate
Per second of total output	$0.0002

Estimated cost examples

Original	Extend Before	Extend After	Total	Approx. cost
60 s	0 s	30 s	90 s	~$0.018
90 s	10 s	30 s	130 s	~$0.026
120 s	0 s	60 s	180 s	~$0.036
180 s	30 s	30 s	240 s	~$0.048
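
The estimates above follow directly from the billing rule: total output seconds multiplied by the rate. A minimal sketch of that arithmetic, assuming the published $0.0002 rate (the function name `estimate_cost_usd` is hypothetical):

```python
RATE_USD_PER_SECOND = 0.0002  # rate quoted in the pricing table above


def estimate_cost_usd(original_seconds: float,
                      extend_before: float = 0,
                      extend_after: float = 30) -> tuple:
    """Return (total_output_seconds, approximate_cost_in_usd).

    Billing covers the original audio plus both extensions.
    """
    total_seconds = original_seconds + extend_before + extend_after
    # Round to suppress floating-point noise in the product.
    return total_seconds, round(total_seconds * RATE_USD_PER_SECOND, 6)


if __name__ == "__main__":
    # The four rows of the example table: (original, before, after).
    for row in [(60, 0, 30), (90, 10, 30), (120, 0, 60), (180, 30, 30)]:
        total, cost = estimate_cost_usd(*row)
        print(f"{total} s total -> ~${cost:.3f}")
```

Because the original track counts toward the billed total, extending a long source costs more than extending a short one by the same number of seconds.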

Prompt & Reference Tips

Use descriptive, specific tags such as "lofi, hiphop, jazzy, chill, piano" instead of generic words like "music" for tighter style matching.
Keep the original track and extension durations roughly balanced when you want a consistent feel across the full output.
Start with shorter 15–30s extensions to validate style match before committing to longer renders.
Match the new tags to the source's energy when extending continuously; diverge only when you want a clear genre shift at the seam.
Provide structured lyrics with consistent syllable counts so the new vocal phrasing fits the surrounding bars.
Fix the seed when iterating so you can attribute changes to tag or duration edits rather than random variation.
For clean results, use source audio with a consistent tempo and minimal background noise before running Ace Step Audio Outpaint.
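
Several of the tips above (specific tags, short trial extensions, a fixed seed) can be combined into a small pre-flight helper. This is an illustrative sketch only: `normalize_tags`, `trial_plan`, and the `GENERIC_TAGS` set are hypothetical names, and lowercasing and de-duplicating tags is an assumption about good hygiene rather than documented model behavior.

```python
# "music" is the generic word named in the tips; extend this set as needed.
GENERIC_TAGS = {"music"}


def normalize_tags(raw: str) -> str:
    """Trim, lowercase, and de-duplicate comma-separated tags, dropping generic ones."""
    kept = []
    for part in raw.split(","):
        tag = part.strip().lower()
        if tag and tag not in GENERIC_TAGS and tag not in kept:
            kept.append(tag)
    return ", ".join(kept)


def trial_plan(raw_tags: str, seed: int, durations=(15, 30)) -> list:
    """Build short validation renders that share one seed, so differences
    between runs come from the duration (or later tag edits) alone."""
    tags = normalize_tags(raw_tags)
    return [
        {"tags": tags, "extend_after_duration": seconds, "seed": seed}
        for seconds in durations
    ]
```

For example, `trial_plan("Lofi, hiphop, music, lofi, piano", seed=1234)` yields two parameter sets, one for a 15-second and one for a 30-second extension, both tagged "lofi, hiphop, piano" and both using seed 1234.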

Frequently Asked Questions

What is Ace Step Audio Outpaint and what does it do in an audio-to-audio workflow?

Ace Step Audio Outpaint is an audio extension model from acestep-ai that generates new content before, after, or on both sides of an existing track. In an audio-to-audio workflow on RunComfy, you provide a source URL, choose how many seconds to extend in each direction, and supply style tags and, optionally, lyrics. Ace Step Audio Outpaint then produces continuations that blend with the original. It is built for lengthening tracks without re-rendering the source.

What kinds of generation tasks is Ace Step Audio Outpaint best suited for?

Ace Step Audio Outpaint is best suited for adding intros and outros, lengthening background music for video and podcasts, building extended remixes, and producing adaptive game or media audio that needs to keep going past its original cut. It also works well for songwriting, where you want to extend an existing idea in a matching style. Because it is bidirectional, it covers both opening and closing extensions in one pass.

How does Ace Step Audio Outpaint compare to loop-based or full text-to-music approaches?

Compared to manual loop-and-stitch workflows, Ace Step Audio Outpaint generates style-matched bars and blends them into the source instead of relying on repeating segments. Compared to full text-to-music models, it preserves the existing track and only adds new content at the requested ends, which keeps the original arrangement intact. This typically gives technical artists tighter control over how a track grows in length.

Which teams and use cases benefit most from Ace Step Audio Outpaint in production?

Music producers, video editors, game audio teams, and ad creatives benefit from Ace Step Audio Outpaint when they need to extend an existing track to fit a longer scene, episode, or campaign. Developers can wrap it into editing tools that let users mark a track and request a set number of seconds of intro or outro through an audio-to-audio interface. Content teams can also use it for last-mile fixes when a backing track is just shy of the final cut length.

What input and output limits should I know before using Ace Step Audio Outpaint?

Source audio is typically supplied as a public HTTPS URL to MP3, WAV, or FLAC, and based on available provider information may be up to about 60 minutes long. Each extension direction in Ace Step Audio Outpaint accepts a duration in seconds within the 0–240 range, with extend_before_duration controlling the start side and extend_after_duration the end side. Other constraints such as sample rate and exact format support depend on provider settings, so check the RunComfy parameter panel for the live limits.

How do I move from testing Ace Step Audio Outpaint in the Playground to using it in production via the RunComfy API?

You can prototype Ace Step Audio Outpaint in the RunComfy AI Playground Web UI by adjusting the source URL, extension durations, tags, lyrics, and seed until the audio-to-audio result matches your target. Once the configuration is stable, call the same Ace Step Audio Outpaint model through the RunComfy API with identical parameters to automate extensions from your backend or content pipeline. This keeps creative iteration in the browser and production runs in code, without changing model behavior.
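
Moving a Playground configuration into code might look like the sketch below, which only constructs the HTTP request and does not send it. Everything about the transport is an assumption to be checked against the RunComfy API documentation: the endpoint URL is a caller-supplied placeholder, and the JSON body and Bearer-token header are guesses at a common convention, not confirmed details of the RunComfy API. Only the payload keys come from the parameter table on this page.

```python
import json
import urllib.request


def build_outpaint_request(endpoint_url: str, api_token: str,
                           payload: dict) -> urllib.request.Request:
    """Hypothetical helper: wrap the model parameters in a POST request.

    ASSUMPTIONS: JSON request body and Bearer authentication. The real
    endpoint path and auth scheme must come from the RunComfy API docs.
    """
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        endpoint_url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_token}",
        },
    )


# Parameters validated in the Playground, reused unchanged in code.
playground_settings = {
    "audio": "https://example.com/track.mp3",  # placeholder source URL
    "tags": "lofi, hiphop, jazzy, chill, piano",
    "extend_before_duration": 0,
    "extend_after_duration": 30,
    "seed": 1234,
}
```

The request would then be submitted with `urllib.request.urlopen(request)`. The response format is not described on this page, so any parsing of the result should follow the API documentation.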

How is pricing handled when running Ace Step Audio Outpaint on RunComfy?

Ace Step Audio Outpaint generations draw on the USD balance or credits in your RunComfy account. Based on available provider information, the model is billed at $0.0002 per second of total output (original audio plus both extension durations). New users typically receive a free trial balance to experiment with, after which usage follows the Generation rules shown on the model page. For current rates and any mode-specific differences, refer to the Generation section of the Ace Step Audio Outpaint page on RunComfy.

Can I use Ace Step Audio Outpaint outputs commercially?

RunComfy provides access to the Ace Step Audio Outpaint model and the audio-to-audio workflow, but commercial usage rights for the extended audio depend on the license from the original model author and provider (acestep-ai), as well as any rights you hold over the source track you upload. Before releasing extended audio in commercial products, ads, films, or games, review the official ACE-Step license and your source-audio rights. For platform-side questions you can reach out to hi@runcomfy.com.
