logo
RunComfy
  • ComfyUI
  • TrainerNew
  • Models
  • API
  • Pricing
discord logo
MODELS
Explore
All Models
LIBRARY
Generations
MODEL APIS
API Docs
API Keys
ACCOUNT
Usage

Ace Step Audio Outpaint: Extend Tracks With Matching Style and Vocals on Models and API | RunComfy

acestep-ai/ace-step/audio-outpaint

Extend an audio track at the start, end, or both with style-matched continuations and optional lyrics, accessible on RunComfy models and HTTP API.

0:00
0:00
Source audio to extend. Provide an HTTPS URL to an MP3, WAV, or FLAC file (up to 60 minutes).
Comma-separated genre, mood, and instrument tags that steer the style of the generated extensions.
Seconds of new audio to generate before the original track. Set 0 to skip extending the start.
Seconds of new audio to generate after the original track. Set 0 to skip extending the end.
Optional lyrics to guide vocals in the extended sections. Leave blank to let the model write lyrics, or use [inst] / [instrumental] for no vocals.
Random seed for reproducibility. Use -1 to randomize.
Idle
The rate is $0.0002 per second of total output audio.

Introduction To Ace Step Audio Outpaint

ACE Studio's Ace Step Audio Outpaint extends an existing track at the start, end, or both directions at $0.0002 per second of total output, generating new bars that match the source's style, rhythm, and mood. Trading manual tail recording, intro re-edits, and stitched-loop workarounds for tag-driven, prompt-controlled Ace Step Audio Outpaint passes, the model speeds up music lengthening for producers, video editors, game audio teams, and content creators. For developers, Ace Step Audio Outpaint on RunComfy can be used both in the browser and via an HTTP API, so you don't need to host or scale the model yourself.
Ideal for: Intro And Outro Generation | Background Music Lengthening | Loop-Free Track Extensions

ACE Studio / Ace Step Audio Outpaint#


Where most music tools force you to regenerate an entire track to make it longer, Ace Step Audio Outpaint grows a finished song outward — writing fresh bars at the head, the tail, or both — while leaving the source audio bit-for-bit untouched. Feed it a track URL, point it at how many seconds to add before and after, and steer the new material with comma-separated style tags plus (optionally) lyrics. The continuations inherit the source's tempo feel and instrumentation, then crossfade into the original so the seam reads as part of the arrangement rather than an edit point.


Output format: Audio only / source up to 60 minutes / extension range 0–240 seconds per direction / provider-defined sample rate.


Parameters#


ParameterRequiredTypeDefaultRange / OptionsDescription
audio*Yes (*)string—HTTPS URL to MP3 / WAV / FLAC, up to 60 minSource audio file to extend.
tags*Yes (*)string—Free textComma-separated genre, mood, and instrument tags steering the style of the extensions.
extend_before_durationNonumber00 – 240Seconds of new audio generated before the original track.
extend_after_durationNonumber300 – 240Seconds of new audio generated after the original track.
lyricsNostring—Free text or [inst] / [instrumental]Optional lyrics for vocals in the extended sections.
seedNointeger-1-1 – 2147483647Random seed for reproducibility; -1 randomizes.

Pricing#


Ace Step Audio Outpaint on RunComfy uses time-based billing tied to the total output duration (original audio + extend_before_duration + extend_after_duration).


Billing unitRate
Per second of total output$0.0002

Estimated cost examples


OriginalExtend BeforeExtend AfterTotalApprox. cost
60 s0 s30 s90 s~$0.018
90 s10 s30 s130 s~$0.026
120 s0 s60 s180 s~$0.036
180 s30 s30 s240 s~$0.048

Prompt & Reference Tips#


  • Use descriptive, specific tags such as "lofi, hiphop, jazzy, chill, piano" instead of generic words like "music" for tighter style matching.
  • Keep the original track and extension durations roughly balanced when you want a consistent feel across the full output.
  • Start with shorter 15–30s extensions to validate style match before committing to longer renders.
  • Match the new tags to the source's energy when extending continuously; diverge only when you want a clear genre shift at the seam.
  • Provide structured lyrics with consistent syllable counts so the new vocal phrasing fits the surrounding bars.
  • Fix the seed when iterating so you can attribute changes to tag or duration edits rather than random variation.
  • For clean source results, use audio with consistent tempo and minimal background noise before running Ace Step Audio Outpaint.

Related Models

hunyuan/image-to-video

Features smooth scene transitions, natural cuts, and consistent motion.

kling-video-o1/image-to-video

Transform static visuals into cinematic motion with Kling O1's precise scene control and lifelike generation.

wan-2-2/speech-to-video

Turn photos into expressive videos with synced voice motion.

ace-step-1.5/text-to-audio

Generates up to 4-minute songs with vocals from style tags and lyrics

kling-3.0/4k/image-to-video

Animate stills into native 4K cinematic clips with start-end frame guidance and synchronized sound.

sync/lipsync/v2/pro

Create lifelike talking visuals with AI that matches voice and motion seamlessly.

Frequently Asked Questions

What is Ace Step Audio Outpaint and what does it do in an audio-to-audio workflow?

Ace Step Audio Outpaint is an audio extension model from acestep-ai that generates new content before, after, or on both sides of an existing track. In an audio-to-audio workflow on RunComfy, you provide a source URL, choose how many seconds to extend in each direction, and supply style tags or optional lyrics, and Ace Step Audio Outpaint produces continuations that blend with the original. It is built for lengthening tracks without re-rendering the source.

What kinds of generation tasks is Ace Step Audio Outpaint best suited for?

Ace Step Audio Outpaint is best suited for adding intros and outros, lengthening background music for video and podcasts, building extended remixes, and producing adaptive game or media audio that needs to keep going past its original cut. It also works well for songwriting, where you want to extend an existing idea in a matching style. Because it is bidirectional, it covers both opening and closing extensions in one pass.

How does Ace Step Audio Outpaint compare to loop-based or full text-to-music approaches?

Compared to manual loop-and-stitch workflows, Ace Step Audio Outpaint generates style-matched bars and blends them into the source instead of relying on repeating segments. Compared to full text-to-music models, it preserves the existing track and only adds new content at the requested ends, which keeps the original arrangement intact. This typically gives technical artists tighter control over how a track grows in length.

Which teams and use cases benefit most from Ace Step Audio Outpaint in production?

Music producers, video editors, game audio teams, and ad creatives benefit from Ace Step Audio Outpaint when they need to extend an existing track to fit a longer scene, episode, or campaign. Developers can wrap it into editing tools that let users mark a track and request seconds of intro or outro through an audio-to-audio interface. Content teams can also use it for last-mile fixes when a backing track is just shy of the final cut length.

What input and output limits should I know before using Ace Step Audio Outpaint?

Source audio is typically supplied as a public HTTPS URL to MP3, WAV, or FLAC, and based on available provider information may be up to about 60 minutes long. Each extension direction in Ace Step Audio Outpaint accepts a duration in seconds within the 0–240 range, with extend_before_duration controlling the start side and extend_after_duration the end side. Other constraints such as sample rate and exact format support depend on provider settings, so check the RunComfy parameter panel for the live limits.

How do I move from testing Ace Step Audio Outpaint in the Playground to using it in production via the RunComfy API?

You can prototype Ace Step Audio Outpaint in the RunComfy AI Playground Web UI by adjusting the source URL, extension durations, tags, lyrics, and seed until the audio-to-audio result matches your target. Once the configuration is stable, call the same Ace Step Audio Outpaint model through the RunComfy API with identical parameters to automate extensions from your backend or content pipeline. This keeps creative iteration in the browser and production runs in code, without changing model behavior.

How is pricing handled when running Ace Step Audio Outpaint on RunComfy?

Ace Step Audio Outpaint generations consume usd / credits from your RunComfy balance, and based on available provider information the model is billed at $0.0002 per second of total output (original audio plus both extension durations). New users typically get a free trial usd amount to experiment, after which usage follows the Generation rules shown on the model page. For current rates and any mode-specific differences, refer to the Generation section of the Ace Step Audio Outpaint page on RunComfy.

Can I use Ace Step Audio Outpaint outputs commercially?

RunComfy provides access to the Ace Step Audio Outpaint model and the audio-to-audio workflow, but commercial usage rights for the extended audio depend on the license from the original model author and provider (acestep-ai), as well as any rights you hold over the source track you upload. Before releasing extended audio in commercial products, ads, films, or games, review the official ACE-Step license and your source-audio rights. For platform-side questions you can reach out to hi@runcomfy.com.

Follow us
  • LinkedIn
  • Facebook
  • Instagram
  • Twitter
Support
  • Discord
  • Email
  • System Status
  • Affiliate
Video Models
  • Seedance 1.0 Pro Fast
  • Hailuo 2.3 Fast Standard
  • Wan 2.2
  • Seedance 1.5 Pro
  • Seedance 1.0
  • Veo 3.1 Fast
  • View All Models →
Image Models
  • Wan 2.6 Image to Image
  • Nano Banana 2 Edit
  • Nano Banana Pro
  • seedream 4.0
  • nano banana
  • Seedream 4.0 sequential
  • View All Models →
Legal
  • Terms of Service
  • Privacy Policy
  • Cookie Policy
RunComfy
Copyright 2026 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.