RunComfy


ComfyUI Node: Stability AI Text To Audio

Class Name

StabilityTextToAudio

Category
api node/audio/Stability AI

Author
ComfyAnonymous (Account age: 763 days)

Extension
ComfyUI

Latest Updated
2026-05-13

Github Stars
112.77K


Table of Contents

Description
StabilityTextToAudio:
StabilityTextToAudio Input Parameters:
StabilityTextToAudio Output Parameters:
StabilityTextToAudio Usage Tips:
StabilityTextToAudio Common Errors and Solutions:
Related Nodes

How to Install ComfyUI

Install this extension via the ComfyUI Manager by searching for ComfyUI:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI in the search bar.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.


Stability AI Text To Audio Description

Transform text into high-quality audio outputs using advanced AI models for creative sound design.

Stability AI Text To Audio:

The StabilityTextToAudio node generates audio, such as music and sound effects, from a text description. It passes your prompt to a Stability AI model, which interprets the text and returns the corresponding audio. This lets artists and creators draft ambient soundtracks, sound effects, or musical ideas directly from written descriptions, without leaving the ComfyUI workflow.

Stability AI Text To Audio Input Parameters:

model

The model parameter selects the AI model used to generate audio from the text. Models can differ in capabilities and characteristics, so this choice affects the style and quality of the output.

prompt

The prompt parameter is the text description that guides audio generation. Describe the audio you want clearly and in detail so the model can interpret it accurately. The maximum length is 10,000 characters.

duration

The duration parameter sets the length of the generated audio in seconds, so the output can be matched to a project's timing requirements. The value must be between 6 and 190 seconds.

seed

The seed parameter initializes the random number generator. Reusing the same seed together with the same other inputs lets you reproduce an audio output, which is useful when iterating on a design or sharing results with others.

steps

The steps parameter sets the number of processing steps the model takes to generate the audio. More steps can improve quality but may also increase processing time, so choose a value that balances the two.

strength

The strength parameter controls how strongly the text prompt influences the generated audio. A higher value makes the audio follow the prompt more closely, while a lower value leaves more room for variation.
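As a rough illustration of how these inputs fit together, the sketch below checks the two limits stated above (prompt length and duration range) and assembles a node entry in the `class_type` / `inputs` layout used by ComfyUI API-format workflows. The helper name `build_text_to_audio_node`, the node id `"1"`, and the example values (including the `placeholder-model` string and the strength of 0.7) are assumptions made for illustration; they are not taken from the node's source.

```python
import json

MAX_PROMPT_CHARS = 10_000  # prompt limit stated in the parameter description
MIN_DURATION_S = 6         # duration range stated in the parameter description
MAX_DURATION_S = 190


def build_text_to_audio_node(model, prompt, duration, seed, steps, strength):
    """Check the documented limits, then return an API-format node entry.

    Hypothetical helper: only the prompt length and duration range are
    validated, because those are the only limits the page documents.
    """
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    if not MIN_DURATION_S <= duration <= MAX_DURATION_S:
        raise ValueError(
            f"duration must be between {MIN_DURATION_S} and {MAX_DURATION_S} seconds"
        )
    return {
        "class_type": "StabilityTextToAudio",
        "inputs": {
            "model": model,
            "prompt": prompt,
            "duration": duration,
            "seed": seed,
            "steps": steps,
            "strength": strength,
        },
    }


# Example values are placeholders, not recommended settings.
node = build_text_to_audio_node(
    model="placeholder-model",
    prompt="Gentle rain on a tin roof with distant thunder",
    duration=30,
    seed=42,
    steps=8,
    strength=0.7,
)
print(json.dumps({"1": node}, indent=2))
```

Validating before queueing the workflow surfaces an out-of-range duration or an overlong prompt locally instead of as a failed API request.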

Stability AI Text To Audio Output Parameters:

audio

The audio output carries the generated audio, the result of the model's interpretation of the prompt and the other input parameters. It can be used in a range of creative projects, from music production to sound design.

Stability AI Text To Audio Usage Tips:

Experiment with different prompt descriptions to explore a wide range of audio outputs and discover unique soundscapes.
Adjust the strength parameter to find the right balance between following the prompt closely and allowing for creative variations in the audio output.
Use the seed parameter to reproduce specific audio outputs consistently, which is helpful for refining designs or collaborating with others.
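One way to apply the last two tips together is to hold the seed fixed and vary only strength, so that differences between runs can be attributed to that one parameter. The sketch below only prepares the input sets; it does not call the node. The helper name `strength_sweep` and the example values are hypothetical, and the strength values shown assume a 0 to 1 scale, which this page does not state.

```python
def strength_sweep(base_inputs, strengths):
    """Return one copy of base_inputs per strength value, keeping the seed fixed."""
    return [{**base_inputs, "strength": s} for s in strengths]


# Placeholder inputs; the strength scale (0 to 1) is an assumption.
base = {
    "prompt": "Warm analog synth pad, slow attack",
    "duration": 30,
    "seed": 1234,
    "steps": 8,
    "strength": 0.5,
}

variants = strength_sweep(base, [0.3, 0.6, 0.9])
for v in variants:
    print(v["seed"], v["strength"])
```

Each entry in `variants` can then be used as the inputs for one run of the node.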

Stability AI Text To Audio Common Errors and Solutions:

No audio file was received in response.

Explanation: This error occurs when the API does not return an audio file, possibly due to an issue with the input parameters or the API request.
Solution: Ensure that all input parameters are correctly specified and within their valid ranges. Double-check the API endpoint and network connectivity to confirm that the request is being processed correctly.
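The check behind this error can be pictured roughly as follows. This is a sketch, not the node's actual implementation: it assumes the API response has already been parsed into a plain dict with an `audio` key, and both that shape and the helper name `extract_audio` are hypothetical.

```python
def extract_audio(response):
    """Return the audio payload, or raise the documented error if it is missing.

    Assumes a dict-shaped response with an "audio" key (hypothetical).
    """
    audio = response.get("audio")
    if not audio:
        raise ValueError("No audio file was received in response.")
    return audio


# Example: an empty response triggers the error message described above.
try:
    extract_audio({})
except ValueError as err:
    print(f"Generation failed: {err}")
```

If the error appears, rechecking the inputs against their documented ranges is a sensible first step before looking at connectivity.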

Stability AI Text To Audio Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI
