RunComfy

SeedVR2 V2.5 | AI Video Upscaling Workflow

Upscale videos fast with sharp, smooth, cinematic results.

Qwen Edit 2509 MultipleAngles | Multi-View Image Creator

Turn one photo into complete multi-angle visuals instantly.

Wan 2.2 + Lightx2v V2 | Ultra Fast I2V & T2V

Dual Light LoRA setup, 4X faster.

Image Bypass | Smart Image Detection Bypass Utility Workflow

Skip limits and process images faster with total creative control.

ComfyUI > Nodes > ComfyUI-AudioX > AudioX Enhanced Text to Audio

ComfyUI Node: AudioX Enhanced Text to Audio

Class Name

AudioXEnhancedTextToAudio

Category
AudioX/Generation

Author
lum3on (Account age: 314days) Extension
ComfyUI-AudioX Latest Updated
2025-06-24 Github Stars
0.04K

Github Ask lum3on Current Questions Past Questions

Table of Content

Description
AudioXEnhancedTextToAudio:
AudioXEnhancedTextToAudio Input Parameters:
AudioXEnhancedTextToAudio Output Parameters:
AudioXEnhancedTextToAudio Usage Tips:
AudioXEnhancedTextToAudio Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-AudioX

Install this extension via the ComfyUI Manager by searching for ComfyUI-AudioX

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-AudioX in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

AudioX Enhanced Text to Audio Description

Transform text to high-quality audio using advanced algorithms for creative audio generation.

AudioX Enhanced Text to Audio:

The AudioXEnhancedTextToAudio node is designed to transform textual descriptions into audio outputs, leveraging the capabilities of the AudioX model. This node is particularly beneficial for AI artists and creators who wish to generate audio content from text prompts, offering a seamless way to convert written ideas into auditory experiences. By utilizing advanced algorithms, the node enhances the text-to-audio conversion process, ensuring high-quality audio generation that aligns closely with the provided text prompt. This node is essential for those looking to explore creative audio generation, providing a robust tool for producing soundscapes, effects, or any audio content directly from textual input.

AudioX Enhanced Text to Audio Input Parameters:

model

The model parameter specifies the AudioX model to be used for generating audio. It is a required parameter and ensures that the node utilizes the correct model configuration for audio synthesis.

text_prompt

The text_prompt parameter is a string input that serves as the basis for audio generation. It allows you to describe the desired audio scene or effect, such as "Typing on a keyboard." This parameter supports multiline input, enabling detailed descriptions. The default value is "Typing on a keyboard."

steps

The steps parameter determines the number of processing steps the model will take to generate the audio. It influences the quality and detail of the output, with a higher number of steps generally resulting in more refined audio. The parameter accepts integer values ranging from 1 to 1000, with a default of 250.

cfg_scale

The cfg_scale parameter is a float that adjusts the guidance scale for the model, affecting how closely the generated audio adheres to the text prompt. A higher value increases adherence to the prompt, while a lower value allows for more creative freedom. The range is from 0.1 to 20.0, with a default of 7.0.

seed

The seed parameter is an integer that sets the random seed for audio generation, ensuring reproducibility of results. A value of -1 indicates that a random seed will be used. The range is from -1 to 2^32

1, with a default of -1.

duration_seconds

The duration_seconds parameter specifies the length of the generated audio in seconds. It allows you to control the duration of the output, with a range from 1.0 to 30.0 seconds and a default of 10.0 seconds.

AudioX Enhanced Text to Audio Output Parameters:

audio

The audio output parameter represents the generated audio file based on the provided text prompt and other input settings. This output is crucial as it is the final product of the node's processing, providing you with an audio representation of your textual description.

AudioX Enhanced Text to Audio Usage Tips:

Experiment with different text_prompt descriptions to explore the range of audio outputs the node can generate. Be as descriptive as possible to achieve more accurate results.
Adjust the steps parameter to balance between processing time and audio quality. More steps can lead to higher quality but will take longer to process.
Use the cfg_scale parameter to fine-tune how closely the audio should match the text prompt. Higher values ensure closer adherence, while lower values allow for more creative variations.

AudioX Enhanced Text to Audio Common Errors and Solutions:

Invalid model configuration

Explanation: This error occurs when the specified model configuration is incorrect or incompatible with the node.
Solution: Ensure that the model parameter is set to a valid AudioX model configuration. Verify the model's compatibility with the node.

Text prompt too vague

Explanation: A vague or insufficient text prompt can lead to unsatisfactory audio outputs.
Solution: Provide a more detailed and specific text_prompt to guide the audio generation process effectively.

Duration exceeds limits

Explanation: The specified duration for audio generation is outside the allowed range.
Solution: Adjust the duration_seconds parameter to fall within the 1.0 to 30.0 seconds range.

Seed value out of range

Explanation: The seed value provided is not within the acceptable range.
Solution: Ensure the seed parameter is set between -1 and 2^32 - 1. Use -1 for a random seed if needed.

AudioX Enhanced Text to Audio Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-AudioX

Table of Content

Description
AudioXEnhancedTextToAudio:
AudioXEnhancedTextToAudio Input Parameters:
AudioXEnhancedTextToAudio Output Parameters:
AudioXEnhancedTextToAudio Usage Tips:
AudioXEnhancedTextToAudio Common Errors and Solutions:
Related Nodes

Flux UltraRealistic LoRA V2

Create stunningly lifelike image with Flux UltraRealistic LoRA V2

Qwen Image 2512 LoRA Inference | AI Toolkit ComfyUI

Use an AI Toolkit-trained LoRA with Qwen Image 2512 in ComfyUI via one RCQwenImage2512 node for preview-aligned generations.

Wan 2.2 Animate V2 | Realistic Pose Video Generator

Transforms photos into smooth-motion animated character videos using Wan 2.2.

Create Coherent Scenes | Consistent Story Art Generator

Build seamless storytelling scenes with rich visual consistency.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: AudioX Enhanced Text to Audio

AudioXEnhancedTextToAudio

How to Install ComfyUI-AudioX

AudioX Enhanced Text to Audio Description

AudioX Enhanced Text to Audio:

AudioX Enhanced Text to Audio Input Parameters:

model

text_prompt

steps

cfg_scale

seed

duration_seconds

AudioX Enhanced Text to Audio Output Parameters:

audio

AudioX Enhanced Text to Audio Usage Tips:

AudioX Enhanced Text to Audio Common Errors and Solutions:

Invalid model configuration

Text prompt too vague

Duration exceeds limits

Seed value out of range

AudioX Enhanced Text to Audio Related Nodes