RunComfy

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

Qwen Edit 2509 MultipleAngles | Multi-View Image Creator

Turn one photo into complete multi-angle visuals instantly.

Sonic | Lip-Sync Portrait Animation

Sonic delivers advanced audio-driven lip-sync for portraits with high-quality animation.

Consistent Character Creator 3.0 | Easy Consistency, Any Angle

Make characters stay the same, every angle, strong and perfect.

ComfyUI > Nodes > ComfyUI-AudioX > AudioX Enhanced Text to Music

ComfyUI Node: AudioX Enhanced Text to Music

Class Name

AudioXEnhancedTextToMusic

Category
AudioX/Generation

Author
lum3on (Account age: 314days) Extension
ComfyUI-AudioX Latest Updated
2025-06-24 Github Stars
0.04K

Github Ask lum3on Current Questions Past Questions

Table of Content

Description
AudioXEnhancedTextToMusic:
AudioXEnhancedTextToMusic Input Parameters:
AudioXEnhancedTextToMusic Output Parameters:
AudioXEnhancedTextToMusic Usage Tips:
AudioXEnhancedTextToMusic Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-AudioX

Install this extension via the ComfyUI Manager by searching for ComfyUI-AudioX

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-AudioX in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

AudioX Enhanced Text to Music Description

Transform textual descriptions into musical compositions with advanced prompt controls for enhanced creative process in the AudioX suite.

AudioX Enhanced Text to Music:

The AudioXEnhancedTextToMusic node is designed to transform textual descriptions into musical compositions, leveraging advanced prompt controls to enhance the creative process. This node is part of the AudioX suite, which focuses on generating audio content from various input types. By using this node, you can create music that aligns closely with your artistic vision, as it allows for detailed customization of the music generation process. The node is particularly beneficial for AI artists looking to explore new musical ideas or generate background scores that match specific themes or moods described in text form. Its enhanced capabilities provide a more refined control over the output, ensuring that the generated music is not only coherent but also rich in detail and expression.

AudioX Enhanced Text to Music Input Parameters:

model

The model parameter specifies the AudioX model to be used for generating music. This model acts as the core engine that interprets the text prompt and converts it into a musical composition. Selecting the appropriate model is crucial as it determines the style and quality of the generated music.

text_prompt

The text_prompt parameter is a string input where you describe the type of music you wish to generate. This could be a simple phrase like "A music with piano and violin" or a more detailed description. The prompt guides the model in creating music that matches the specified theme or mood. The default value is "A music with piano and violin," and it supports multiline input for more complex descriptions.

steps

The steps parameter defines the number of iterations the model will perform to generate the music. A higher number of steps can lead to more refined and detailed compositions, but it also increases the processing time. The parameter ranges from 1 to 1000, with a default value of 250.

cfg_scale

The cfg_scale parameter is a float that adjusts the influence of the text prompt on the music generation process. A higher value makes the output more closely aligned with the prompt, while a lower value allows for more creative freedom. The scale ranges from 0.1 to 20.0, with a default setting of 7.0.

seed

The seed parameter is an integer that initializes the random number generator used in the music generation process. By setting a specific seed, you can reproduce the same output in subsequent runs. The default value is -1, which means a random seed is used. The range is from -1 to 2^32

duration_seconds

The duration_seconds parameter specifies the length of the generated music in seconds. This allows you to control how long the output audio will be, ranging from 1.0 to 30.0 seconds, with a default duration of 10.0 seconds.

AudioX Enhanced Text to Music Output Parameters:

audio

The audio output parameter represents the generated music in audio format. This output is the result of the text-to-music conversion process, encapsulating the musical composition that aligns with the provided text prompt. The audio can be used for various creative projects, such as background music for videos, interactive media, or standalone musical pieces.

AudioX Enhanced Text to Music Usage Tips:

Experiment with different text_prompt descriptions to explore a wide range of musical styles and moods. The more detailed your prompt, the more specific the generated music will be.
Adjust the cfg_scale to find the right balance between adherence to the prompt and creative variation. A higher scale will produce music that closely matches your description, while a lower scale allows for more unexpected results.
Use the seed parameter to reproduce specific outputs. This is particularly useful if you want to refine a particular piece of music or ensure consistency across different projects.

AudioX Enhanced Text to Music Common Errors and Solutions:

Invalid model selection

Explanation: The selected model is not compatible or not available for the music generation process.
Solution: Ensure that you have selected a valid and available AudioX model. Check the model list and confirm that the chosen model is correctly installed and configured.

Text prompt too vague

Explanation: The text prompt provided is too vague, leading to unsatisfactory or generic music output.
Solution: Provide a more detailed and specific text prompt to guide the music generation process more effectively. Include elements like instruments, mood, or style to enhance the output.

Steps value too high

Explanation: Setting the steps parameter too high can lead to excessive processing time without significant improvement in output quality.
Solution: Start with the default value and gradually increase it to find the optimal balance between quality and processing time. Avoid setting it unnecessarily high unless required for specific artistic purposes.

AudioX Enhanced Text to Music Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-AudioX

Table of Content

Description
AudioXEnhancedTextToMusic:
AudioXEnhancedTextToMusic Input Parameters:
AudioXEnhancedTextToMusic Output Parameters:
AudioXEnhancedTextToMusic Usage Tips:
AudioXEnhancedTextToMusic Common Errors and Solutions:
Related Nodes

MimicMotion | Human Motion Video Generation

Generate high-quality human motion videos with MimicMotion, using a reference image and motion sequence.

Reallusion AI Render | 3D to ComfyUI Workflows Collection

ComfyUI + Reallusion = Speed, Accessibility, and Ease for 3D visuals

ComfyUI Grounding | Object Tracking Workflow

Track any subject with pixel-perfect accuracy for stunning VFX results.

Z-Image Finetuned Models Collection | Multi-Style Generator

Create stunning, detailed images across multiple styles and moods easily.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: AudioX Enhanced Text to Music

AudioXEnhancedTextToMusic

How to Install ComfyUI-AudioX

AudioX Enhanced Text to Music Description

AudioX Enhanced Text to Music:

AudioX Enhanced Text to Music Input Parameters:

model

text_prompt

steps

cfg_scale

seed

duration_seconds

AudioX Enhanced Text to Music Output Parameters:

audio

AudioX Enhanced Text to Music Usage Tips:

AudioX Enhanced Text to Music Common Errors and Solutions:

Invalid model selection

Text prompt too vague

Steps value too high

AudioX Enhanced Text to Music Related Nodes