ComfyUI > Nodes > SongBloom > SongBloom Generate Audio

ComfyUI Node: SongBloom Generate Audio

Class Name

SongBloomGenerate

Category
audio/songbloom
Author
fredconex (Account age: 1213days)
Extension
SongBloom
Latest Updated
2025-09-17
Github Stars
0.19K

How to Install SongBloom

Install this extension via the ComfyUI Manager by searching for SongBloom
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter SongBloom in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

SongBloom Generate Audio Description

AI-powered music composition node within ComfyUI for generating music from input data like lyrics and audio prompts.

SongBloom Generate Audio:

The SongBloomGenerate node is designed to create music using the SongBloom model within the ComfyUI framework. This node leverages advanced AI capabilities to transform input data, such as lyrics and audio prompts, into musical compositions. It is particularly beneficial for AI artists and music creators who wish to explore new musical ideas or generate unique soundscapes. The node's primary function is to process input conditions and generate music by setting various parameters that influence the style, complexity, and structure of the output. By integrating with ComfyUI's model management, it provides a seamless experience for users to experiment with music generation without needing deep technical expertise.

SongBloom Generate Audio Input Parameters:

prompt_wav

The prompt_wav parameter is an audio waveform that serves as the initial input or inspiration for the music generation process. It influences the style and mood of the generated music. The waveform is truncated if it exceeds a certain length, ensuring it fits within the model's processing capabilities. This parameter is crucial for setting the initial tone and can significantly impact the final output.

processed_lyrics

processed_lyrics are the textual input that guides the thematic and lyrical content of the generated music. This parameter allows users to infuse specific lyrical ideas or themes into the music, making it a powerful tool for creating songs with coherent lyrical narratives. The lyrics are processed to fit the model's requirements, ensuring they are effectively integrated into the music generation process.

cfg_coef

The cfg_coef parameter, or configuration coefficient, adjusts the balance between creativity and adherence to the input conditions. A higher value encourages the model to follow the input conditions more closely, while a lower value allows for more creative freedom. This parameter is essential for controlling the degree of variation in the generated music.

temperature

temperature controls the randomness of the music generation process. A higher temperature results in more diverse and unpredictable outputs, while a lower temperature produces more consistent and predictable results. This parameter is key for users who wish to explore different levels of creativity in their music.

diff_temp

The diff_temp parameter is similar to temperature but specifically affects the diffusion process within the model. It fine-tunes the randomness during the generation, allowing for subtle adjustments to the output's variability.

top_k

top_k limits the number of potential next steps the model considers during generation, effectively controlling the diversity of the output. A lower top_k value results in more focused and less varied music, while a higher value allows for greater diversity and exploration.

steps

The steps parameter determines the number of iterations the model performs during the music generation process. More steps can lead to more refined and detailed outputs, but may also increase the processing time.

dit_cfg_type

dit_cfg_type specifies the type of configuration used for the diffusion process. This parameter allows users to select different diffusion strategies, which can affect the style and quality of the generated music.

use_sampling

The use_sampling parameter indicates whether sampling techniques are employed during generation. Enabling sampling can introduce additional variability and creativity into the music, making it a useful option for users seeking unique outputs.

max_duration

max_duration sets the maximum length of the generated music in seconds. This parameter ensures that the output fits within the desired time constraints, making it essential for projects with specific duration requirements.

sampler

The sampler parameter specifies the sampling method used during generation, such as "spiral" or "pingpong." Different samplers can produce distinct musical styles and effects, allowing users to experiment with various sonic textures.

SongBloom Generate Audio Output Parameters:

generated_music

The generated_music output is the final audio composition created by the SongBloom model. It reflects the input conditions and parameters set by the user, offering a unique musical piece that can be used for various creative projects. This output is the culmination of the model's processing and serves as the primary deliverable of the node.

SongBloom Generate Audio Usage Tips:

  • Experiment with different temperature and diff_temp settings to explore a wide range of musical styles and creativity levels.
  • Use the prompt_wav and processed_lyrics parameters to guide the thematic direction of the music, ensuring it aligns with your creative vision.
  • Adjust the cfg_coef to balance between adhering to input conditions and allowing for creative exploration, depending on your project needs.

SongBloom Generate Audio Common Errors and Solutions:

"Input audio exceeds maximum length"

  • Explanation: The prompt_wav input is longer than the allowed maximum length for processing.
  • Solution: Ensure that your input audio is within the specified length limit or adjust the max_duration parameter accordingly.

"Invalid sampler type"

  • Explanation: The specified sampler type is not recognized by the model.
  • Solution: Verify that the sampler parameter is set to a valid option, such as "spiral" or "pingpong," and adjust if necessary.

"Lyrics processing error"

  • Explanation: There was an issue processing the processed_lyrics input.
  • Solution: Check the format and content of your lyrics input to ensure it meets the model's requirements, and try reprocessing if needed.

SongBloom Generate Audio Related Nodes

Go back to the extension to check out more related nodes.
SongBloom
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.