RunComfy

ComfyUI Node: Higgs Audio

Class Name: HiggsAudio
Category: Higgs Audio
Author: Yuan-ManX (Account age: 2090 days)
Extension: ComfyUI-HiggsAudio
Latest Updated: 2025-07-26
Github Stars: 0.02K

Table of Contents

Description
Higgs Audio:
Higgs Audio Input Parameters:
Higgs Audio Output Parameters:
Higgs Audio Usage Tips:
Higgs Audio Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-HiggsAudio

Install this extension through the ComfyUI Manager by searching for ComfyUI-HiggsAudio:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI-HiggsAudio in the search bar and install the extension from the results.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Higgs Audio Description

HiggsAudio enhances audio processing in ComfyUI, enabling encoding and manipulation of audio data.

Higgs Audio:

HiggsAudio is a node that adds audio encoding and conditioning to ComfyUI workflows. It encodes an input audio clip into a representation that later steps can process or analyze, which makes it suited to tasks that transform or condition audio signals, such as speaker identity transfer or audio feature extraction. For AI artists, it provides a starting point for bringing audio into multimodal projects.

Higgs Audio Input Parameters:

reference_audio

The reference_audio parameter sets the audio clip that the node encodes and uses for conditioning, for example as the target voice in speaker identity transfer. The clip is encoded into a latent space, and that encoding guides the model's output so that the resulting audio retains the desired characteristics of the reference. This parameter has no minimum or maximum value; its content depends entirely on the audio file you provide.

identity_guidance_scale

The identity_guidance_scale parameter controls how strongly the identity of the reference audio influences the output. A higher value makes the reference audio's characteristics more prominent in the result, while a lower value reduces them. Use it to tune the balance between the reference audio and the generated output.
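One way to picture the effect of a guidance scale is as a linear blend between a baseline embedding and the reference embedding. The sketch below is only an illustration of that idea and is not taken from the extension's code: the function name apply_identity_guidance, the plain-list embeddings, and the linear formula are all assumptions.

```python
from typing import List, Sequence


def apply_identity_guidance(
    base: Sequence[float], reference: Sequence[float], scale: float
) -> List[float]:
    """Move `base` toward `reference` by the fraction `scale`.

    Hypothetical helper: scale = 0 returns the base values unchanged,
    scale = 1 returns the reference values.
    """
    if len(base) != len(reference):
        raise ValueError("embeddings must have the same length")
    return [b + scale * (r - b) for b, r in zip(base, reference)]


# Toy three-dimensional "embeddings" for illustration only.
base = [0.0, 1.0, 2.0]
reference = [1.0, 1.0, 0.0]

for scale in (0.0, 0.5, 1.0):
    print(scale, apply_identity_guidance(base, reference, scale))
```

With this toy blend, raising the scale pulls every component closer to the reference, which matches the qualitative behavior described above. The actual node may combine the signals differently.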

Higgs Audio Output Parameters:

audio_embed

The audio_embed output is the reference audio encoded into a latent space. It is a compact representation of the audio's essential features that later steps can use to condition or manipulate audio signals, or to carry out further processing and analysis.
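To show how the two inputs and the single output fit together, here is a minimal sketch of a node written in the usual ComfyUI custom-node convention (INPUT_TYPES, RETURN_TYPES, FUNCTION, CATEGORY). It is not the extension's source: the class name HiggsAudioSketch, the "AUDIO_EMBED" type name, the default and range for identity_guidance_scale, and the placeholder body of encode are all assumptions made for illustration.

```python
class HiggsAudioSketch:
    """Hypothetical stand-in that mirrors the documented inputs and output."""

    CATEGORY = "Higgs Audio"
    RETURN_TYPES = ("AUDIO_EMBED",)  # assumed type name
    RETURN_NAMES = ("audio_embed",)
    FUNCTION = "encode"

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                # Assumed to use ComfyUI's built-in AUDIO socket type.
                "reference_audio": ("AUDIO",),
                # Assumed default and range; check the real node's widget.
                "identity_guidance_scale": (
                    "FLOAT",
                    {"default": 1.0, "min": 0.0, "max": 1.0, "step": 0.05},
                ),
            }
        }

    def encode(self, reference_audio, identity_guidance_scale):
        # Placeholder: a real node would run an audio encoder here.
        # This sketch only bundles the inputs so the data flow is visible.
        audio_embed = {
            "reference": reference_audio,
            "identity_guidance_scale": float(identity_guidance_scale),
        }
        return (audio_embed,)  # ComfyUI nodes return a tuple of outputs


# ComfyUI discovers nodes through a mapping like this one.
NODE_CLASS_MAPPINGS = {"HiggsAudioSketch": HiggsAudioSketch}
```

Outside ComfyUI the class can be exercised directly, for example `HiggsAudioSketch().encode({"waveform": None, "sample_rate": 24000}, 0.5)`, which returns a one-element tuple holding the placeholder embedding.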

Higgs Audio Usage Tips:

To achieve optimal results when using HiggsAudio for speaker identity transfer, ensure that the reference audio is clear and representative of the desired speaker characteristics.
Experiment with the identity_guidance_scale parameter to find the right balance between the reference audio and the generated output, especially when working on projects that require subtle audio transformations.

Higgs Audio Common Errors and Solutions:

Reference Audio Not Found

Explanation: This error occurs when the specified reference audio file cannot be located or accessed by the node.
Solution: Ensure that the file path to the reference audio is correct and that the file is accessible from the current working directory.
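A quick way to rule out path problems is to resolve the path and test it before running the workflow. The helper below is a small sketch that uses only the Python standard library; resolve_reference_audio is a hypothetical name and is not part of the extension.

```python
import os
import tempfile
from pathlib import Path


def resolve_reference_audio(path_str: str) -> Path:
    """Return the absolute path if it is a readable file, else raise."""
    path = Path(path_str).expanduser().resolve()
    if not path.is_file():
        raise FileNotFoundError(f"Reference audio not found: {path}")
    if not os.access(path, os.R_OK):
        raise PermissionError(f"Reference audio is not readable: {path}")
    return path


# Demonstration with a throwaway file instead of a real recording.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "speaker.wav"
    sample.write_bytes(b"placeholder")
    print(resolve_reference_audio(str(sample)))

    try:
        resolve_reference_audio(str(Path(tmp) / "missing.wav"))
    except FileNotFoundError as err:
        print(err)
```

Printing the resolved absolute path makes it easy to see whether a relative path points somewhere other than the directory you expected.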

Invalid Audio Format

Explanation: The node encountered an audio file format that it does not support.
Solution: Convert the audio file to a supported format, such as WAV or MP3, and try again.
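Before converting a file, it can help to confirm what you actually have. The sketch below checks the file extension and, for WAV files, reads the header with Python's standard wave module. The set of supported suffixes is an assumption based on the formats named above, and describe_audio_file is a hypothetical helper.

```python
import tempfile
import wave
from pathlib import Path

# Assumption: taken from the formats named in this section, not from the node's code.
SUPPORTED_SUFFIXES = {".wav", ".mp3"}


def describe_audio_file(path) -> dict:
    """Check the suffix and, for WAV files, read basic header fields."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        raise ValueError(f"Unsupported audio format: {suffix or '(none)'}")
    info = {"suffix": suffix}
    if suffix == ".wav":
        # wave.open raises wave.Error if the file is not a PCM WAV file.
        with wave.open(str(path), "rb") as wav:
            info["channels"] = wav.getnchannels()
            info["sample_rate"] = wav.getframerate()
            info["frames"] = wav.getnframes()
    return info


# Demonstration: write one second of 16 kHz mono silence, then inspect it.
with tempfile.TemporaryDirectory() as tmp:
    wav_path = Path(tmp) / "silence.wav"
    with wave.open(str(wav_path), "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 16-bit samples
        wav.setframerate(16000)
        wav.writeframes(b"\x00\x00" * 16000)
    print(describe_audio_file(wav_path))
```

A file that has a .wav extension but fails to open here is likely mislabeled or uses a non-PCM encoding, and re-exporting it from an audio editor is usually the simplest fix.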

Identity Guidance Scale Out of Range

Explanation: The identity_guidance_scale parameter was set to a value outside the acceptable range.
Solution: Adjust the identity_guidance_scale to a valid value, typically between 0 and 1, to ensure proper functioning of the node.
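If the value comes from a script or an upstream calculation, a small guard can catch out-of-range values early. This sketch assumes the 0 to 1 range mentioned above, which the node itself may not enforce in exactly this way; validate_identity_guidance_scale is a hypothetical helper.

```python
import math


def validate_identity_guidance_scale(
    value, low: float = 0.0, high: float = 1.0, clamp: bool = False
) -> float:
    """Return `value` as a float if it lies in [low, high].

    With clamp=True, out-of-range values are pulled to the nearest bound
    instead of raising. The default bounds are an assumption.
    """
    value = float(value)
    if math.isnan(value):
        raise ValueError("identity_guidance_scale must be a number, got NaN")
    if low <= value <= high:
        return value
    if clamp:
        return min(max(value, low), high)
    raise ValueError(
        f"identity_guidance_scale={value} is outside [{low}, {high}]"
    )


print(validate_identity_guidance_scale(0.3))
print(validate_identity_guidance_scale(1.7, clamp=True))
```

Raising by default is the safer choice while debugging, because a silently clamped value can hide a mistake earlier in the workflow.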

Higgs Audio Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-HiggsAudio
