ComfyUI > Nodes > ComfyUI_StableAvatar

ComfyUI Extension: ComfyUI_StableAvatar

Repo Name

ComfyUI_StableAvatar

Author
smthemex (Account age: 893 days)
Nodes
View all nodes(3)
Latest Updated
2025-08-21
Github Stars
0.04K

How to Install ComfyUI_StableAvatar

Install this extension via the ComfyUI Manager by searching for ComfyUI_StableAvatar
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_StableAvatar in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI_StableAvatar Description

ComfyUI_StableAvatar enables infinite-length, audio-driven avatar video generation within ComfyUI, allowing users to create dynamic avatar videos that respond to audio inputs.

ComfyUI_StableAvatar Introduction

ComfyUI_StableAvatar is an innovative extension designed to generate infinite-length avatar videos driven by audio input. This tool is particularly useful for AI artists who want to create dynamic and engaging avatar animations without the need for extensive video editing skills. By leveraging advanced diffusion models, ComfyUI_StableAvatar allows you to produce high-quality, identity-preserving videos that synchronize perfectly with audio tracks. This extension can be a game-changer for artists looking to explore new creative possibilities in digital storytelling and character animation.

How ComfyUI_StableAvatar Works

At its core, ComfyUI_StableAvatar uses a diffusion model to generate videos. Imagine the diffusion model as a sophisticated artist that starts with a rough sketch and gradually refines it into a detailed masterpiece. In this case, the "sketch" is the initial video frames, and the "masterpiece" is the final, polished video. The model takes a reference image and an audio file as inputs and uses them to guide the video generation process. It ensures that the avatar's movements and expressions are in sync with the audio, creating a seamless and natural animation. The extension employs a unique Time-step-aware Audio Adapter to prevent errors during video generation, ensuring smooth transitions and consistent identity representation throughout the video.

ComfyUI_StableAvatar Features

  • Infinite-Length Video Generation: Create videos as long as your audio track, without worrying about running out of frames.
  • Audio Synchronization: The extension ensures that the avatar's lip movements and expressions match the audio perfectly, enhancing the realism of the animation.
  • Identity Preservation: Maintain the unique characteristics of your avatar throughout the video, avoiding any unwanted changes in appearance.
  • Customizable Settings: Adjust parameters like resolution, frame rate, and inference steps to suit your specific needs and preferences.
  • Multi-GPU Support: Speed up the video generation process by utilizing multiple GPUs, if available.

ComfyUI_StableAvatar Models

ComfyUI_StableAvatar offers different models to cater to various needs:

  • Wan2.1-1.3B-based Model: Ideal for generating high-quality videos at resolutions of 512x512, 480x832, and 832x480. This model balances performance and resource usage effectively.
  • Transformer3d-square.pt and Transformer3d-rec-vec.pt: These models are trained on different datasets and can be selected based on the desired video resolution and quality. Each model has its strengths, and you can choose the one that best fits your project requirements.

What's New with ComfyUI_StableAvatar

Recent updates have introduced several improvements:

  • Enhanced Multi-GPU and Single-GPU Inference: Fixed inconsistencies to ensure smooth operation across different hardware setups.
  • New Demo Release: A brand new demo showcasing the capabilities of ComfyUI_StableAvatar is now available on platforms like YouTube and Bilibili.
  • Faster Inference: The extension can now run in just 10 steps, making it three times faster than before, thanks to the integration with ComfyUI. These updates enhance the user experience by providing faster, more reliable video generation.

Troubleshooting ComfyUI_StableAvatar

Here are some common issues you might encounter and their solutions:

  • Video Quality Issues: If the video quality is not as expected, try increasing the number of inference steps or adjusting the overlap window length for better results.
  • Audio Synchronization Problems: Ensure that the audio file is clear and free from background noise. Using the audio separator feature can help isolate vocals for better synchronization.
  • GPU Memory Errors: If you encounter memory issues, consider reducing the resolution or the number of frames. Alternatively, use the model_cpu_offload mode to decrease GPU memory usage. For more detailed troubleshooting, refer to the ComfyUI_StableAvatar GitHub page.

Learn More about ComfyUI_StableAvatar

To further explore the capabilities of ComfyUI_StableAvatar, you can access additional resources:

  • Project Page: Learn more about the underlying technology and research behind StableAvatar.
  • Hugging Face Demo: Try out the public demo to see the extension in action.
  • YouTube Tutorial: Watch a video tutorial to get a visual understanding of how to use the extension effectively. These resources provide valuable insights and guidance for AI artists looking to make the most of ComfyUI_StableAvatar.

ComfyUI_StableAvatar Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.