ComfyUI Extension: ComfyUI-Woosh

Repo Name: ComfyUI-Woosh
Author: Saganaki22 (Account age: 1846 days)
Nodes: 4
Latest Updated: 2026-06-03
Github Stars: 0.1K

Table of Contents

Description
ComfyUI-Woosh Introduction
How ComfyUI-Woosh Works
ComfyUI-Woosh Features
ComfyUI-Woosh Models
Troubleshooting ComfyUI-Woosh
Learn More about ComfyUI-Woosh
Related Nodes

How to Install ComfyUI-Woosh

Install this extension via the ComfyUI Manager by searching for ComfyUI-Woosh:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI-Woosh in the search bar and install it from the results.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

ComfyUI-Woosh Description

ComfyUI-Woosh adds nodes to ComfyUI for generating sound effects from text descriptions or video input, using Sony AI's Woosh foundation model.

ComfyUI-Woosh Introduction

ComfyUI-Woosh is an extension that generates sound effects from text descriptions or video inputs. It is built on Sony AI's Woosh foundation model and runs inside ComfyUI, so AI artists can produce audio without extensive technical knowledge. Typical uses are adding soundscapes to digital art and generating audio for silent video.

How ComfyUI-Woosh Works

ComfyUI-Woosh uses generative models to turn text or video input into audio: you describe a scene in words, or supply a silent video, and the extension produces matching sound. The underlying technique is latent diffusion modeling, in which the model interprets the input and generates the corresponding audio. The distilled models need fewer sampling steps, so generation stays fast even on machines with limited computational resources.

ComfyUI-Woosh Features

Text-to-Audio (T2A): Transform text descriptions into sound effects using Flow and DFlow models. This feature allows you to create audio that matches the mood and theme of your visual art.
Video-to-Audio (V2A): Convert video frames into audio using VFlow and DVFlow models. This is perfect for adding soundtracks to animations or video projects.
Distilled Models: DFlow and DVFlow models offer rapid audio generation with fewer steps, making them ideal for quick iterations and experimentation.
Dynamic VRAM Management: Efficiently manage your system's resources by offloading tasks between GPU and CPU, ensuring smooth performance even on less powerful machines.
Force Offload: Automatically clear models from memory after use, optimizing system performance for subsequent tasks.
Video Output: Directly output video frames for further processing or combination with audio, streamlining your workflow.
Bundled Library: The Woosh library is included, eliminating the need for additional installations and ensuring compatibility with your existing environment.
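
The Force Offload behavior described above can be pictured with a small sketch. This is not the extension's code: `loaded_on` and `StandInModel` are hypothetical names invented for illustration, and the only assumption about the model is that it exposes a `.to(device)` method, as a PyTorch module does. If PyTorch with CUDA is present, the sketch also releases cached GPU memory after offloading.

```python
import gc
from contextlib import contextmanager


@contextmanager
def loaded_on(model, device="cuda", offload_device="cpu", force_offload=True):
    """Hypothetical helper: keep `model` on `device` only while the block runs."""
    model.to(device)
    try:
        yield model
    finally:
        if force_offload:
            # Move the weights off the GPU and drop unreferenced objects.
            model.to(offload_device)
            gc.collect()
            try:
                import torch
            except ImportError:
                torch = None
            if torch is not None and torch.cuda.is_available():
                # Return cached, unused GPU memory to the driver.
                torch.cuda.empty_cache()


class StandInModel:
    """Stand-in for a real model so the sketch runs without a GPU."""

    def __init__(self):
        self.device = "cpu"

    def to(self, device):
        self.device = device
        return self


model = StandInModel()
with loaded_on(model, device="cuda") as m:
    print("during run:", m.device)  # during run: cuda
print("after run:", model.device)   # after run: cpu
```

With `force_offload=False` the model stays on the target device after the block, which avoids reloading between runs at the cost of holding VRAM.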

ComfyUI-Woosh Models

ComfyUI-Woosh offers several models tailored to different tasks:

Flow: Ideal for high-quality text-to-audio generation, offering the best sound fidelity.
DFlow: A distilled version of Flow, providing faster audio generation with slightly reduced quality, suitable for quick previews.
VFlow: Designed for video-to-audio conversion, maintaining high audio quality from video inputs.
DVFlow: A distilled version of VFlow, optimized for speed, making it perfect for rapid prototyping.

Each model is designed to cater to specific needs, allowing you to choose based on your project's requirements and available resources.
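
The choice among the four models comes down to two questions: is the input text or video, and does speed matter more than fidelity? The sketch below encodes that decision as a lookup. `pick_model` is a hypothetical helper written for this page, not a function provided by ComfyUI-Woosh; only the four model names come from the list above.

```python
# Hypothetical lookup: (input type, prefer speed) -> model name.
MODELS = {
    ("text", False): "Flow",    # text-to-audio, best fidelity
    ("text", True): "DFlow",    # text-to-audio, distilled, faster
    ("video", False): "VFlow",  # video-to-audio, best fidelity
    ("video", True): "DVFlow",  # video-to-audio, distilled, faster
}


def pick_model(source: str, fast: bool = False) -> str:
    """Return the model name for a 'text' or 'video' input."""
    try:
        return MODELS[(source, fast)]
    except KeyError:
        raise ValueError(f"source must be 'text' or 'video', got {source!r}") from None


print(pick_model("text"))              # Flow
print(pick_model("video", fast=True))  # DVFlow
```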

Troubleshooting ComfyUI-Woosh

Here are some common issues you might encounter and how to resolve them:

Error Loading State_dict in Strict Mode: This message is expected. Some checkpoint keys do not match the model, and the extension falls back to non-strict loading.
RoBERTa/HuggingFace Downloads Every Restart: The files are cached locally after the first download, so later runs should load from the cache instead of downloading again.
CUDA Out of Memory: Enable force_offload to free up memory after each run, use smaller models like DFlow/DVFlow, or reduce the number of latent_frames.
Model Download Fails (China): Set a HuggingFace mirror before starting ComfyUI to ensure successful downloads.
Import Errors After Install: Restart ComfyUI to reload all necessary Python modules.
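
For the mirror fix above, one way to set a HuggingFace mirror before ComfyUI starts is a small launcher script. The sketch relies on the `HF_ENDPOINT` environment variable read by the `huggingface_hub` library and on ComfyUI's usual `python main.py` entry point. The mirror URL is only an example of a community mirror and the helper names are hypothetical; substitute a mirror you trust.

```python
import os
import subprocess
import sys

# Assumption: example community mirror, not an endorsement.
DEFAULT_MIRROR = "https://hf-mirror.com"


def mirrored_env(mirror=DEFAULT_MIRROR, base=None):
    """Return a copy of the environment with HF_ENDPOINT pointing at the mirror."""
    env = dict(os.environ if base is None else base)
    env["HF_ENDPOINT"] = mirror
    return env


def launch_comfyui(comfy_dir, mirror=DEFAULT_MIRROR):
    """Start ComfyUI from `comfy_dir` with the mirror already set in its environment."""
    return subprocess.call(
        [sys.executable, "main.py"],
        cwd=comfy_dir,
        env=mirrored_env(mirror),
    )
```

Calling `launch_comfyui("/path/to/ComfyUI")` starts the server with the variable in place, so the first model download goes through the mirror. Setting the variable inside an already running ComfyUI process may be too late, because the library can read it at import time.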

Learn More about ComfyUI-Woosh

To further explore the capabilities of ComfyUI-Woosh, consider visiting the following resources:

Hugging Face Woosh Models for downloading model checkpoints.
SonyResearch/Woosh GitHub Repository for in-depth technical details and updates.
ComfyUI-VideoHelperSuite for additional video processing tools that complement ComfyUI-Woosh.

These resources provide valuable insights and community support, helping you make the most of ComfyUI-Woosh in your creative endeavors.

ComfyUI-Woosh Related Nodes

Woosh Model Loader

Woosh Video Loader

Woosh Sampler

Woosh TextConditioning
