RunComfy

ComfyUI Extension: ComfyUI-FL-VoxCPM

Repo Name: ComfyUI-FL-VoxCPM
Author: filliptm (Account age: 2446 days)
Nodes: 8
Latest Updated: 2026-05-21
GitHub Stars: 0.03K

Table of Contents

Description
ComfyUI-FL-VoxCPM Introduction
How ComfyUI-FL-VoxCPM Works
ComfyUI-FL-VoxCPM Features
ComfyUI-FL-VoxCPM Models
What's New with ComfyUI-FL-VoxCPM
Troubleshooting ComfyUI-FL-VoxCPM
Learn More about ComfyUI-FL-VoxCPM
Related Nodes

How to Install ComfyUI-FL-VoxCPM

Install this extension through the ComfyUI Manager by searching for ComfyUI-FL-VoxCPM:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI-FL-VoxCPM in the search bar and install the matching result.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

ComfyUI-FL-VoxCPM Description

ComfyUI-FL-VoxCPM is an extension for ComfyUI that integrates OpenBMB's VoxCPM text-to-speech models. It adds nodes for speech synthesis, voice cloning, voice design, transcription, audio cropping, and LoRA training.

ComfyUI-FL-VoxCPM Introduction

ComfyUI-FL-VoxCPM brings text-to-speech (TTS) to the ComfyUI platform. It is powered by OpenBMB's VoxCPM model family and supports high-quality, multilingual speech synthesis. You can clone voices, design new ones from text descriptions, or fine-tune custom voices. The extension is aimed at AI artists who want realistic, expressive speech in their projects without needing extensive technical knowledge.

How ComfyUI-FL-VoxCPM Works

At its core, ComfyUI-FL-VoxCPM relies on VoxCPM's tokenizer-free, diffusion autoregressive architecture. The model generates continuous speech representations directly instead of first converting audio into discrete tokens. Think of a painter who paints the finished picture directly rather than sketching it first. This approach allows for highly natural and expressive speech synthesis. The extension supports multiple languages and can create voices from simple text descriptions, which keeps it accessible for artists.
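
To make the idea of "continuous, autoregressive, refined from noise" a little more concrete, here is a deliberately tiny toy in Python. It is not VoxCPM's network or anything from the extension's code: the function names, the 0.9 scaling, and the 0.5 step size are all made up for illustration. It only shows the shape of the loop, where each new frame is a plain floating-point value (no codebook lookup) that starts as noise and is refined over several steps, conditioned on the previous frame.

```python
import random


def refine_from_noise(context, steps, rng):
    """Toy stand-in for a diffusion-style refinement step.

    Starts from Gaussian noise and moves toward a target derived from the
    context. A real model would predict the update with a neural network.
    """
    target = 0.9 * context          # hypothetical "conditional prediction"
    x = rng.gauss(0.0, 1.0)         # start from pure noise
    for _ in range(steps):
        x += 0.5 * (target - x)     # each step halves the distance to the target
    return x


def generate(num_frames, steps, seed=0):
    """Autoregressive loop: every frame is conditioned on the previous one."""
    rng = random.Random(seed)
    frames = []
    prev = 1.0                      # arbitrary starting context
    for _ in range(num_frames):
        value = refine_from_noise(prev, steps, rng)
        frames.append(value)        # a continuous float, never a token id
        prev = value
    return frames


if __name__ == "__main__":
    print(generate(num_frames=5, steps=10))
```

In this toy, more refinement steps bring each frame closer to its target, which is loosely the role a setting such as inference_timesteps plays in a diffusion-based generator.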

ComfyUI-FL-VoxCPM Features

VoxCPM V2 Model: This model has 2 billion parameters and produces 48kHz studio-quality audio across 30 languages. It suits diverse, high-fidelity speech output.
Voice Design: Create unique voices using natural language descriptions. For example, you can describe a voice as "a young woman with a warm and gentle tone," and the model will generate speech that matches this description.
Voice Cloning: Clone any voice using a short audio reference. This feature is perfect for replicating specific vocal characteristics.
Controllable Cloning: Modify the style or emotion of a cloned voice, allowing for creative expression while maintaining the original voice's timbre.
Ultimate Cloning: Achieve maximum fidelity by using both reference audio and continuation audio, ensuring every vocal nuance is captured.
LoRA Training: Fine-tune custom voices with a real-time training dashboard that provides insights into the training process, including loss charts and validation audio.
Auto Transcription: Integrated Whisper technology transcribes audio to text, aiding in creating accurate reference texts.
Audio Crop: Trim audio files to specific time ranges, making it easy to edit and manage audio content.
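
As a rough illustration of what trimming audio to a time range involves, the sketch below converts start and end times in seconds into sample indices and slices a list of samples. The function name crop_samples and its parameters are hypothetical and are not taken from the FL VoxCPM Audio Crop node, whose actual inputs may differ.

```python
def crop_samples(samples, sample_rate, start_s, end_s):
    """Return the samples between start_s and end_s (both in seconds).

    samples     -- a sequence of audio samples for one channel
    sample_rate -- samples per second, e.g. 48000
    """
    if sample_rate <= 0:
        raise ValueError("sample_rate must be positive")
    if start_s < 0 or end_s < start_s:
        raise ValueError("need 0 <= start_s <= end_s")
    start = int(round(start_s * sample_rate))
    # Clamp the end index so a range past the end of the clip is tolerated.
    end = min(int(round(end_s * sample_rate)), len(samples))
    return samples[start:end]


if __name__ == "__main__":
    two_seconds = list(range(48000 * 2))   # placeholder "audio" at 48 kHz
    clip = crop_samples(two_seconds, 48000, 0.5, 1.0)
    print(len(clip))
```

The same index arithmetic applies to multi-channel audio; only the slicing moves to the time axis of the array or tensor.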

ComfyUI-FL-VoxCPM Models

The extension includes several models, each suited for different needs:

VoxCPM2: With 2 billion parameters, this model is recommended for its high-quality output and support for 30 languages. It's perfect for voice design and controllable cloning.
VoxCPM1.5: A stable model with 800 million parameters, offering high-fidelity TTS at 44.1kHz. It's suitable for projects requiring consistent quality.
VoxCPM-0.5B: A legacy model with 500 million parameters, providing a lightweight option for basic TTS needs.
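
The figures above can be collected into a small lookup, which also makes the effect of the output sample rate easy to see. This is only a sketch built from the numbers on this page: the dictionary layout and the samples_for helper are hypothetical, and values the page does not state are left as None rather than guessed.

```python
# Model facts as stated on this page; None marks values the page does not give.
MODELS = {
    "VoxCPM2": {"params_millions": 2000, "sample_rate_hz": 48000, "languages": 30},
    "VoxCPM1.5": {"params_millions": 800, "sample_rate_hz": 44100, "languages": None},
    "VoxCPM-0.5B": {"params_millions": 500, "sample_rate_hz": None, "languages": None},
}


def samples_for(model_name, seconds):
    """Number of output samples a clip of the given length contains."""
    rate = MODELS[model_name]["sample_rate_hz"]
    if rate is None:
        raise ValueError(f"sample rate for {model_name} is not listed")
    return int(seconds * rate)


if __name__ == "__main__":
    for name in ("VoxCPM2", "VoxCPM1.5"):
        print(name, samples_for(name, 10))
```

For a 10-second clip this gives 480,000 samples at 48kHz and 441,000 samples at 44.1kHz.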

What's New with ComfyUI-FL-VoxCPM

Recent updates have introduced the VoxCPM2 model, which supports 30 languages and offers advanced features like voice design and controllable cloning. These enhancements allow for more creative and flexible use of the extension, enabling AI artists to produce even more realistic and expressive speech outputs.

Troubleshooting ComfyUI-FL-VoxCPM

If you encounter issues while using ComfyUI-FL-VoxCPM, here are some common solutions:

Model Not Downloading: Ensure you have a stable internet connection. The models are downloaded automatically from Hugging Face on first use.
Audio Quality Issues: Check your input settings, such as the cfg_value and inference_timesteps, to ensure they are optimized for your desired output.
Voice Cloning Errors: Make sure your reference audio is clear and of good quality. Use the FL VoxCPM Transcribe node to generate accurate transcripts if needed.
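
When a cloning result sounds off, it can help to check the basic properties of the reference file before looking at anything else. The sketch below uses Python's standard wave module to report channels, sample rate, and duration for an uncompressed PCM WAV file. It is independent of the extension, the helper name describe_wav is hypothetical, and reference.wav is a placeholder path. What counts as a good reference length or sample rate is not specified on this page, so the script only reports values.

```python
import wave


def describe_wav(path):
    """Report basic properties of a PCM WAV file using only the stdlib."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_s": frames / rate,
        }


if __name__ == "__main__":
    import sys

    # Usage: python describe_wav.py reference.wav
    if len(sys.argv) > 1:
        print(describe_wav(sys.argv[1]))
```

Note that the wave module reads only uncompressed WAV; other formats such as MP3 or FLAC would need a different library.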

Learn More about ComfyUI-FL-VoxCPM

To further explore the capabilities of ComfyUI-FL-VoxCPM, you can visit the VoxCPM GitHub repository for more detailed documentation and resources. Additionally, the Hugging Face page provides access to model weights and further technical details. For community support and discussions, consider joining the Discord server where you can connect with other users and developers.

ComfyUI-FL-VoxCPM Related Nodes

FL VoxCPM Audio Crop

FL VoxCPM Dataset Maker

FL VoxCPM LoRA Trainer

FL VoxCPM Train Config

FL VoxCPM Transcribe

FL VoxCPM TTS

FL VoxCPM V2 Train Config

FL VoxCPM V2 TTS
