Save 4 hours! We auto-setup your workflow! Free!

Drop your workflow.json — we handle every dependency, custom node, and model. Just open the link and run.

Auto-Setup Workflow Json (Free) Now!
ComfyUI > Nodes > ComfyUI-FL-VoxCPM

ComfyUI Extension: ComfyUI-FL-VoxCPM

Repo Name

ComfyUI-FL-VoxCPM

Author
filliptm (Account age: 2446 days)
Nodes
View all nodes(8)
Latest Updated
2026-05-21
Github Stars
0.03K

How to Install ComfyUI-FL-VoxCPM

Install this extension via the ComfyUI Manager by searching for ComfyUI-FL-VoxCPM
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-FL-VoxCPM in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI-FL-VoxCPM Description

ComfyUI-FL-VoxCPM is an extension for ComfyUI that integrates the VoxCPM model, enhancing the user interface with advanced voice processing capabilities. It enables seamless voice command recognition and processing, improving user interaction efficiency.

ComfyUI-FL-VoxCPM Introduction

ComfyUI-FL-VoxCPM is an innovative extension designed to bring advanced text-to-speech (TTS) capabilities to the ComfyUI platform. Powered by OpenBMB's VoxCPM model family, this extension offers a range of features that allow you to create high-quality, multilingual speech synthesis. Whether you're looking to clone voices, design new ones from text descriptions, or fine-tune custom voices, ComfyUI-FL-VoxCPM provides the tools you need. This extension is particularly useful for AI artists who want to incorporate realistic and expressive speech into their projects without needing extensive technical knowledge.

How ComfyUI-FL-VoxCPM Works

At its core, ComfyUI-FL-VoxCPM utilizes a tokenizer-free, diffusion autoregressive architecture. This means it can generate continuous speech representations directly, bypassing the need for discrete tokenization. Imagine it as a painter who doesn't need to sketch first but can directly paint a complete picture. This approach allows for highly natural and expressive speech synthesis. The extension supports multiple languages and can create voices from simple text descriptions, making it accessible and easy to use for artists.

ComfyUI-FL-VoxCPM Features

  • VoxCPM V2 Model: This model boasts 2 billion parameters and produces 48kHz studio-quality audio across 30 languages. It's ideal for creating diverse and high-fidelity speech outputs.
  • Voice Design: Create unique voices using natural language descriptions. For example, you can describe a voice as "a young woman with a warm and gentle tone," and the model will generate speech that matches this description.
  • Voice Cloning: Clone any voice using a short audio reference. This feature is perfect for replicating specific vocal characteristics.
  • Controllable Cloning: Modify the style or emotion of a cloned voice, allowing for creative expression while maintaining the original voice's timbre.
  • Ultimate Cloning: Achieve maximum fidelity by using both reference audio and continuation audio, ensuring every vocal nuance is captured.
  • LoRA Training: Fine-tune custom voices with a real-time training dashboard that provides insights into the training process, including loss charts and validation audio.
  • Auto Transcription: Integrated Whisper technology transcribes audio to text, aiding in creating accurate reference texts.
  • Audio Crop: Trim audio files to specific time ranges, making it easy to edit and manage audio content.

ComfyUI-FL-VoxCPM Models

The extension includes several models, each suited for different needs:

  • VoxCPM2: With 2 billion parameters, this model is recommended for its high-quality output and support for 30 languages. It's perfect for voice design and controllable cloning.
  • VoxCPM1.5: A stable model with 800 million parameters, offering high-fidelity TTS at 44.1kHz. It's suitable for projects requiring consistent quality.
  • VoxCPM-0.5B: A legacy model with 500 million parameters, providing a lightweight option for basic TTS needs.

What's New with ComfyUI-FL-VoxCPM

Recent updates have introduced the VoxCPM2 model, which supports 30 languages and offers advanced features like voice design and controllable cloning. These enhancements allow for more creative and flexible use of the extension, enabling AI artists to produce even more realistic and expressive speech outputs.

Troubleshooting ComfyUI-FL-VoxCPM

If you encounter issues while using ComfyUI-FL-VoxCPM, here are some common solutions:

  • Model Not Downloading: Ensure you have a stable internet connection. The models are downloaded automatically from HuggingFace on first use.
  • Audio Quality Issues: Check your input settings, such as the cfg_value and inference_timesteps, to ensure they are optimized for your desired output.
  • Voice Cloning Errors: Make sure your reference audio is clear and of good quality. Use the FL VoxCPM Transcribe node to generate accurate transcripts if needed.

Learn More about ComfyUI-FL-VoxCPM

To further explore the capabilities of ComfyUI-FL-VoxCPM, you can visit the VoxCPM GitHub repository for more detailed documentation and resources. Additionally, the Hugging Face page provides access to model weights and further technical details. For community support and discussions, consider joining the Discord server where you can connect with other users and developers.

ComfyUI-FL-VoxCPM Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

ComfyUI-FL-VoxCPM detailed guide | ComfyUI