Save 4 hours! We auto-setup your workflow! Free!

Drop your workflow.json — we handle every dependency, custom node, and model. Just open the link and run.

Auto-Setup Workflow Json (Free) Now!
ComfyUI > Nodes > ComfyUI_VoxCPM_SM

ComfyUI Extension: ComfyUI_VoxCPM_SM

Repo Name

ComfyUI_VoxCPM_SM

Author
smthemex (Account age: 1064 days)
Nodes
View all nodes(4)
Latest Updated
2026-06-02
Github Stars
0.03K

How to Install ComfyUI_VoxCPM_SM

Install this extension via the ComfyUI Manager by searching for ComfyUI_VoxCPM_SM
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_VoxCPM_SM in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

ComfyUI_VoxCPM_SM Description

ComfyUI_VoxCPM_SM is an extension for ComfyUI that enhances user interface functionality by integrating advanced voice command processing. It streamlines user interactions through efficient voice recognition and command execution, improving accessibility and user experience.

ComfyUI_VoxCPM_SM Introduction

ComfyUI_VoxCPM_SM is an extension designed to enhance the capabilities of the ComfyUI platform by integrating the VoxCPM model, a tokenizer-free Text-to-Speech (TTS) system. This extension allows AI artists to generate context-aware speech and perform true-to-life voice cloning without the need for complex tokenization processes. By using this extension, you can easily infer and train models to produce natural and expressive speech outputs, making it a valuable tool for artists looking to incorporate realistic voice synthesis into their projects.

How ComfyUI_VoxCPM_SM Works

At its core, ComfyUI_VoxCPM_SM leverages the VoxCPM model, which operates on a diffusion autoregressive architecture. This means it generates speech by progressively refining audio representations, similar to how an artist might start with a rough sketch and gradually add details. The model bypasses traditional tokenization, allowing for more fluid and natural speech synthesis. This approach is particularly beneficial for creating multilingual speech and voice cloning, as it can adapt to various languages and vocal styles without predefined tokens.

ComfyUI_VoxCPM_SM Features

  • Tokenizer-Free Speech Generation: Generate speech directly from text without the need for tokenization, resulting in more natural and expressive outputs.
  • Voice Cloning: Clone voices from short audio clips, allowing for the creation of personalized and unique vocal outputs.
  • Multilingual Support: Capable of synthesizing speech in multiple languages, making it versatile for global applications.
  • Customizable Inference and Training: Easily adjust settings for inference and training to suit specific project needs, such as adjusting the voice's tone or emotion.

ComfyUI_VoxCPM_SM Models

The extension supports different models, including VoxCPM1.5 and VoxCPM2. Each model has its strengths:

  • VoxCPM1.5: Suitable for projects requiring stable and reliable speech synthesis.
  • VoxCPM2: Offers advanced features like voice design and controllable voice cloning, ideal for more complex and creative applications.

What's New with ComfyUI_VoxCPM_SM

Recent updates have introduced support for the gguf model format, which optimizes VRAM usage, requiring only 4.8GB for inference. The extension now also supports the VoxCPM2 model, enhancing both training and inference capabilities. These updates improve performance and expand the range of applications for AI artists.

Troubleshooting ComfyUI_VoxCPM_SM

If you encounter issues while using the extension, consider the following solutions:

  • Model Loading Errors: Ensure that the model files are correctly placed in the specified directories and that their names match the expected format.
  • Inference Performance: If performance is not as expected, check your VRAM availability and consider using the gguf model format for optimized usage.
  • Training Issues: Verify that your training data is correctly formatted and that the paths in the configuration files are accurate.

Learn More about ComfyUI_VoxCPM_SM

To further explore the capabilities of ComfyUI_VoxCPM_SM, you can visit the VoxCPM GitHub repository for detailed documentation and examples. Additionally, the ComfyUI examples page provides insights into how to create complex workflows using the ComfyUI platform. Engaging with community forums and tutorials can also provide valuable support and inspiration for your projects.

ComfyUI_VoxCPM_SM Related Nodes

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

ComfyUI_VoxCPM_SM detailed guide | ComfyUI