RunComfy

Flux Kontext Pulid | Consistent Character Generation

Create consistent characters using FLUX Kontext with a single face reference image.

Wan 2.2 VACE | Pose-Controlled Video Generator

Turn still images into stunning motion with pose-based control.

LongCat Avatar in ComfyUI | Identity-Consistent Avatar Animation

Turns one image into smooth, identity-consistent avatar animation.

Qwen Image Edit | Precise AI Photo Editing

Edit photos fast with style, relighting, and object control precision.

ComfyUI > Nodes > ComfyUI_VoxCPM_SM

ComfyUI Extension: ComfyUI_VoxCPM_SM

Repo Name

ComfyUI_VoxCPM_SM

Author
smthemex (Account age: 1064 days) Nodes
View all nodes(4) Latest Updated
2026-06-02 Github Stars
0.03K

Github Ask smthemex Current Questions Past Questions

Table of Content

Description
ComfyUI_VoxCPM_SM Introduction
How ComfyUI_VoxCPM_SM Works
ComfyUI_VoxCPM_SM Features
ComfyUI_VoxCPM_SM Models
What's New with ComfyUI_VoxCPM_SM
Troubleshooting ComfyUI_VoxCPM_SM
Learn More about ComfyUI_VoxCPM_SM
Related Nodes

How to Install ComfyUI_VoxCPM_SM

Install this extension via the ComfyUI Manager by searching for ComfyUI_VoxCPM_SM

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI_VoxCPM_SM in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

ComfyUI_VoxCPM_SM Description

ComfyUI_VoxCPM_SM is an extension for ComfyUI that enhances user interface functionality by integrating advanced voice command processing. It streamlines user interactions through efficient voice recognition and command execution, improving accessibility and user experience.

ComfyUI_VoxCPM_SM Introduction

ComfyUI_VoxCPM_SM is an extension designed to enhance the capabilities of the ComfyUI platform by integrating the VoxCPM model, a tokenizer-free Text-to-Speech (TTS) system. This extension allows AI artists to generate context-aware speech and perform true-to-life voice cloning without the need for complex tokenization processes. By using this extension, you can easily infer and train models to produce natural and expressive speech outputs, making it a valuable tool for artists looking to incorporate realistic voice synthesis into their projects.

How ComfyUI_VoxCPM_SM Works

At its core, ComfyUI_VoxCPM_SM leverages the VoxCPM model, which operates on a diffusion autoregressive architecture. This means it generates speech by progressively refining audio representations, similar to how an artist might start with a rough sketch and gradually add details. The model bypasses traditional tokenization, allowing for more fluid and natural speech synthesis. This approach is particularly beneficial for creating multilingual speech and voice cloning, as it can adapt to various languages and vocal styles without predefined tokens.

ComfyUI_VoxCPM_SM Features

Tokenizer-Free Speech Generation: Generate speech directly from text without the need for tokenization, resulting in more natural and expressive outputs.
Voice Cloning: Clone voices from short audio clips, allowing for the creation of personalized and unique vocal outputs.
Multilingual Support: Capable of synthesizing speech in multiple languages, making it versatile for global applications.
Customizable Inference and Training: Easily adjust settings for inference and training to suit specific project needs, such as adjusting the voice's tone or emotion.

ComfyUI_VoxCPM_SM Models

The extension supports different models, including VoxCPM1.5 and VoxCPM2. Each model has its strengths:

VoxCPM1.5: Suitable for projects requiring stable and reliable speech synthesis.
VoxCPM2: Offers advanced features like voice design and controllable voice cloning, ideal for more complex and creative applications.

What's New with ComfyUI_VoxCPM_SM

Recent updates have introduced support for the gguf model format, which optimizes VRAM usage, requiring only 4.8GB for inference. The extension now also supports the VoxCPM2 model, enhancing both training and inference capabilities. These updates improve performance and expand the range of applications for AI artists.

Troubleshooting ComfyUI_VoxCPM_SM

If you encounter issues while using the extension, consider the following solutions:

Model Loading Errors: Ensure that the model files are correctly placed in the specified directories and that their names match the expected format.
Inference Performance: If performance is not as expected, check your VRAM availability and consider using the gguf model format for optimized usage.
Training Issues: Verify that your training data is correctly formatted and that the paths in the configuration files are accurate.

Learn More about ComfyUI_VoxCPM_SM

To further explore the capabilities of ComfyUI_VoxCPM_SM, you can visit the VoxCPM GitHub repository for detailed documentation and examples. Additionally, the ComfyUI examples page provides insights into how to create complex workflows using the ComfyUI platform. Engaging with community forums and tutorials can also provide valuable support and inspiration for your projects.

ComfyUI_VoxCPM_SM Related Nodes

VoxCPM_SM_KSampler

VoxCPM_SM_LoraTrainerInit

VoxCPM_SM_LoraTrainerLoop

VoxCPM_SM_Model

Table of Content

Description
ComfyUI_VoxCPM_SM Introduction
How ComfyUI_VoxCPM_SM Works
ComfyUI_VoxCPM_SM Features
ComfyUI_VoxCPM_SM Models
What's New with ComfyUI_VoxCPM_SM
Troubleshooting ComfyUI_VoxCPM_SM
Learn More about ComfyUI_VoxCPM_SM
Related Nodes

PMRF Ultra Fast Upscaler | Low VRAM ComfyUI

Ultra fast PMRF upscaler! 3.79s on medium machine. 2x scale.

SDXL LoRA Inference | AI Toolkit ComfyUI

Run your AI Toolkit-trained SDXL LoRA in ComfyUI with training-matched defaults using a single RC custom node.

Qwen-Image | HD Multi-Text Poster Generator

New Era of Text Generation in Images!

Hunyuan3D-2 | Leading-edge 3D Assets Generator

Generate precise textured 3D assets from images with state-of-the-art AI technology.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Extension: ComfyUI_VoxCPM_SM

ComfyUI_VoxCPM_SM

How to Install ComfyUI_VoxCPM_SM

ComfyUI_VoxCPM_SM Description

ComfyUI_VoxCPM_SM Introduction

How ComfyUI_VoxCPM_SM Works

ComfyUI_VoxCPM_SM Features

ComfyUI_VoxCPM_SM Models

What's New with ComfyUI_VoxCPM_SM

Troubleshooting ComfyUI_VoxCPM_SM

Learn More about ComfyUI_VoxCPM_SM

ComfyUI_VoxCPM_SM Related Nodes