RunComfy

Video Character Replacement (MoCha) | Realistic Swap Tool

Swap video characters fast with realistic motion and lighting control.

SUPIR + Foolhardy Remacri | 8K Image/Video Upscaler

Upscale images to 8K with SUPIR and 4x Foolhardy Remacri model.

Z Image ControlNet | Precision Image Generator

Total control over image poses, edges, and depth layouts.

Flux TTP Upscale | 4K Face Restore

Repair distorted faces and upscale images to 4K resolution.

ComfyUI > Nodes > ComfyUI_RH_VoxCPM > RunningHub VoxCPM Multi-Speaker

ComfyUI Node: RunningHub VoxCPM Multi-Speaker

Class Name

RunningHub_VoxCPM_MultiSpeaker

Category
RunningHub/VoxCPM

Author
HM-RunningHub (Account age: 489days) Extension
ComfyUI_RH_VoxCPM Latest Updated
2026-04-15 Github Stars
0.03K

Github Ask HM-RunningHub Current Questions Past Questions

Table of Content

Description
RunningHub_VoxCPM_MultiSpeaker:
RunningHub_VoxCPM_MultiSpeaker Input Parameters:
RunningHub_VoxCPM_MultiSpeaker Output Parameters:
RunningHub_VoxCPM_MultiSpeaker Usage Tips:
RunningHub_VoxCPM_MultiSpeaker Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_RH_VoxCPM

Install this extension via the ComfyUI Manager by searching for ComfyUI_RH_VoxCPM

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI_RH_VoxCPM in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

RunningHub VoxCPM Multi-Speaker Description

Generates multi-speaker audio using VoxCPM for dynamic AI-driven content creation.

RunningHub VoxCPM Multi-Speaker:

The RunningHub_VoxCPM_MultiSpeaker node is designed to facilitate the generation of speech from multiple speakers using the VoxCPM model. This node is particularly useful for applications that require dynamic audio generation with varied speaker characteristics, such as virtual assistants, interactive storytelling, or any AI-driven audio content creation. By leveraging the capabilities of the VoxCPM model, this node allows you to create rich, diverse audio outputs that can mimic different speaker profiles, enhancing the realism and engagement of the generated content. The primary goal of this node is to provide a seamless and efficient way to produce multi-speaker audio, making it an invaluable tool for AI artists and developers looking to incorporate sophisticated audio features into their projects.

RunningHub VoxCPM Multi-Speaker Input Parameters:

speaker_profiles

The speaker_profiles parameter allows you to specify the different speaker profiles that the node will use to generate speech. Each profile can represent a unique voice with distinct characteristics, enabling the creation of diverse audio outputs. This parameter is crucial for defining the variety and richness of the generated speech, as it directly influences the tonal and stylistic attributes of each speaker. There are no specific minimum or maximum values, but the diversity of profiles will enhance the output quality.

reference_audio

The reference_audio parameter is used to provide sample audio files that the node can analyze to better mimic the desired speaker characteristics. By supplying reference audio, you can guide the node in capturing the nuances of specific voices, leading to more accurate and personalized speech generation. This parameter is optional but highly recommended for achieving high fidelity in speaker imitation.

text_input

The text_input parameter is where you input the text that you want to be converted into speech. This text serves as the script for the generated audio, and its content will be articulated by the selected speaker profiles. The quality and clarity of the text input will directly affect the intelligibility and coherence of the resulting speech.

RunningHub VoxCPM Multi-Speaker Output Parameters:

generated_audio

The generated_audio parameter is the primary output of the node, providing the synthesized speech based on the input text and speaker profiles. This audio output is the culmination of the node's processing, delivering a multi-speaker audio file that reflects the specified characteristics and content. The generated audio can be used in various applications, from multimedia projects to AI-driven communication tools.

RunningHub VoxCPM Multi-Speaker Usage Tips:

To achieve the best results, provide high-quality reference audio that closely matches the desired speaker characteristics. This will help the node generate more accurate and realistic speech.
Experiment with different speaker profiles to explore the range of voices and styles that the node can produce. This can add depth and variety to your audio projects.
Ensure that your text input is clear and well-structured to enhance the clarity and coherence of the generated speech.

RunningHub VoxCPM Multi-Speaker Common Errors and Solutions:

"Invalid speaker profile"

Explanation: This error occurs when the specified speaker profile is not recognized by the node.
Solution: Verify that the speaker profile names are correctly spelled and match the available profiles supported by the node.

"Reference audio not found"

Explanation: The node cannot locate the reference audio file specified in the input.
Solution: Check the file path and ensure that the reference audio file is accessible and correctly specified in the input parameters.

"Text input is empty"

Explanation: The node requires text input to generate speech, and this error indicates that no text was provided.
Solution: Provide a valid text input to the node to enable speech generation.

RunningHub VoxCPM Multi-Speaker Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_RH_VoxCPM

Table of Content

Description
RunningHub_VoxCPM_MultiSpeaker:
RunningHub_VoxCPM_MultiSpeaker Input Parameters:
RunningHub_VoxCPM_MultiSpeaker Output Parameters:
RunningHub_VoxCPM_MultiSpeaker Usage Tips:
RunningHub_VoxCPM_MultiSpeaker Common Errors and Solutions:
Related Nodes

CatVTON | Amazing Virtual Try-On

CatVTON for easy and accurate virtual try-on.

Stable Diffusion 1.5 LoRA Inference | AI Toolkit ComfyUI

Run AI Toolkit-trained Stable Diffusion 1.5 LoRAs in ComfyUI with training-matched behavior using a single RCSD15 custom node.

Wan 2.1 Ditto | Cinematic Video Restyle Generator

Transform videos into stunning artistic styles with perfect motion flow.

FLUX Inpainting | Seamless Image Editing

Effortlessly fill, remove, and refine images, seamlessly integrating new content.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: RunningHub VoxCPM Multi-Speaker

RunningHub_VoxCPM_MultiSpeaker

How to Install ComfyUI_RH_VoxCPM

RunningHub VoxCPM Multi-Speaker Description

RunningHub VoxCPM Multi-Speaker:

RunningHub VoxCPM Multi-Speaker Input Parameters:

speaker_profiles

reference_audio

text_input

RunningHub VoxCPM Multi-Speaker Output Parameters:

generated_audio

RunningHub VoxCPM Multi-Speaker Usage Tips:

RunningHub VoxCPM Multi-Speaker Common Errors and Solutions:

"Invalid speaker profile"

"Reference audio not found"

"Text input is empty"

RunningHub VoxCPM Multi-Speaker Related Nodes