PuLID Flux II | Consistent Character Generation

Generate images with precise character control while preserving artistic style.

HiDream-I1 | T2I

High-quality image generation using a 17B parameter model.

LivePortrait | Animate Portraits | Vid2Vid

Transfer facial expressions and movements from a driving video onto a source video

CatVTON | Amazing Virtual Try-On

CatVTON for easy and accurate virtual try-on.

ComfyUI > Nodes > ComfyUI-MiniCPM-o > Load MiniCPM-o Model

ComfyUI Node: Load MiniCPM-o Model

Class Name

Load MiniCPM Model

Category
MiniCPM-o

Author
CY-CHENYUE (Account age: 520days) Extension
ComfyUI-MiniCPM-o Latest Updated
2025-02-16 Github Stars
0.03K

Github Ask CY-CHENYUE Current Questions Past Questions

Table of Content

Description
Load MiniCPM Model:
Load MiniCPM Model Input Parameters:
Load MiniCPM Model Output Parameters:
Load MiniCPM Model Usage Tips:
Load MiniCPM Model Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-MiniCPM-o

Install this extension via the ComfyUI Manager by searching for ComfyUI-MiniCPM-o

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-MiniCPM-o in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Load MiniCPM-o Model Description

Facilitates loading versatile MiniCPM-o AI model for vision, audio, and text tasks, with customizable functionalities for efficient resource utilization.

Load MiniCPM-o Model:

The Load MiniCPM Model node is designed to facilitate the loading of the MiniCPM-o model, a versatile AI model capable of handling various tasks such as vision, audio, and text-to-speech processing. This node is essential for users who wish to leverage the capabilities of the MiniCPM-o model within their AI projects, providing a seamless way to initialize and configure the model according to specific needs. By offering options to enable or disable certain functionalities like vision and audio, it allows for a tailored approach to model usage, ensuring that resources are utilized efficiently. The node's primary goal is to simplify the model loading process, making it accessible even to those without a deep technical background, while ensuring that the model is ready for immediate use in diverse applications.

Load MiniCPM-o Model Input Parameters:

model_name

This parameter specifies the name of the model to be loaded. Currently, it supports the model named MiniCPM-o-2_6, which is a predefined model folder name. This parameter is crucial as it determines which version of the MiniCPM-o model will be loaded for use.

device

The device parameter allows you to choose the hardware on which the model will run. You can select between cuda and cpu, with the default being cuda. This choice impacts the model's performance and speed, as running on a GPU (cuda) typically offers faster processing compared to a CPU.

init_vision

This boolean parameter determines whether the vision capabilities of the model should be initialized. By default, it is set to True, enabling the model to process visual data. If your application does not require vision processing, you can set this to False to save resources.

init_audio

The init_audio parameter is a boolean that specifies whether the model's audio processing features should be activated. It defaults to False, meaning audio capabilities are disabled unless explicitly enabled. This allows you to tailor the model's functionality to your specific needs.

init_tts

This boolean parameter controls the initialization of the text-to-speech (TTS) functionality within the model. By default, it is set to False, indicating that TTS features are not activated unless required. Enabling this feature allows the model to convert text into spoken words, which can be useful in applications requiring audio output.

Load MiniCPM-o Model Output Parameters:

model

The model output parameter provides the loaded MiniCPM-o model, ready for use in various AI tasks. This output is crucial as it represents the core functionality that you will interact with, enabling you to perform operations such as inference and data processing.

tokenizer

The tokenizer output is an essential component that accompanies the model, responsible for converting text into a format that the model can understand and process. This output is vital for any text-based operations, ensuring that the input data is correctly formatted for the model's use.

Load MiniCPM-o Model Usage Tips:

Ensure that the model files are correctly placed in the specified directory (ComfyUI/models/MiniCPM/MiniCPM-o-2_6) to avoid loading errors.
Choose the appropriate device (cuda or cpu) based on your hardware capabilities and the performance requirements of your application.
Enable only the necessary functionalities (vision, audio, TTS) to optimize resource usage and improve performance.

Load MiniCPM-o Model Common Errors and Solutions:

本地模型未找到：`{model_path}`。请将模型文件放置在 ComfyUI/models/MiniCPM/MiniCPM-o-2_6 文件夹中。

Explanation: This error indicates that the model files are not found in the expected directory.
Solution: Verify that the model files are correctly placed in the ComfyUI/models/MiniCPM/MiniCPM-o-2_6 directory and try loading the model again.

加载模型时发生错误: `{str(e)}`

Explanation: This error occurs when there is an issue during the model loading process, possibly due to incorrect configurations or missing files.
Solution: Check the error message for specific details, ensure all required files are present, and verify that the input parameters are correctly set. If the problem persists, consult the documentation or seek support for further assistance.

Load MiniCPM-o Model Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-MiniCPM-o

Table of Content

Description
Load MiniCPM Model:
Load MiniCPM Model Input Parameters:
Load MiniCPM Model Output Parameters:
Load MiniCPM Model Usage Tips:
Load MiniCPM Model Common Errors and Solutions:
Related Nodes

Advanced Live Portrait | Parameter Control

Use customizable parameters to control every feature, from eye blinks to head movements, for natural results.

VACE 14B: All-in-One Video Creation & Editing

Create, edit and transform videos with the powerful VACE Wan2.1 14B.

FLUX Dev ControlNet | Multi-Condition ControlNet

Controlled FLUX Dev image generation with Pose, Depth, Canny, and ReColor

AP Workflow 12.0 | Ready-to-Use Complete AI Media Suite

Pre-set all-in-one system for image & video generation, enhancement, and manipulation. Zero setup required.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.