RunComfy

Wan 2.2 + Lightx2v V2 | Ultra Fast I2V & T2V

Dual Light LoRA setup, 4X faster.

CatVTON | Amazing Virtual Try-On

CatVTON for easy and accurate virtual try-on.

Flux 2 Dev | Photoreal Text-to-Image Generator

Next-level image realism with advanced generation control power

ReActor | Fast Face Swap

With ComfyUI ReActor, you can easily swap the faces of one or more characters in images or videos.

ComfyUI > Nodes > Qwen2.5-VL GGUF Nodes > 🖼️ Local Vision Model Loader (GGUF)

ComfyUI Node: 🖼️ Local Vision Model Loader (GGUF)

Class Name

VisionModelLoader

Category
🤖 GGUF-VLM/🖼️ Vision Models

Author
walke2019 (Account age: 2560days) Extension
Qwen2.5-VL GGUF Nodes Latest Updated
2025-12-17 Github Stars
0.03K

Github Ask walke2019 Current Questions Past Questions

Table of Content

Description
VisionModelLoader:
VisionModelLoader Input Parameters:
VisionModelLoader Output Parameters:
VisionModelLoader Usage Tips:
VisionModelLoader Common Errors and Solutions:
Related Nodes

How to Install Qwen2.5-VL GGUF Nodes

Install this extension via the ComfyUI Manager by searching for Qwen2.5-VL GGUF Nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter Qwen2.5-VL GGUF Nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

🖼️ Local Vision Model Loader (GGUF) Description

Facilitates loading and configuring vision language models in GGUF-VLM for optimized use.

🖼️ Local Vision Model Loader (GGUF):

The VisionModelLoader is a crucial component designed to facilitate the loading and configuration of vision language models within the GGUF-VLM framework. This node serves as a bridge between the model files and the inference engine, ensuring that the models are correctly loaded into memory and optimized for performance. It provides a streamlined process for selecting and configuring models, allowing you to focus on creative tasks without delving into the technical complexities of model management. By handling tasks such as model validation, preset application, and device optimization, the VisionModelLoader enhances the efficiency and reliability of deploying vision language models, making it an indispensable tool for AI artists looking to leverage advanced visual analysis capabilities in their projects.

🖼️ Local Vision Model Loader (GGUF) Input Parameters:

model

This parameter allows you to select the vision language model you wish to load. It is crucial for determining which model will be used for visual analysis tasks. The list of available models can be refreshed by clicking the "🔄 Refresh Models" button. This ensures that you have access to the latest models and configurations.

n_ctx

The n_ctx parameter specifies the context window size, which impacts how much information the model can consider at once. It is an integer value with a default of 8192, a minimum of 512, and a maximum of 32768, adjustable in steps of 512. A larger context window allows the model to process more data simultaneously, which can enhance performance in complex tasks but may require more computational resources.

device

This parameter determines the execution device for the model, with options including "Auto," "GPU," and "CPU." The default setting is "Auto," which automatically detects the best available device. Selecting "GPU" can significantly speed up processing by utilizing the graphics card, while "CPU" is suitable for systems without a dedicated GPU.

mmproj_file

The mmproj_file parameter is optional and allows you to manually specify an mmproj file. This file must match the model's visual encoder to avoid tensor errors. Providing the correct mmproj file ensures compatibility and optimal performance of the model.

🖼️ Local Vision Model Loader (GGUF) Output Parameters:

model

The output parameter model represents the loaded vision language model configuration. This configuration includes details such as the model name, path, and any applied presets or optimizations. It is essential for subsequent processing steps, as it defines the model's operational parameters and ensures that the correct model is used for visual analysis tasks.

🖼️ Local Vision Model Loader (GGUF) Usage Tips:

Ensure that the mmproj_file matches the model's visual encoder to prevent compatibility issues and tensor errors.
Utilize the "Auto" device setting to allow the system to choose the most efficient execution device, optimizing performance without manual intervention.
Regularly refresh the model list to access the latest models and configurations, ensuring you are working with the most up-to-date tools.

🖼️ Local Vision Model Loader (GGUF) Common Errors and Solutions:

"mmproj 文件必须与模型的视觉编码器匹配"

Explanation: This error occurs when the specified mmproj file does not match the model's visual encoder, leading to tensor errors.
Solution: Download the correct mmproj file that matches the model version, rename it if necessary, and specify it using the mmproj_file parameter.

"Invalid config: `<validation_errors>`"

Explanation: This error indicates that the model configuration is invalid due to incorrect parameters or settings.
Solution: Review the configuration parameters, ensure they are within the allowed ranges, and verify that all required files and settings are correctly specified.

🖼️ Local Vision Model Loader (GGUF) Related Nodes

Go back to the extension to check out more related nodes.

Qwen2.5-VL GGUF Nodes

Table of Content

Description
VisionModelLoader:
VisionModelLoader Input Parameters:
VisionModelLoader Output Parameters:
VisionModelLoader Usage Tips:
VisionModelLoader Common Errors and Solutions:
Related Nodes

Z-Image Turbo I2I for Characters | Ultimate Photorealism

Turns portraits into lifelike, perfectly detailed realistic faces fast.

ByteDance USO | Unified Style & Subject Generator

ByteDance USO makes subject and style fusion simple and powerful.

Qwen-Image Lightning | 8-Step Speed Boost

Cut generation time in half.

Flux Redux | Variation and Restyling

Official Flux Tools - Flux Redux for Image Variation and Restyling

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: 🖼️ Local Vision Model Loader (GGUF)

VisionModelLoader

How to Install Qwen2.5-VL GGUF Nodes

🖼️ Local Vision Model Loader (GGUF) Description

🖼️ Local Vision Model Loader (GGUF):

🖼️ Local Vision Model Loader (GGUF) Input Parameters:

model

n_ctx

device

mmproj_file

🖼️ Local Vision Model Loader (GGUF) Output Parameters:

model

🖼️ Local Vision Model Loader (GGUF) Usage Tips:

🖼️ Local Vision Model Loader (GGUF) Common Errors and Solutions:

"mmproj 文件必须与模型的视觉编码器匹配"

"Invalid config: <validation_errors>"

🖼️ Local Vision Model Loader (GGUF) Related Nodes

"Invalid config: `<validation_errors>`"