RunComfy

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

Qwen Edit 2509 MultipleAngles | Multi-View Image Creator

Turn one photo into complete multi-angle visuals instantly.

Sonic | Lip-Sync Portrait Animation

Sonic delivers advanced audio-driven lip-sync for portraits with high-quality animation.

Consistent Character Creator 3.0 | Easy Consistency, Any Angle

Make characters stay the same, every angle, strong and perfect.

ComfyUI > Nodes > HunyuanVideo-1.5 nodes > HunyuanVideo 1.5 Leo Vision Encoder Model Loader

ComfyUI Node: HunyuanVideo 1.5 Leo Vision Encoder Model Loader

Class Name

HyVideo15VisionEncoderLoader

Category
HunyuanVideoWrapper1.5

Author
yuanyuan-spec (Account age: 32days) Extension
HunyuanVideo-1.5 nodes Latest Updated
2025-12-02 Github Stars
0.02K

Github Ask yuanyuan-spec Current Questions Past Questions

Table of Content

Description
HyVideo15VisionEncoderLoader:
HyVideo15VisionEncoderLoader Input Parameters:
HyVideo15VisionEncoderLoader Output Parameters:
HyVideo15VisionEncoderLoader Usage Tips:
HyVideo15VisionEncoderLoader Common Errors and Solutions:
Related Nodes

How to Install HunyuanVideo-1.5 nodes

Install this extension via the ComfyUI Manager by searching for HunyuanVideo-1.5 nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter HunyuanVideo-1.5 nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Description

Loads and configures the vision encoder for HunyuanVideo 1.5, streamlining visual data processing.

HunyuanVideo 1.5 Leo Vision Encoder Model Loader:

The HyVideo15VisionEncoderLoader is a specialized node designed to load and configure the vision encoder component of the HunyuanVideo 1.5 Leo model suite. This node is integral for processing visual data, enabling the transformation of images into a format that can be effectively utilized by AI models for video generation and analysis. By leveraging advanced vision encoding techniques, it ensures that visual inputs are accurately interpreted and encoded, facilitating high-quality video outputs. The node is particularly beneficial for AI artists looking to incorporate sophisticated visual processing capabilities into their projects without delving into the complexities of model configuration and device management. Its primary goal is to streamline the vision encoding process, making it accessible and efficient for creative applications.

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Input Parameters:

vision_encoder

This parameter specifies the type of vision encoder to be used. It is crucial for determining the model architecture and capabilities, impacting the quality and style of the visual encoding. The default value is "siglip", which is a fixed input for the algorithm, ensuring compatibility and optimal performance.

hyvid_cfg

This configuration parameter encompasses various settings that influence the behavior of the vision encoder. It includes options for adjusting the number of videos per prompt and other model-specific configurations, allowing for tailored video generation based on user preferences.

latents_dict

This parameter contains the latent variables that are used during the encoding process. These variables are essential for capturing the underlying structure and features of the input images, directly affecting the richness and detail of the encoded output.

enable_offloading

A boolean parameter that determines whether model offloading is enabled. Offloading can help manage memory usage by transferring parts of the model to secondary storage, which is particularly useful when working with limited hardware resources. The default value is True.

reference_image

An optional parameter that allows you to provide a reference image to guide the encoding process. This can be useful for ensuring consistency in style or content across different video outputs. The default value is None.

vision_num_semantic_tokens

This integer parameter sets the number of semantic tokens used in the vision encoding process. It influences the granularity of the encoding, with higher values potentially capturing more detailed features. The default value is 729.

vision_states_dim

This parameter defines the dimensionality of the vision states, which are the intermediate representations produced during encoding. A higher dimensionality can capture more complex patterns and details, enhancing the quality of the encoded output. The default value is 1152.

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Output Parameters:

vision_states

The output parameter vision_states represents the encoded visual data in a format that can be further processed or used for video generation. These states encapsulate the essential features and patterns extracted from the input images, serving as a foundation for creating high-quality video content. The encoded vision states are crucial for ensuring that the final video output is both visually appealing and contextually relevant.

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Usage Tips:

Ensure that the vision_encoder type is set to "siglip" to maintain compatibility with the HunyuanVideo 1.5 Leo model suite, as this is the only supported type for automatic downloads.
Utilize the enable_offloading option to manage memory usage effectively, especially when working with large models or limited hardware resources.
Experiment with the vision_num_semantic_tokens and vision_states_dim parameters to find the optimal balance between detail and performance for your specific project needs.

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Common Errors and Solutions:

Unsupported vision encoder type: `<type>`

Explanation: This error occurs when an unsupported vision encoder type is specified.
Solution: Ensure that the vision_encoder type is set to "siglip", as this is the only supported type for this node.

Vision encoder path not found

Explanation: This error indicates that the specified path for the vision encoder does not exist.
Solution: Verify that the path is correct and that the necessary files are present. If using automatic download, ensure that the download process completed successfully.

Device not available

Explanation: This error occurs when the specified device for model loading is not available.
Solution: Check your hardware configuration and ensure that the specified device (e.g., GPU) is correctly set up and accessible.

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Related Nodes

Go back to the extension to check out more related nodes.

HunyuanVideo-1.5 nodes

Table of Content

Description
HyVideo15VisionEncoderLoader:
HyVideo15VisionEncoderLoader Input Parameters:
HyVideo15VisionEncoderLoader Output Parameters:
HyVideo15VisionEncoderLoader Usage Tips:
HyVideo15VisionEncoderLoader Common Errors and Solutions:
Related Nodes

MimicMotion | Human Motion Video Generation

Generate high-quality human motion videos with MimicMotion, using a reference image and motion sequence.

Reallusion AI Render | 3D to ComfyUI Workflows Collection

ComfyUI + Reallusion = Speed, Accessibility, and Ease for 3D visuals

ComfyUI Grounding | Object Tracking Workflow

Track any subject with pixel-perfect accuracy for stunning VFX results.

Z-Image Finetuned Models Collection | Multi-Style Generator

Create stunning, detailed images across multiple styles and moods easily.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: HunyuanVideo 1.5 Leo Vision Encoder Model Loader

HyVideo15VisionEncoderLoader

How to Install HunyuanVideo-1.5 nodes

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Description

HunyuanVideo 1.5 Leo Vision Encoder Model Loader:

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Input Parameters:

vision_encoder

hyvid_cfg

latents_dict

enable_offloading

reference_image

vision_num_semantic_tokens

vision_states_dim

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Output Parameters:

vision_states

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Usage Tips:

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Common Errors and Solutions:

Unsupported vision encoder type: <type>

Vision encoder path not found

Device not available

HunyuanVideo 1.5 Leo Vision Encoder Model Loader Related Nodes

Unsupported vision encoder type: `<type>`