RunComfy

Pose Control LipSync S2V | Expressive Video Generator

Turn images into talking, moving characters with pose and audio control.

DreamO | Unified Multi-Task Image Customization Framework

Perform identity, style, try-on, and multi-condition image generation from 1–3 references

Instagirl v.20 | Wan 2.2 LoRA Demo

A Wan 2.2 workflow for demoing the Instagirl LoRA by Instara.

Qwen Image Edit 2509 | Multi-Image Editor

Turn 2–3 images into one seamless, edited masterpiece instantly.

ComfyUI > Nodes > ComfyUI-GLM4 > GLM识图生成提示词

ComfyUI Node: GLM识图生成提示词

Class Name

GLM_Vision_ImageToPrompt

Category
GLM

Author
msola-ht (Account age: 2655days) Extension
ComfyUI-GLM4 Latest Updated
2026-03-17 Github Stars
0.13K

Github Ask msola-ht Current Questions Past Questions

Table of Content

Description
GLM_Vision_ImageToPrompt:
GLM_Vision_ImageToPrompt Input Parameters:
GLM_Vision_ImageToPrompt Output Parameters:
GLM_Vision_ImageToPrompt Usage Tips:
GLM_Vision_ImageToPrompt Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-GLM4

Install this extension via the ComfyUI Manager by searching for ComfyUI-GLM4

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-GLM4 in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

GLM识图生成提示词 Description

Converts images to detailed English prompts for text-to-image models using GLM-4V in ComfyUI.

GLM识图生成提示词:

GLM_Vision_ImageToPrompt is a node designed for use within ComfyUI, leveraging the capabilities of the Zhipu AI GLM-4V model to generate descriptive prompts from images. This node can process images provided via URL, Base64 encoded data, or directly from ComfyUI's IMAGE objects. Its primary function is to convert visual content into high-quality English prompts that can be used for text-to-image generation models. The node supports multiple preset image prompts, which can be loaded from a specific TXT file format, and it manages these prompts with a priority system. This ensures that users can either rely on predefined prompts or override them with custom text, providing flexibility and control over the prompt generation process. The node is particularly beneficial for AI artists looking to create detailed and creative image descriptions that capture the essence of the visual content, including aspects like objects, scenes, actions, lighting, materials, colors, composition, and artistic style.

GLM识图生成提示词 Input Parameters:

image_prompt_preset

This parameter allows you to select from a list of available image prompt presets. These presets are loaded from external files or use built-in defaults if no external files are available. The selected preset provides a structured template for generating image descriptions. The default selection is the first available preset, ensuring that there is always a fallback option.

prompt_override

This parameter is a string input where you can manually enter a custom prompt to describe the image. It has the highest priority, meaning that if you provide a value here, it will override any preset prompts. This allows for maximum customization and specificity in the generated prompt. The default is an empty string, which means the node will use the selected preset unless overridden.

GLM识图生成提示词 Output Parameters:

GETPrompt

This output parameter returns the generated prompt as a string. The prompt is a detailed and creative description of the image, suitable for use in text-to-image generation models. It captures various elements of the image, such as the main objects, scene, actions, and artistic style, providing a comprehensive textual representation of the visual content.

GLM识图生成提示词 Usage Tips:

To achieve the best results, ensure that your image is clear and well-composed, as this will help the node generate more accurate and detailed prompts.
Experiment with different preset prompts to see how they influence the generated descriptions, and use the prompt override feature for specific or unique image content that may not be covered by presets.

GLM识图生成提示词 Common Errors and Solutions:

"提供的Base64图片数据无效。"

Explanation: This error occurs when the Base64 encoded image data is invalid or cannot be decoded.
Solution: Verify that the Base64 data is correctly formatted and complete. Ensure there are no missing or extra characters.

"未能获取有效的图片数据。"

Explanation: This error indicates that the node could not retrieve valid image data from the provided inputs.
Solution: Check that the image URL is correct and accessible, or that the Base64 data is properly encoded. If using ComfyUI IMAGE objects, ensure they are correctly linked.

"识图提示词不能为空。"

Explanation: This error means that the node could not generate a prompt because no valid prompt text was available.
Solution: Ensure that either a valid preset is selected or a custom prompt is provided in the prompt override field. If using presets, verify that the external TXT file is correctly formatted and accessible.

GLM识图生成提示词 Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-GLM4

Table of Content

Description
GLM_Vision_ImageToPrompt:
GLM_Vision_ImageToPrompt Input Parameters:
GLM_Vision_ImageToPrompt Output Parameters:
GLM_Vision_ImageToPrompt Usage Tips:
GLM_Vision_ImageToPrompt Common Errors and Solutions:
Related Nodes

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

Multitalk | Realistic Talking Video Maker

One-click create multi-speaker lip-sync videos from portraits and voices!

Nunchaku Qwen Image | Multi-Image Editor

Blend and style multiple images with next-level control.

Wan 2.2 + Lightx2v V2 | Ultra Fast I2V & T2V

Dual Light LoRA setup, 4X faster.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy