
ComfyUI Node: ASID Captioner (Inverse Prompt)

Class Name: ASID_Caption
Category: image/caption
Author: WingeD123 (Account age: 1221 days)
Extension: ComfyUI_QwenVL_PromptCaption
Latest Updated: 2026-03-23
Github Stars: 0.04K

How to Install ComfyUI_QwenVL_PromptCaption

Install this extension via the ComfyUI Manager by searching for ComfyUI_QwenVL_PromptCaption:
  • 1. Click the Manager button in the main menu.
  • 2. Click the Custom Nodes Manager button.
  • 3. Enter ComfyUI_QwenVL_PromptCaption in the search bar and install the extension from the results.
After installation, click the Restart button to restart ComfyUI. Then manually refresh your browser to clear the cache and load the updated list of nodes.


ASID Captioner (Inverse Prompt) Description

Generates detailed captions for images/videos using AI, aiding artists with descriptive text.

ASID Captioner (Inverse Prompt):

The ASID_Caption node generates detailed captions for images or videos using an AI model. It applies inverse prompting: it produces descriptive text from visual content, and that text can then be reused as a prompt in AI art applications. This automates captioning and keeps descriptions consistent across a project. The node offers several model precision settings (see dtype) and two caption languages (Chinese and English).

ASID Captioner (Inverse Prompt) Input Parameters:

model_path

This parameter specifies the path to the model used for captioning. The model determines the capabilities of the node and the quality of the generated captions. If the path is wrong or the model files are incomplete, the node fails to load the model.

dtype

The dtype parameter sets the precision at which the model is loaded, with options "auto", "4bit", and "8bit". The default is "auto", but "4bit" is strongly recommended to reduce memory usage. Lower precision trades some output fidelity for a smaller memory footprint.
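
To make the memory trade-off concrete, here is a rough back-of-the-envelope sketch. It only counts weight storage (parameters × bits ÷ 8) and ignores activations, caches, and quantization overhead. The 7-billion parameter count and the 16-bit baseline are assumptions for illustration, not figures taken from this node.

```python
# Approximate weight storage for a model at different bit widths.
# This ignores activations, KV cache, and per-block quantization metadata.
BITS_PER_WEIGHT = {"16bit (assumed baseline)": 16, "8bit": 8, "4bit": 4}


def estimate_weight_gib(num_params: float, bits: int) -> float:
    """Return approximate weight storage in GiB for num_params parameters."""
    return num_params * bits / 8 / 2**30


if __name__ == "__main__":
    hypothetical_params = 7e9  # assumption: illustrative model size only
    for label, bits in BITS_PER_WEIGHT.items():
        gib = estimate_weight_gib(hypothetical_params, bits)
        print(f"{label}: ~{gib:.1f} GiB of weights")
```

Halving the bit width halves the weight storage, which is why "4bit" is the usual choice on GPUs with limited VRAM.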

keep_model_loaded

This boolean parameter, defaulting to False, determines whether the model remains loaded in memory after execution. Keeping the model loaded can save time if you plan to run multiple captioning tasks consecutively, but it may consume more memory.

unload_other_models

Set to True by default, this boolean parameter ensures that other models are unloaded from memory when the ASID_Caption node is executed. This helps in managing memory resources efficiently, especially when working with large models.

lang

The lang parameter selects the language of the generated captions. The options are "中文" (Chinese) and "English", and the default is "中文".

seed

This integer parameter sets the random seed for the captioning process. Reusing the same seed with the same inputs makes results reproducible, while changing the seed produces different outputs.
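
The effect of a seed can be illustrated with a small sketch using Python's standard library. This is not the node's internal sampling code; `sample_words` and the word list are hypothetical and only show why a fixed seed yields repeatable choices.

```python
import random


def sample_words(seed: int, vocab: list[str], n: int = 5) -> list[str]:
    """Draw n words from vocab using a generator seeded with `seed`."""
    rng = random.Random(seed)  # independent generator, does not touch global state
    return [rng.choice(vocab) for _ in range(n)]


if __name__ == "__main__":
    vocab = ["sunset", "forest", "portrait", "neon", "watercolor", "macro"]
    first = sample_words(1234, vocab)
    second = sample_words(1234, vocab)
    print(first == second)  # same seed, same sequence of choices
```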

video_path

This parameter specifies the path to the video file to caption. It is only needed when you are captioning video content.

use_audio

A boolean parameter that indicates whether to use the audio track of the video during captioning. It defaults to True. If the video has no audio track, leaving this set to True may cause an error, so set it to False in that case.

instruction

This optional string parameter allows you to provide specific instructions or guidelines for the captioning process. It supports multiline input, enabling detailed directives.

ASID Captioner (Inverse Prompt) Output Parameters:

text

The text output parameter provides the generated caption as a string. This output is the result of the captioning process and contains the detailed description of the input image or video, formatted according to the specified language and instructions.
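
As a sketch of how the parameters above fit together, the following builds a node entry in the style of a ComfyUI API-format workflow (a mapping of node id to `class_type` and `inputs`). The input names and defaults come from this page; the paths, the seed value of 0, the instruction text, and the helper name `build_asid_caption_node` are placeholders. The real node may expose `model_path` as a dropdown or require additional inputs not listed here.

```python
import json


def build_asid_caption_node(model_path: str, video_path: str,
                            instruction: str = "", lang: str = "中文",
                            dtype: str = "auto", seed: int = 0) -> dict:
    """Return an API-format node entry for ASID_Caption (illustrative only)."""
    return {
        "class_type": "ASID_Caption",
        "inputs": {
            "model_path": model_path,
            "dtype": dtype,                 # "auto", "4bit", or "8bit"
            "keep_model_loaded": False,     # default per this page
            "unload_other_models": True,    # default per this page
            "lang": lang,                   # "中文" or "English"
            "seed": seed,                   # placeholder; the default is not documented here
            "video_path": video_path,
            "use_audio": True,              # default per this page
            "instruction": instruction,
        },
    }


if __name__ == "__main__":
    node = build_asid_caption_node(
        model_path="models/example-captioner",  # hypothetical path
        video_path="input/example.mp4",         # hypothetical path
        instruction="Describe the scene in one paragraph.",
        lang="English",
        dtype="4bit",
    )
    workflow = {"1": node}  # node ids are arbitrary string keys
    print(json.dumps(workflow, ensure_ascii=False, indent=2))
```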

ASID Captioner (Inverse Prompt) Usage Tips:

  • To reduce memory usage, use the "4bit" dtype setting, especially when working with large models.
  • Keep the keep_model_loaded parameter set to True if you plan to run multiple captioning tasks in succession to save on loading times.
  • Ensure the model_path is correctly set to avoid errors related to model loading.

ASID Captioner (Inverse Prompt) Common Errors and Solutions:

Failed to load model, 模型加载失败

  • Explanation: This error occurs when the specified model cannot be loaded, possibly due to an incorrect model path or missing files.
  • Solution: Verify that the model_path is correct and that all necessary model files are present in the specified directory.
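
A quick pre-flight check along these lines can be sketched as follows. It assumes model_path points to a directory and only checks that the directory exists and is not empty; which files a given model actually needs is not specified here, and `check_model_dir` is a hypothetical helper.

```python
from pathlib import Path


def check_model_dir(model_path: str) -> list[str]:
    """Return a list of human-readable problems found with model_path."""
    problems: list[str] = []
    path = Path(model_path)
    if not path.exists():
        problems.append(f"{path} does not exist")
    elif not path.is_dir():
        problems.append(f"{path} is not a directory")
    elif not any(path.iterdir()):
        problems.append(f"{path} is empty")
    return problems


if __name__ == "__main__":
    # Hypothetical path; replace with your own model directory.
    for problem in check_model_dir("models/example-captioner"):
        print("model_path problem:", problem)
```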

no image, 无图像

  • Explanation: This error indicates that no image was provided for captioning, which is required for the process.
  • Solution: Ensure that an image is correctly input into the node before execution.

Error due to missing audio

  • Explanation: If use_audio is set to True but the video lacks audio, this can cause an error.
  • Solution: Set use_audio to False if the video does not contain audio to prevent this error.
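
If you are unsure whether a video has an audio track, one way to check before running the node is to query it with ffprobe. The sketch below assumes ffprobe (part of FFmpeg) is installed and on your PATH; `has_audio_stream` is a hypothetical helper and is not part of the extension.

```python
import subprocess


def has_audio_stream(video_path: str, run=subprocess.run) -> bool:
    """Return True if ffprobe reports at least one audio stream in the file."""
    cmd = [
        "ffprobe", "-v", "error",
        "-select_streams", "a",               # only consider audio streams
        "-show_entries", "stream=codec_type",
        "-of", "csv=p=0",                     # one bare value per stream
        video_path,
    ]
    result = run(cmd, capture_output=True, text=True, check=True)
    return bool(result.stdout.strip())        # empty output means no audio stream


if __name__ == "__main__":
    video = "input/example.mp4"  # hypothetical path
    use_audio = has_audio_stream(video)
    print(f"Set use_audio to {use_audio} for {video}")
```

The `run` argument exists only so the function can be exercised without ffprobe, for example by passing a stub that returns canned output.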

RunComfy
Copyright 2025 RunComfy. All Rights Reserved.