RunComfy

FLUX Img2Img | Merge Visuals and Prompts

Merge visuals and prompts for stunning, enhanced results.

Wan 2.1 | Revolutionary Video Generation

Create incredible videos from text or images with breakthrough AI running on everyday CPUs.

Z-Image | Fast Photorealistic Base Model

Super-fast image maker with stunning clarity and total control.

Flux Depth and Canny

Official Flux Tools - Flux Depth and Canny ControlNet Model

ComfyUI > Nodes > ComfyUI_QwenVL_PromptCaption > Qwen3 VL Batch Caption

ComfyUI Node: Qwen3 VL Batch Caption

Class Name

Qwen3CaptionBatch

Category
image/caption

Author
WingeD123 (Account age: 1221days) Extension
ComfyUI_QwenVL_PromptCaption Latest Updated
2026-03-23 Github Stars
0.04K

Github Ask WingeD123 Current Questions Past Questions

Table of Content

Description
Qwen3CaptionBatch:
Qwen3CaptionBatch Input Parameters:
Qwen3CaptionBatch Output Parameters:
Qwen3CaptionBatch Usage Tips:
Qwen3CaptionBatch Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_QwenVL_PromptCaption

Install this extension via the ComfyUI Manager by searching for ComfyUI_QwenVL_PromptCaption

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI_QwenVL_PromptCaption in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Qwen3 VL Batch Caption Description

Qwen3CaptionBatch automates batch image captioning in Chinese/English, enhancing productivity.

Qwen3 VL Batch Caption:

Qwen3CaptionBatch is a powerful node designed to facilitate batch processing of image captioning tasks using the Qwen3 model. This node is particularly beneficial for users who need to generate descriptive captions for a large number of images efficiently. By leveraging advanced machine learning techniques, Qwen3CaptionBatch can process multiple images simultaneously, providing detailed and contextually relevant captions in either Chinese or English. The node is optimized for performance, ensuring that it can handle high-resolution images while managing memory usage effectively. Its primary goal is to streamline the workflow for AI artists and content creators by automating the captioning process, thus saving time and enhancing productivity.

Qwen3 VL Batch Caption Input Parameters:

model_path

The model_path parameter specifies the location of the text encoder model files required for caption generation. It is crucial for loading the appropriate model components necessary for processing images. This parameter does not have a default value and must be set to a valid path where the model files are stored.

dtype

The dtype parameter determines the data type used for model processing, with options including "auto", "4bit", and "8bit". The default setting is "4bit", which is recommended for optimal performance and memory efficiency. Choosing the appropriate data type can impact the speed and resource usage of the captioning process.

keep_model_loaded

The keep_model_loaded parameter is a boolean option that dictates whether the model should remain loaded in memory after processing. By default, it is set to False, meaning the model will be unloaded to free up resources. Keeping the model loaded can be beneficial if multiple batches are processed consecutively, reducing loading times.

lang

The lang parameter allows you to select the language for the generated captions, with options for "中文" (Chinese) and "English". The default language is "中文". This parameter ensures that the captions are generated in the desired language, catering to different audience needs.

max_side

The max_side parameter sets the maximum dimension for the images being processed, with a default value of 512 pixels. It accepts values ranging from 256 to 2240 pixels, in increments of 32. This parameter helps manage memory usage and processing time by resizing images to a manageable size while maintaining quality.

image_path

The image_path parameter specifies the directory containing the images to be captioned. It is a required parameter and must point to a valid directory path. This parameter is essential for the node to locate and process the images.

save_path

The save_path parameter is optional and defines where the generated captions will be saved. If left empty, the captions will be saved in the same directory as the images. This parameter provides flexibility in organizing and storing the output files.

instruction

The instruction parameter is an optional multiline string that allows you to provide specific instructions or context for the captioning process. This can influence the style or focus of the generated captions, making them more tailored to your needs.

Qwen3 VL Batch Caption Output Parameters:

summary

The summary output parameter provides the generated captions for the batch of images processed. It is returned as a string and contains the descriptive text for each image, allowing you to easily review and utilize the captions in your projects.

Qwen3 VL Batch Caption Usage Tips:

Ensure that the model_path is correctly set to avoid errors related to model loading. Double-check the path to ensure it points to the correct directory containing the model files.
Use the max_side parameter to control the size of the images being processed. Adjusting this parameter can help balance between image quality and processing speed, especially when dealing with high-resolution images.
Consider setting keep_model_loaded to True if you plan to process multiple batches in succession, as this can reduce the time spent loading the model for each batch.

Qwen3 VL Batch Caption Common Errors and Solutions:

"0 image captioned, 共处理0张图片"

Explanation: This error occurs when the specified image_path is either empty or does not point to a valid directory containing images.
Solution: Verify that the image_path parameter is set to a correct and accessible directory path containing the images you wish to process.

"Failed to load model, 模型加载失败"

Explanation: This error indicates that the model could not be loaded from the specified model_path, possibly due to incorrect path or missing files.
Solution: Ensure that the model_path is accurate and that all necessary model files are present in the specified directory. Double-check the path and file permissions.

Qwen3 VL Batch Caption Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_QwenVL_PromptCaption

Table of Content

Description
Qwen3CaptionBatch:
Qwen3CaptionBatch Input Parameters:
Qwen3CaptionBatch Output Parameters:
Qwen3CaptionBatch Usage Tips:
Qwen3CaptionBatch Common Errors and Solutions:
Related Nodes

InstantCharacter

One photo, endless characters. Perfect identity preservation.

Character AI Ovi | Talking Avatar Generator

Turns any photo into lifelike talking avatars with emotion and voice.

Wan 2.1 Ditto | Cinematic Video Restyle Generator

Transform videos into stunning artistic styles with perfect motion flow.

Reallusion AI Render | 3D to ComfyUI Workflows Collection

ComfyUI + Reallusion = Speed, Accessibility, and Ease for 3D visuals

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: Qwen3 VL Batch Caption

Qwen3CaptionBatch

How to Install ComfyUI_QwenVL_PromptCaption

Qwen3 VL Batch Caption Description

Qwen3 VL Batch Caption:

Qwen3 VL Batch Caption Input Parameters:

model_path

dtype

keep_model_loaded

lang

max_side

image_path

save_path

instruction

Qwen3 VL Batch Caption Output Parameters:

summary

Qwen3 VL Batch Caption Usage Tips:

Qwen3 VL Batch Caption Common Errors and Solutions:

"0 image captioned, 共处理0张图片"

"Failed to load model, 模型加载失败"

Qwen3 VL Batch Caption Related Nodes