RunComfy

ComfyUI Node: API Qwen ImgOrVideo2Text

Class Name: APIQwenImgOrVideo2Text
Category: 🎤MW/MW-Prompt-All-In-One
Author: billwuhao (Account age: 2576 days)
Extension: ComfyUI_Prompt-All-In-One
Latest Updated: 2026-03-20
GitHub Stars: 0.05K

Table of Content

Description
APIQwenImgOrVideo2Text:
APIQwenImgOrVideo2Text Input Parameters:
APIQwenImgOrVideo2Text Output Parameters:
APIQwenImgOrVideo2Text Usage Tips:
APIQwenImgOrVideo2Text Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_Prompt-All-In-One

Install this extension via the ComfyUI Manager by searching for ComfyUI_Prompt-All-In-One

1. Click the Manager button in the main menu
2. Select the Custom Nodes Manager button
3. Enter ComfyUI_Prompt-All-In-One in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.
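One way to confirm that the node loaded after the restart is to ask the running ComfyUI server for its node registry. The sketch below assumes a local server at the default address `http://127.0.0.1:8188` and uses ComfyUI's `/object_info` endpoint, which returns a JSON object keyed by node class name; the helper names `fetch_object_info` and `node_is_registered` are illustrative and not part of the extension.

```python
import json
from urllib.request import urlopen

NODE_CLASS = "APIQwenImgOrVideo2Text"  # class name listed on this page


def node_is_registered(object_info, class_name=NODE_CLASS):
    """Return True if class_name is a key in an /object_info-style mapping."""
    return class_name in object_info


def fetch_object_info(base_url="http://127.0.0.1:8188"):
    """Fetch the node registry from a running ComfyUI server.

    The base URL is an assumption (ComfyUI's default local address);
    adjust it if your server listens elsewhere.
    """
    with urlopen(f"{base_url}/object_info", timeout=10) as resp:
        return json.load(resp)
```

With the server running, `node_is_registered(fetch_object_info())` should return `True` once the extension has installed correctly.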

API Qwen ImgOrVideo2Text Description

Transforms images/videos into descriptive text using AI for automated captioning and insights.

API Qwen ImgOrVideo2Text:

APIQwenImgOrVideo2Text converts an image or a video into descriptive text. It uses AI models to analyze the visual content and generate a coherent, contextually relevant description, so you do not have to interpret the content manually. This is useful for AI artists and content creators who want to automate captions or descriptions for their visual media. It can also improve accessibility, support content categorization, and streamline any workflow that needs visual-to-text conversion.

API Qwen ImgOrVideo2Text Input Parameters:

image_or_video

This parameter is the visual input to describe: either an image or a video. The node analyzes this content to generate the corresponding text description. The kind of input affects the complexity and detail of the result, since a video can provide extra context through motion and sequence. There are no minimum or maximum values; the input only needs to be a valid image or video file.

prompt

The prompt parameter provides additional context or guidance for text generation, such as a specific theme, style, or focus area for the description. It is optional, but a prompt can significantly influence the tone and content of the output and align it more closely with your creative vision. There are no predefined options; any coherent, relevant text string works.

API Qwen ImgOrVideo2Text Output Parameters:

description

The description output is a text string containing a detailed, contextually relevant narrative of the input image or video. It highlights the key elements, actions, or themes present in the media, which is especially useful when manual analysis is impractical.
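The inputs and output above map naturally onto ComfyUI's custom node conventions (`INPUT_TYPES`, `RETURN_TYPES`, `RETURN_NAMES`, `FUNCTION`, `CATEGORY`). The following is a minimal sketch of such an interface, not the extension's actual source: the class name `HypotheticalImgOrVideo2Text`, the `IMAGE` socket type, the method name `run`, and the placeholder return value are all assumptions made for illustration.

```python
class HypotheticalImgOrVideo2Text:
    """Illustrative stand-in; NOT the real APIQwenImgOrVideo2Text source."""

    CATEGORY = "🎤MW/MW-Prompt-All-In-One"  # category listed on this page
    RETURN_TYPES = ("STRING",)
    RETURN_NAMES = ("description",)
    FUNCTION = "run"  # hypothetical method name

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                # Socket type is an assumption; the real node may differ.
                "image_or_video": ("IMAGE",),
            },
            "optional": {
                # Free-form guidance text; empty by default.
                "prompt": ("STRING", {"multiline": True, "default": ""}),
            },
        }

    def run(self, image_or_video, prompt=""):
        # Placeholder: a real node would send the media and prompt
        # to a vision-language model and return its answer.
        focus = prompt.strip() or "general description"
        return (f"[placeholder description; focus: {focus}]",)
```

ComfyUI expects the node function to return a tuple with one element per entry in `RETURN_TYPES`, which is why the single string is wrapped in a one-element tuple.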

API Qwen ImgOrVideo2Text Usage Tips:

To achieve the best results, ensure that the input image or video is of high quality and clearly depicts the subject matter you want to describe.
Utilize the prompt parameter to guide the text generation process, especially if you have specific themes or styles in mind for the output description.

API Qwen ImgOrVideo2Text Common Errors and Solutions:

InvalidInputError

Explanation: This error occurs when the input provided is not a valid image or video file.
Solution: Verify that the input file is correctly formatted and supported by the node. Ensure that the file path is correct and that the file is accessible.
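The checks in this solution can be run before a file ever reaches the node. The sketch below is a hypothetical pre-flight helper: the function name `classify_media` and the extension lists are assumptions, and the formats the node actually supports may differ.

```python
from pathlib import Path

# Assumed extension lists; the node's real supported formats may differ.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}
VIDEO_EXTS = {".mp4", ".mov", ".webm"}


def classify_media(path):
    """Return "image" or "video" for an existing file with a known extension.

    Raises FileNotFoundError if the path is not an accessible file and
    ValueError if the extension is not in either assumed list.
    """
    p = Path(path)
    if not p.is_file():
        raise FileNotFoundError(f"No accessible file at: {p}")
    ext = p.suffix.lower()
    if ext in IMAGE_EXTS:
        return "image"
    if ext in VIDEO_EXTS:
        return "video"
    raise ValueError(f"Unsupported media extension: {ext or '(none)'}")
```

An extension check only catches obvious mistakes; a file with a valid extension can still be corrupt, so the node may reject it anyway.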

EmptyPromptWarning

Explanation: This warning indicates that the prompt parameter is empty, which may lead to a less focused text description.
Solution: Consider providing a relevant prompt to enhance the specificity and relevance of the generated text.
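If prompts are built programmatically upstream of the node, a small guard can substitute a generic prompt whenever the supplied one is blank. This is a sketch under assumptions: `resolve_prompt` and `DEFAULT_PROMPT` are hypothetical names, and the fallback wording is only an example.

```python
import warnings

# Example fallback text; choose wording that suits your workflow.
DEFAULT_PROMPT = "Describe the main subject, setting, and any visible action."


def resolve_prompt(prompt):
    """Return the stripped prompt, or DEFAULT_PROMPT if it is empty or None."""
    cleaned = (prompt or "").strip()
    if not cleaned:
        warnings.warn("Prompt is empty; using a generic fallback.", UserWarning)
        return DEFAULT_PROMPT
    return cleaned
```

For example, `resolve_prompt("  focus on lighting ")` returns `"focus on lighting"`, while `resolve_prompt("")` emits a `UserWarning` and returns the fallback text.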

API Qwen ImgOrVideo2Text Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_Prompt-All-In-One
