ComfyUI Node: Janus Pro Describe Image 🐑

Class Name: JanusProDescribeImage|Mie
Category: 🐑 JanusProCaption
Author: mie (Account age: 1888 days)
Extension: ComfyUI_CaptionThis
Latest Updated: 2025-04-22
GitHub Stars: 0.05K

Table of Contents

Description
JanusProDescribeImage|Mie:
JanusProDescribeImage|Mie Input Parameters:
JanusProDescribeImage|Mie Output Parameters:
JanusProDescribeImage|Mie Usage Tips:
JanusProDescribeImage|Mie Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_CaptionThis

Install this extension via the ComfyUI Manager by searching for ComfyUI_CaptionThis:

1. Click the Manager button in the main menu.
2. Click the Custom Nodes Manager button.
3. Enter ComfyUI_CaptionThis in the search bar and install the matching result.

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Janus Pro Describe Image 🐑 Description

Generate detailed image descriptions using advanced image processing and language modeling techniques for AI artists and creators.

Janus Pro Describe Image 🐑:

The Janus Pro Describe Image 🐑 node (class name JanusProDescribeImage|Mie) generates a detailed text description of an image. You supply an image and a question or prompt; a pre-trained model analyzes the image and produces a response guided by the input parameters. This is useful for AI artists and creators working on image captioning, content analysis, and accessibility, where visual content needs a textual representation.

Janus Pro Describe Image 🐑 Input Parameters:

model

The model parameter specifies the pre-trained MIE_JANUS_MODEL to be used for image description. This model is responsible for processing the image and generating the descriptive text. It is crucial to select a model that is well-suited for the type of images you are working with to ensure accurate and meaningful descriptions.

image

The image parameter is the visual content that you want to describe. It should be provided in a compatible format, such as a PIL image, which the node will process to generate a textual description. The quality and content of the image can significantly impact the accuracy and detail of the generated description.

question

The question parameter allows you to specify a prompt or query that guides the description process. By default, it is set to "Describe this image in detail." This parameter can be customized to focus on specific aspects of the image, such as colors, objects, or actions, thereby tailoring the output to your needs.

seed

The seed parameter is an integer that sets the random seed for the model's operations, ensuring reproducibility of results. It has a default value of 42 and can range from 0 to 0xffffffffffffffff. Using the same seed across different runs will produce consistent outputs, which is useful for debugging and comparison purposes.
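The effect of a fixed seed can be illustrated with a minimal sketch. It uses Python's random module as a stand-in for the model's sampler, which is an assumption; the node's actual seeding code is not shown on this page. The sample_tokens helper and the vocabulary are hypothetical.

```python
import random

SEED_MAX = 0xFFFFFFFFFFFFFFFF  # upper bound quoted above, equal to 2**64 - 1


def sample_tokens(seed, vocab, n=5):
    """Draw n pseudo-random tokens from vocab using a dedicated, seeded generator."""
    if not 0 <= seed <= SEED_MAX:
        raise ValueError(f"seed must be in [0, {SEED_MAX}]")
    rng = random.Random(seed)  # private generator, so global state is untouched
    return [rng.choice(vocab) for _ in range(n)]


# Hypothetical vocabulary, only for illustration.
vocab = ["a", "red", "barn", "under", "clouds", "sheep"]

first = sample_tokens(42, vocab)
second = sample_tokens(42, vocab)
print(first == second)  # the same seed reproduces the same draw
```

Two runs with seed 42 return identical token lists. Changing the seed generally changes the draw.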

temperature

The temperature parameter is a float that controls the randomness of the text generation process. It ranges from 0.0 to 1.0, with a default value of 0.1. Lower values result in more deterministic outputs, while higher values introduce more variability and creativity in the descriptions.
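As a rough illustration of why low temperatures give more deterministic text, the sketch below applies the standard temperature-scaled softmax to a few made-up logits. It is a generic example, not the node's code. It rejects a temperature of 0.0 to avoid division by zero; how the node itself treats 0.0 is not stated here.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing each one by temperature."""
    if temperature <= 0:
        raise ValueError("this sketch requires temperature > 0")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.5]  # hypothetical scores for three candidate tokens

sharp = softmax_with_temperature(logits, 0.1)  # the node's default temperature
flat = softmax_with_temperature(logits, 1.0)   # the top of the node's range

print(sharp)
print(flat)
```

At 0.1 almost all probability lands on the highest-scoring token, while at 1.0 the other tokens keep a meaningful share, which is where the extra variability comes from.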

top_p

The top_p parameter controls nucleus sampling. It is a float that sets the cumulative probability threshold for token selection during text generation: only the most probable tokens whose combined probability reaches top_p are considered. It ranges from 0.0 to 1.0, with a default value of 0.95. Lower values make the text more focused, while higher values allow more diversity.
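A minimal sketch of nucleus filtering follows, assuming a small made-up probability table. The nucleus_filter helper is hypothetical and shows the general technique rather than the node's implementation.

```python
def nucleus_filter(probs, top_p):
    """Keep the most probable tokens until their cumulative mass reaches top_p,
    then renormalise the kept probabilities so they sum to 1."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept = []
    cumulative = 0.0
    for token, p in ranked:
        kept.append((token, p))
        cumulative += p
        if cumulative >= top_p:
            break
    total = sum(p for _, p in kept)
    return {token: p / total for token, p in kept}


# Hypothetical next-token distribution, only for illustration.
probs = {"barn": 0.5, "house": 0.3, "shed": 0.15, "castle": 0.05}

wide = nucleus_filter(probs, 0.9)    # keeps barn, house, shed
narrow = nucleus_filter(probs, 0.5)  # keeps barn only

print(sorted(wide))
print(sorted(narrow))
```

With a threshold of 0.9 the least likely token is dropped before sampling. With 0.5 only the single most likely token remains.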

max_new_tokens

The max_new_tokens parameter is an integer that sets the maximum number of tokens to be generated in the description. It ranges from 1 to 2048, with a default value of 512. This parameter controls the length of the output, allowing you to generate concise or detailed descriptions as needed.

keep_model_loaded

The keep_model_loaded parameter is a boolean that determines whether the model should remain loaded in memory after processing. By default, it is set to True, which can improve performance for batch processing or repeated use. Setting it to False will offload the model to free up resources.

Janus Pro Describe Image 🐑 Output Parameters:

text

The text output parameter is a string that contains the generated description of the input image. This text is the result of the model's analysis and processing, providing a detailed and contextually relevant description based on the input parameters. The output can be used for various applications, such as enhancing image metadata, improving accessibility, or serving as input for further creative processes.
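For readers who drive ComfyUI through its API-format workflow JSON, the inputs above might be written as in the sketch below. The class name, input names, and default values come from this page. The node ids "1", "2", and "3" are placeholders, and the upstream nodes that the model and image links point to (a Janus model loader and an image loader) are assumed and not defined in this fragment.

```python
import json

# One node entry in ComfyUI's API-format workflow: a class_type plus its inputs.
# A link to another node's output is written as [source_node_id, output_index].
describe_node = {
    "class_type": "JanusProDescribeImage|Mie",
    "inputs": {
        "model": ["1", 0],  # placeholder link to a model loader node
        "image": ["2", 0],  # placeholder link to an image loader node
        "question": "Describe this image in detail.",
        "seed": 42,
        "temperature": 0.1,
        "top_p": 0.95,
        "max_new_tokens": 512,
        "keep_model_loaded": True,
    },
}

# Fragment only: nodes "1" and "2" must exist in the full workflow.
workflow_fragment = {"3": describe_node}

print(json.dumps(workflow_fragment, indent=2, ensure_ascii=False))
```

The node's text output can then be linked into a downstream node as ["3", 0], since text is the only output listed above.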

Janus Pro Describe Image 🐑 Usage Tips:

To achieve consistent results, use the same seed value across different runs when testing or comparing outputs.
Adjust the temperature and top_p parameters to find the right balance between creativity and coherence in the generated descriptions.
Use specific and detailed prompts in the question parameter to guide the model towards generating descriptions that focus on particular aspects of the image.
Leave keep_model_loaded set to True (the default) if you plan to process multiple images in succession, as this avoids reloading the model for each image. Set it to False when you need to free memory for other nodes.

Janus Pro Describe Image 🐑 Common Errors and Solutions:

"Model not found"

Explanation: This error occurs when the specified model is not available or incorrectly loaded.
Solution: Ensure that the correct model path is specified and that the model files are present in the designated directory.

"Image format not supported"

Explanation: The input image is not in a compatible format for processing.
Solution: Convert the image to a supported format, such as a PIL image, before inputting it into the node.

"Invalid parameter value"

Explanation: One or more input parameters have values outside their allowed ranges.
Solution: Check the parameter values against their specified ranges and adjust them accordingly to ensure they fall within the valid limits.
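One way to catch out-of-range values before queuing a workflow is a small pre-flight check such as the sketch below. The ranges are the ones documented in the Input Parameters section. The find_out_of_range helper is hypothetical and is not part of the extension.

```python
# Allowed ranges as documented in the Input Parameters section.
RANGES = {
    "seed": (0, 0xFFFFFFFFFFFFFFFF),
    "temperature": (0.0, 1.0),
    "top_p": (0.0, 1.0),
    "max_new_tokens": (1, 2048),
}


def find_out_of_range(params):
    """Return one message per parameter whose value lies outside its documented range."""
    problems = []
    for name, (low, high) in RANGES.items():
        if name in params and not low <= params[name] <= high:
            problems.append(f"{name}={params[name]} is outside [{low}, {high}]")
    return problems


defaults = {"seed": 42, "temperature": 0.1, "top_p": 0.95, "max_new_tokens": 512}
print(find_out_of_range(defaults))

bad = {"temperature": 1.5, "max_new_tokens": 512}
print(find_out_of_range(bad))
```

The default values produce an empty list, while the second call reports that temperature is above its documented maximum.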

Janus Pro Describe Image 🐑 Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_CaptionThis
