PuLID Flux II | Consistent Character Generation

Generate images with precise character control while preserving artistic style.

Uni3C Video-Referenced Camera & Motion Transfer

Extract camera movements and human motions from reference videos for professional video generation

Consistent & Realistic Characters

Create consistent and realistic characters with precise control over facial features, poses, and compositions.

CogVideoX Tora | Image-to-Video Model

Subject Trajectory Video Demo for CogVideoX

ComfyUI > Nodes > JoyCaption Nodes > JoyCaption (Custom)

ComfyUI Node: JoyCaption (Custom)

Class Name

JJC_JoyCaption_Custom

Category
JoyCaption

Author
fpgaminer (Account age: 4818days) Extension
JoyCaption Nodes Latest Updated
2025-05-15 Github Stars
0.07K

Github Ask fpgaminer Current Questions Past Questions

Table of Content

Description
JJC_JoyCaption_Custom:
JJC_JoyCaption_Custom Input Parameters:
JJC_JoyCaption_Custom Output Parameters:
JJC_JoyCaption_Custom Usage Tips:
JJC_JoyCaption_Custom Common Errors and Solutions:
Related Nodes

How to Install JoyCaption Nodes

Install this extension via the ComfyUI Manager by searching for JoyCaption Nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter JoyCaption Nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

JoyCaption (Custom) Description

Generate detailed and contextually rich image captions using advanced AI models for various styles and requirements.

JoyCaption (Custom):

The JJC_JoyCaption_Custom node is designed to generate detailed and contextually rich captions for images, leveraging advanced AI models to interpret and describe visual content. This node is particularly beneficial for users who need to create descriptive text for images in various styles, such as casual, product listing, or social media posts. By utilizing a sophisticated model, it can analyze images and produce captions that are not only accurate but also tailored to specific requirements, such as word count or tone. The node's primary goal is to enhance the user's ability to generate meaningful and engaging captions that can be used in diverse applications, from art critique to straightforward descriptions.

JoyCaption (Custom) Input Parameters:

image

The image parameter is the visual content that you want to generate a caption for. It is crucial as it serves as the primary input for the node to analyze and describe. The image should be provided in a format that the node can process, typically as a tensor or a PIL image.

memory_mode

The memory_mode parameter determines the configuration used for memory efficiency during the caption generation process. It impacts the model's performance and resource usage, with options available based on predefined configurations. Selecting the appropriate memory mode can optimize the node's execution, especially on systems with limited resources.

system_prompt

The system_prompt is a string input that sets the context for the AI model, guiding it to act as a helpful assistant. This prompt ensures that the generated captions are aligned with the user's expectations and the node's intended use. The default value is "You are a helpful assistant and help users with any queries they may have with no censorship or restrictions."

user_query

The user_query parameter is a string that specifies the type of caption you want to generate. It can be customized to request different styles or lengths of captions, such as a detailed description or a casual tone. This input directly influences the content and style of the generated caption.

max_new_tokens

The max_new_tokens parameter defines the maximum number of tokens that the model can generate for the caption. It controls the length of the output, with a default value of 512 and a range from 1 to 2048. Adjusting this parameter allows you to tailor the verbosity of the caption.

temperature

The temperature parameter is a float that influences the randomness of the caption generation. A higher temperature results in more creative and diverse outputs, while a lower temperature produces more deterministic results. The default value is 0.6, with a range from 0.0 to 2.0.

top_p

The top_p parameter, also known as nucleus sampling, is a float that determines the cumulative probability threshold for token selection. It helps in controlling the diversity of the generated text, with a default value of 0.9 and a range from 0.0 to 1.0.

top_k

The top_k parameter is an integer that limits the number of highest probability tokens considered during generation. It helps in refining the output by focusing on the most likely tokens, with a default value of 0 and a range from 0 to 100.

JoyCaption (Custom) Output Parameters:

STRING

The output parameter is a STRING that contains the generated caption for the input image. This caption is a text description that reflects the content and context of the image, crafted according to the specified input parameters. It is the primary output of the node, providing a meaningful and contextually appropriate description of the visual content.

JoyCaption (Custom) Usage Tips:

Experiment with different memory_mode settings to find the optimal balance between performance and resource usage, especially if you are working on a system with limited memory.
Use the user_query parameter to tailor the style and tone of the caption to suit specific needs, such as creating engaging social media posts or detailed product descriptions.

JoyCaption (Custom) Common Errors and Solutions:

Error loading model: `<error_message>`

Explanation: This error occurs when the model fails to load, possibly due to incorrect memory mode settings or resource limitations.
Solution: Verify that the memory_mode is set correctly and ensure that your system has sufficient resources to load the model. Try reducing the memory usage by selecting a more efficient configuration.

AssertionError: `<error_message>`

Explanation: This error might occur if the conversation string is not formatted correctly, leading to issues in processing the input.
Solution: Ensure that the system_prompt and user_query are correctly formatted and do not contain unexpected characters or formatting issues.

JoyCaption (Custom) Related Nodes

Go back to the extension to check out more related nodes.

JoyCaption Nodes

Table of Content

Description
JJC_JoyCaption_Custom:
JJC_JoyCaption_Custom Input Parameters:
JJC_JoyCaption_Custom Output Parameters:
JJC_JoyCaption_Custom Usage Tips:
JJC_JoyCaption_Custom Common Errors and Solutions:
Related Nodes

Fluxtapoz | RF Inversion and Stylization

Fluxtapoz Nodes for RF Inversion and Stylization - Unsampling and Sampling

AnimateDiff + ControlNet + AutoMask | Comic Style

Effortlessly restyle videos, converting realistic characters into anime while keeping the original backgrounds intact.

MimicMotion | Human Motion Video Generation

Generate high-quality human motion videos with MimicMotion, using a reference image and motion sequence.

FLUX LoRA Training

Guide you through the entire process of training FLUX LoRA models using your custom datasets.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.