ComfyUI > Nodes > ComfyUI_Prompt-All-In-One > API Gemini Image Gen

ComfyUI Node: API Gemini Image Gen

Class Name

APIGeminiImageGen

Category
🎤MW/MW-Prompt-All-In-One
Author
billwuhao (Account age: 2576days)
Extension
ComfyUI_Prompt-All-In-One
Latest Updated
2026-03-20
Github Stars
0.05K

How to Install ComfyUI_Prompt-All-In-One

Install this extension via the ComfyUI Manager by searching for ComfyUI_Prompt-All-In-One
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI_Prompt-All-In-One in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

API Gemini Image Gen Description

Facilitates image generation/editing via Google Vertex AI's Gemini model for creative workflows.

API Gemini Image Gen:

APIGeminiImageGen is a powerful node designed to facilitate the generation and editing of images through the Google Vertex AI platform, specifically utilizing the Gemini model. This node allows you to create or modify images synchronously by leveraging advanced AI capabilities, making it an invaluable tool for AI artists looking to enhance their creative workflows. By interfacing with the Google Vertex API, APIGeminiImageGen provides a seamless experience for generating high-quality images based on user prompts and configurations. Its primary goal is to offer a robust and flexible solution for image manipulation, enabling users to explore new creative possibilities with ease.

API Gemini Image Gen Input Parameters:

prompt

The prompt parameter is a text input that serves as the primary instruction for the image generation or editing process. It guides the AI in understanding what kind of image you want to create or modify. The prompt should be clear and descriptive to ensure the AI generates the desired output. There is no explicit minimum or maximum length, but it should be concise enough to convey the intended concept effectively.

aspect_ratio

The aspect_ratio parameter determines the proportions of the generated image. It can be set to "auto" for automatic adjustment or specified in a format like "16:9" to maintain specific dimensions. This parameter impacts the final appearance of the image, ensuring it fits the desired visual layout. The default value is "auto," which adapts to the content provided.

images

The images parameter allows you to input existing images that the AI can use as a reference or base for editing. This can be particularly useful for enhancing or transforming existing artwork. The parameter accepts a list of images, and the AI will incorporate these into the generation process to produce a coherent output.

files

The files parameter is used to input additional data files that may be relevant to the image generation process. These files can include various types of content that the AI might use to inform its output. The parameter accepts a list of files, and their inclusion can influence the final image result.

system_prompt

The system_prompt parameter provides additional instructions or context to the AI, helping to refine the image generation process. It acts as a secondary prompt that can guide the AI's understanding and execution of the task. This parameter is optional but can be beneficial for achieving more precise results.

response_modalities

The response_modalities parameter specifies the type of output you want from the AI, such as "IMAGE" or "TEXT, IMAGE." This determines whether the AI should focus solely on generating images or include textual descriptions as well. The default setting is "IMAGE," but you can adjust it based on your needs.

thinking_level

The thinking_level parameter configures the depth of processing the AI should apply during image generation. It influences the complexity and detail of the output, allowing you to tailor the AI's performance to suit different creative tasks. This parameter requires a specific value to be set, reflecting the desired level of AI engagement.

API Gemini Image Gen Output Parameters:

image_output

The image_output parameter provides the final generated or edited image as a result of the AI's processing. This output is the primary deliverable of the node, representing the visual content created based on the input parameters and prompts. It is crucial for evaluating the success of the image generation process.

text_output

The text_output parameter, if applicable, contains any textual descriptions or annotations generated alongside the image. This output is optional and depends on the response_modalities setting. It can offer additional context or insights into the image, enhancing the overall understanding of the AI's output.

thought_image_output

The thought_image_output parameter includes any intermediate images generated during the AI's "thinking" process. This output is useful for understanding the AI's creative journey and can provide valuable insights into how the final image was developed. It is particularly relevant when the thinking_level parameter is utilized.

API Gemini Image Gen Usage Tips:

  • Use clear and descriptive prompts to guide the AI effectively, ensuring the generated image aligns with your creative vision.
  • Experiment with different aspect ratios to explore various visual formats and find the one that best suits your project.
  • Leverage the system_prompt to provide additional context or instructions, refining the AI's output for more precise results.
  • Adjust the thinking_level to control the complexity of the image generation process, tailoring the AI's performance to your specific needs.

API Gemini Image Gen Common Errors and Solutions:

Invalid aspect ratio

  • Explanation: The aspect ratio provided is not in a recognized format or is unsupported by the AI.
  • Solution: Ensure the aspect ratio is specified correctly, using formats like "16:9" or set to "auto" for automatic adjustment.

Missing prompt

  • Explanation: The prompt parameter is empty or not provided, leading to a lack of guidance for the AI.
  • Solution: Provide a clear and descriptive prompt to direct the AI's image generation process.

Unsupported file type

  • Explanation: One or more files provided are in a format not supported by the AI for processing.
  • Solution: Verify that all input files are in compatible formats and re-upload them for processing.

Exceeded input file size

  • Explanation: The total size of input files exceeds the maximum allowed limit of 20 MB.
  • Solution: Reduce the size of input files or split them into smaller parts to comply with the size restrictions.

API Gemini Image Gen Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI_Prompt-All-In-One
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

API Gemini Image Gen