RunComfy

FlashVSR | Real-Time Video Upscaler

Upscale videos fast, smooth, and super clear—no detail lost.

Flux UltraRealistic LoRA V2

Create stunningly lifelike image with Flux UltraRealistic LoRA V2

Z-Image Finetuned Models Collection | Multi-Style Generator

Create stunning, detailed images across multiple styles and moods easily.

DiffuEraser | Video Inpainting

Erase objects from videos with auto-masking and realistic reconstruction.

ComfyUI > Nodes > Sage Utils > Ollama LLM Prompt (Vision)

ComfyUI Node: Ollama LLM Prompt (Vision)

Class Name

Sage_OllamaLLMPromptVision

Category
Sage Utils/LLM/Ollama

Author
arcum42 (Account age: 6442days) Extension
Sage Utils Latest Updated
2026-05-17 Github Stars
0.03K

Github Ask arcum42 Current Questions Past Questions

Table of Content

Description
Sage_OllamaLLMPromptVision:
Sage_OllamaLLMPromptVision Input Parameters:
Sage_OllamaLLMPromptVision Output Parameters:
Sage_OllamaLLMPromptVision Usage Tips:
Sage_OllamaLLMPromptVision Common Errors and Solutions:
Related Nodes

How to Install Sage Utils

Install this extension via the ComfyUI Manager by searching for Sage Utils

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter Sage Utils in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Ollama LLM Prompt (Vision) Description

Facilitates multimodal input to generate responses from a vision-capable language model.

Ollama LLM Prompt (Vision):

The Sage_OllamaLLMPromptVision node is designed to facilitate interaction with a language model that supports multimodal input, specifically through the Ollama platform. This node allows you to send a textual prompt, optionally accompanied by an image, to a vision-capable language model. The primary goal of this node is to generate a response from the model based on the provided inputs, leveraging the model's ability to process and understand both text and visual data. This capability is particularly beneficial for tasks that require a nuanced understanding of visual context alongside textual information, such as generating descriptive narratives or providing detailed analyses of images. By integrating with Ollama, the node ensures that the model used is optimized for handling such multimodal inputs, thereby enhancing the quality and relevance of the responses generated.

Ollama LLM Prompt (Vision) Input Parameters:

prompt

The prompt parameter is a string input where you can enter the textual prompt that you wish to send to the language model. This prompt serves as the primary input for the model to generate a response. It can be a question, a statement, or any text that you want the model to process. The default value is set to a predefined vision prompt, and it supports multiline input, allowing for detailed and complex prompts.

model

The model parameter is a combo input that allows you to select the specific language model you wish to use from a list of available options. These models are sorted and must be installed via Ollama. The choice of model can significantly impact the nature and quality of the response, as different models may have varying capabilities and specializations.

image

The image parameter is an image input that lets you provide an optional image to the language model. If the model supports multimodal input, this image will be used alongside the textual prompt to generate a more informed and contextually relevant response. This parameter is particularly useful for tasks that require visual context.

seed

The seed parameter is an integer input that allows you to specify a seed value for the random number generator used by the model. This can be useful for ensuring reproducibility of results. The seed value can range from 0 to 2^32

1, with a default value of 0. Adjusting the seed can lead to variations in the model's output, even with the same input prompt and image.

Ollama LLM Prompt (Vision) Output Parameters:

initial_response

The initial_response parameter is a string output that contains the response generated by the language model based on the initial prompt and any provided image. This response reflects the model's interpretation and processing of the input data, offering insights or answers as per the prompt's requirements.

Ollama LLM Prompt (Vision) Usage Tips:

To optimize the quality of the model's response, ensure that your prompt is clear and detailed, providing sufficient context for the model to understand the task or question.
Experiment with different models available in the model parameter to find the one that best suits your specific needs, as different models may excel in different areas of understanding or creativity.

Ollama LLM Prompt (Vision) Common Errors and Solutions:

Model not found

Explanation: This error occurs when the specified model is not installed or recognized by the Ollama platform.
Solution: Ensure that the model you selected in the model parameter is correctly installed via Ollama. Check the list of available models and verify your selection.

Invalid image format

Explanation: This error arises when the provided image is in a format not supported by the node.
Solution: Convert your image to a supported format, such as JPEG or PNG, and try again. Ensure the image is correctly uploaded and accessible by the node.

Ollama LLM Prompt (Vision) Related Nodes

Go back to the extension to check out more related nodes.

Sage Utils

Table of Content

Description
Sage_OllamaLLMPromptVision:
Sage_OllamaLLMPromptVision Input Parameters:
Sage_OllamaLLMPromptVision Output Parameters:
Sage_OllamaLLMPromptVision Usage Tips:
Sage_OllamaLLMPromptVision Common Errors and Solutions:
Related Nodes

FLUX Dev ControlNet | Multi-Condition ControlNet

Controlled FLUX Dev image generation with Pose, Depth, Canny, and ReColor

Reallusion AI Render | 3D to ComfyUI Workflows Collection

ComfyUI + Reallusion = Speed, Accessibility, and Ease for 3D visuals

SAM 3 | Advanced Object Segmentation Tool

Next-gen segmentation tool for precise object masking and tracking.

FLUX Kontext LoRA | Style Transfer

Mix 13 art styles instantly or plug in custom LoRAs!

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: Ollama LLM Prompt (Vision)

Sage_OllamaLLMPromptVision

How to Install Sage Utils

Ollama LLM Prompt (Vision) Description

Ollama LLM Prompt (Vision):

Ollama LLM Prompt (Vision) Input Parameters:

prompt

model

image

seed

Ollama LLM Prompt (Vision) Output Parameters:

initial_response

Ollama LLM Prompt (Vision) Usage Tips:

Ollama LLM Prompt (Vision) Common Errors and Solutions:

Model not found

Invalid image format

Ollama LLM Prompt (Vision) Related Nodes