Save 4 hours! We auto-setup your workflow! Free!

Drop your workflow.json — we handle every dependency, custom node, and model. Just open the link and run.

Auto-Setup Workflow Json (Free) Now!
ComfyUI > Nodes > Sage Utils > Ollama LLM Prompt (Vision)

ComfyUI Node: Ollama LLM Prompt (Vision)

Class Name

Sage_OllamaLLMPromptVision

Category
Sage Utils/LLM/Ollama
Author
arcum42 (Account age: 6442days)
Extension
Sage Utils
Latest Updated
2026-05-17
Github Stars
0.03K

How to Install Sage Utils

Install this extension via the ComfyUI Manager by searching for Sage Utils
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter Sage Utils in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Ollama LLM Prompt (Vision) Description

Facilitates multimodal input to generate responses from a vision-capable language model.

Ollama LLM Prompt (Vision):

The Sage_OllamaLLMPromptVision node is designed to facilitate interaction with a language model that supports multimodal input, specifically through the Ollama platform. This node allows you to send a textual prompt, optionally accompanied by an image, to a vision-capable language model. The primary goal of this node is to generate a response from the model based on the provided inputs, leveraging the model's ability to process and understand both text and visual data. This capability is particularly beneficial for tasks that require a nuanced understanding of visual context alongside textual information, such as generating descriptive narratives or providing detailed analyses of images. By integrating with Ollama, the node ensures that the model used is optimized for handling such multimodal inputs, thereby enhancing the quality and relevance of the responses generated.

Ollama LLM Prompt (Vision) Input Parameters:

prompt

The prompt parameter is a string input where you can enter the textual prompt that you wish to send to the language model. This prompt serves as the primary input for the model to generate a response. It can be a question, a statement, or any text that you want the model to process. The default value is set to a predefined vision prompt, and it supports multiline input, allowing for detailed and complex prompts.

model

The model parameter is a combo input that allows you to select the specific language model you wish to use from a list of available options. These models are sorted and must be installed via Ollama. The choice of model can significantly impact the nature and quality of the response, as different models may have varying capabilities and specializations.

image

The image parameter is an image input that lets you provide an optional image to the language model. If the model supports multimodal input, this image will be used alongside the textual prompt to generate a more informed and contextually relevant response. This parameter is particularly useful for tasks that require visual context.

seed

The seed parameter is an integer input that allows you to specify a seed value for the random number generator used by the model. This can be useful for ensuring reproducibility of results. The seed value can range from 0 to 2^32

  • 1, with a default value of 0. Adjusting the seed can lead to variations in the model's output, even with the same input prompt and image.

Ollama LLM Prompt (Vision) Output Parameters:

initial_response

The initial_response parameter is a string output that contains the response generated by the language model based on the initial prompt and any provided image. This response reflects the model's interpretation and processing of the input data, offering insights or answers as per the prompt's requirements.

Ollama LLM Prompt (Vision) Usage Tips:

  • To optimize the quality of the model's response, ensure that your prompt is clear and detailed, providing sufficient context for the model to understand the task or question.
  • Experiment with different models available in the model parameter to find the one that best suits your specific needs, as different models may excel in different areas of understanding or creativity.

Ollama LLM Prompt (Vision) Common Errors and Solutions:

Model not found

  • Explanation: This error occurs when the specified model is not installed or recognized by the Ollama platform.
  • Solution: Ensure that the model you selected in the model parameter is correctly installed via Ollama. Check the list of available models and verify your selection.

Invalid image format

  • Explanation: This error arises when the provided image is in a format not supported by the node.
  • Solution: Convert your image to a supported format, such as JPEG or PNG, and try again. Ensure the image is correctly uploaded and accessible by the node.

Ollama LLM Prompt (Vision) Related Nodes

Go back to the extension to check out more related nodes.
Sage Utils
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Ollama LLM Prompt (Vision)