Generate accurate design visuals with refined control and repeatable detail.
On RunComfy, GPT Image 2 offers state-of-the-art image generation plus GPT Image 2 Image Edit for precise, instruction-driven transformations using one or more reference images. It accepts images and a text prompt, returning high-fidelity still images suitable for commercial visuals, product shots, UI mockups, and marketing assets.
| Parameter | Required | Type | Default | Range / Options | Description |
|---|---|---|---|---|---|
| images* | Yes (*) | List of images | — | Up to 10 images | Reference images to edit, up to 10 images |
| prompt* | Yes (*) | String | — | — | Text description of the desired edit |
| size | No | Enum (string) | auto | auto, 10241024, 10241536, 1536*1024 | Output size: auto (default), 10241024, 10241536, or 1536*1024 |
Billing for GPT Image 2 Image Edit on RunComfy is simple:
| Billing Unit | Price |
|---|---|
| Per image | $0.1 per image. |
Generate accurate design visuals with refined control and repeatable detail.
Create reliable, studio-grade visuals with precise color and layout control.
Transform written ideas into lifelike visuals with precise texture, light, and typography control for professional design use.
Create lifelike visuals and illustrations from text with flexible design control.
Transforms reference visuals into layout-accurate, style-consistent designs for creative workflows.
High-speed model for rapid text-to-image creation with rich detail and flexible format control.
GPT Image 2 Image Edit supports output resolutions up to approximately 4K, adhering to pixel budget limits defined by the underlying model. For image-to-image editing, it’s best to maintain aspect ratios similar to the input image to ensure fidelity. Exceeding these parameters may result in automatic resizing or reduced visual coherence.
Yes. GPT Image 2 Image Edit typically limits text prompts to a few thousand tokens and may restrict reference inputs to one primary image for image-to-image tasks. Advanced multi-reference control (e.g., ControlNet or IP-Adapter) may become available in later API versions, but current implementations prioritize single-image guidance.
To move from trial to production, export your GPT Image 2 Image Edit configurations from the RunComfy Playground, replace manual editing steps with the RunComfy API endpoint, and authenticate with your production API keys. Ensure you manage API rate limits and usd balance, as production requests consume paid computation resources, while the Playground trial uses limited free quota.
GPT Image 2 Image Edit delivers superior visual stability and instruction following compared to GPT Image 1.5. In image-to-image operations, it preserves character likeness, lighting accuracy, and scene coherence more effectively due to the new autoregressive multimodal architecture replacing prior diffusion-based systems.
GPT Image 2 Image Edit excels at reproducing accurate text overlays, including non-Latin and multilingual scripts. This represents a notable step up from older models, as it minimizes distortions and retains text legibility even after multiple image-to-image edits within the same session.
GPT Image 2 Image Edit provides excellent fidelity for photorealistic and compositional tasks, especially where precise text rendering or product labeling is critical. While Nano Banana Pro may generate slightly faster and with smoother skin texture realism in some tests, GPT Image 2 Image Edit offers stronger control and scene accuracy for commercial-grade image-to-image workflows.
GPT Image 2 Image Edit uses an autoregressive design that tracks contextual coherence across multiple generations. As a result, when performing iterative image-to-image refinements, it better preserves geometric structure, reflections, and lighting consistency compared to diffusion models that may drift across edits.
For batch-based use, GPT Image 2 Image Edit can incur slightly higher latency than lightweight generators. Developers should manage request concurrency and consider smaller image-to-image batch sizes for efficiency. Caching reusable references and minimizing redundant uploads can also improve throughput in production.
GPT Image 2 Image Edit performs exceptionally well for professional product visuals, UX/UI mockups, and branding updates where faithful preservation of composition is essential. Its image-to-image editing mode lets users cleanly transform existing imagery, refine object placement, or modify colors and text overlays while retaining visual fidelity.
RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.





