Create precise, consistent visuals with 4K detail and adaptive text-to-image rendering for design and production needs.
On RunComfy, GPT Image 2 offers state-of-the-art image generation plus GPT Image 2 Image Edit for precise, instruction-driven transformations using one or more reference images. It accepts images and a text prompt, returning high-fidelity still images suitable for commercial visuals, product shots, UI mockups, and marketing assets.
| Parameter | Required | Type | Default | Range / Options | Description |
|---|---|---|---|---|---|
| images* | Yes (*) | List of images | — | Up to 10 images | Reference images to edit, up to 10 images |
| prompt* | Yes (*) | String | — | — | Text description of the desired edit |
| size | No | Enum (string) | auto | auto, 10241024, 10241536, 1536*1024 | Output size: auto (default), 10241024, 10241536, or 1536*1024 |
Billing for GPT Image 2 Image Edit on RunComfy is simple:
| Billing Unit | Price |
|---|---|
| Per image | $0.1 per image. |
Create precise, consistent visuals with 4K detail and adaptive text-to-image rendering for design and production needs.
Fast, high-quality text-to-image generation with Nano Banana 2, with aspect ratio, safety tolerance, and output format controls.
Precise text rendering & multilingual edits for visual pros
Generate refined visuals with accurate lighting and text control for design work.
High-speed model for consistent visual creation and precise design control
Advanced image-to-image tool with geometry-aware edits and consistent identity control for creative workflows.
GPT Image 2 Image Edit supports output resolutions up to approximately 4K, adhering to pixel budget limits defined by the underlying model. For image-to-image editing, it’s best to maintain aspect ratios similar to the input image to ensure fidelity. Exceeding these parameters may result in automatic resizing or reduced visual coherence.
Yes. GPT Image 2 Image Edit typically limits text prompts to a few thousand tokens and may restrict reference inputs to one primary image for image-to-image tasks. Advanced multi-reference control (e.g., ControlNet or IP-Adapter) may become available in later API versions, but current implementations prioritize single-image guidance.
To move from trial to production, export your GPT Image 2 Image Edit configurations from the RunComfy Playground, replace manual editing steps with the RunComfy API endpoint, and authenticate with your production API keys. Ensure you manage API rate limits and usd balance, as production requests consume paid computation resources, while the Playground trial uses limited free quota.
GPT Image 2 Image Edit delivers superior visual stability and instruction following compared to GPT Image 1.5. In image-to-image operations, it preserves character likeness, lighting accuracy, and scene coherence more effectively due to the new autoregressive multimodal architecture replacing prior diffusion-based systems.
GPT Image 2 Image Edit excels at reproducing accurate text overlays, including non-Latin and multilingual scripts. This represents a notable step up from older models, as it minimizes distortions and retains text legibility even after multiple image-to-image edits within the same session.
GPT Image 2 Image Edit provides excellent fidelity for photorealistic and compositional tasks, especially where precise text rendering or product labeling is critical. While Nano Banana Pro may generate slightly faster and with smoother skin texture realism in some tests, GPT Image 2 Image Edit offers stronger control and scene accuracy for commercial-grade image-to-image workflows.
GPT Image 2 Image Edit uses an autoregressive design that tracks contextual coherence across multiple generations. As a result, when performing iterative image-to-image refinements, it better preserves geometric structure, reflections, and lighting consistency compared to diffusion models that may drift across edits.
For batch-based use, GPT Image 2 Image Edit can incur slightly higher latency than lightweight generators. Developers should manage request concurrency and consider smaller image-to-image batch sizes for efficiency. Caching reusable references and minimizing redundant uploads can also improve throughput in production.
GPT Image 2 Image Edit performs exceptionally well for professional product visuals, UX/UI mockups, and branding updates where faithful preservation of composition is essential. Its image-to-image editing mode lets users cleanly transform existing imagery, refine object placement, or modify colors and text overlays while retaining visual fidelity.
RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.





