Key capabilities
- Text-to-Image — Generate images from natural language descriptions
- Image editing — Edit an existing image with a text prompt via
/v1/images/edits - Flexible resolution — Any custom size up to 4K (3840px), edges must be multiples of 16
- Near-perfect text rendering — ~99% character-level accuracy across 12+ languages
- Multi-object composition — Complex scenes without occlusion or misplacement issues
- Multi-style — Photorealism, illustration, anime, vector, 3D, data visualization
Both/v1/images/generationsand/v1/images/editssupportn=1–10(default1). Requires selecting the Direct group in the console.
Output specifications
| Property | Value |
|---|---|
| Sizes | Flexible resolution (e.g. 1024x1024, 2048x2048, 3840x2160) |
| Size constraints | Edges: multiples of 16, aspect ratio ≤ 3:1, total pixels 655,360–8,294,400 |
| Quality | low, medium, high |
| Output formats | png, jpeg |
| Input formats (edits) | png, jpeg |
Quick example
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Must be gpt-image-2 |
prompt | string | Yes | Text description of the desired image |
n | integer | No | Number of images to generate. Supports 1–10. Default: 1. Requires selecting the Direct group. |
size | string | No | {width}x{height} format. Edges must be multiples of 16, aspect ratio ≤ 3:1, total pixels 655,360–8,294,400, max edge 3,840px. Default: 1024x1024 |
quality | string | No | low, medium, high. Default: medium |
output_format | string | No | png, jpeg, webp. Default: png |
moderation | string | No | auto or low. Default: auto |
output_compression | integer | No | Compression level for jpeg (0–100). |
Image editing
Edit an existing image with a text prompt viaPOST /v1/images/edits. Use multipart/form-data and pass images as file uploads.
Supports up to 16 input images per request (image[]).
Input image formats
Supported formats: PNG, JPEG. Images must be provided as multipart/form-data file uploads. Two model IDs are available:gpt-image-2— official modelgpt-image-2-c— cost-effective variant (supportsresponse_format)
Note: When usinggpt-image-2-c, a higher quality setting may affect the stability of multi-image output (n > 1). For more consistent results when generating multiple images, use a lower quality setting.
Edit parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
image | file | Yes | Single source image (PNG or JPEG, multipart upload) |
image[] | file array | Yes* | Multiple source images — up to 16 images (PNG or JPEG). Use instead of image for multi-image input |
mask | file | No | Mask image with an alpha channel. Same format and size as source (< 50MB). Applied to the first image when multiple images are provided. Requires selecting the Direct group. |
model | string | Yes | gpt-image-2 or gpt-image-2-c |
prompt | string | Yes | Edit instruction text |
n | integer | No | Number of images to return. Supports 1–10. Default: 1. Requires selecting the Direct group. |
size | string | No | {width}x{height}. Default: 1024x1024 |
output_format | string | No | png, jpeg. Default: png |
output_compression | integer | No | Compression level for jpeg (0–100) |
response_format | string | No | url. Only supported by gpt-image-2-c. Default: url |
Use eitherimage(single file) orimage[](array) — not both.
Mask editing
Provide a mask image with an alpha channel to control which areas of the source image to edit. Pixels with white (opaque) alpha are preserved; pixels with black (transparent) alpha are edited. Requirements:- The mask must contain an Alpha channel
- Must be the same image format and same dimensions as the source image
- File size limit: < 50MB
Both mask editing and the n parameter require selecting the Direct group in the console.
cURL
Python
Generate API Reference
Interactive playground for
POST /v1/images/generations.Edit API Reference
Interactive playground for
POST /v1/images/edits.