## All APIs require Bearer Token authentication
Get API Key:
Visit the API Key Management Page to get your API Key.
Add to request header:
Authorization: Bearer YOUR_API_KEY

Image generation model name. Official channel with better stability and controllability, suitable for commercial scenarios.
Available options: gpt-image-2
Example: "gpt-image-2"
Prompt describing the image to be generated, or describing how to edit the input image
Limits:
- Maximum length: 32000 characters (counted by Unicode code points, which works for CJK and other languages)
Example: "A beautiful colorful sunset over the ocean"
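Because the limit is counted in Unicode code points rather than bytes, a client-side check can be sketched in Python, where `len()` on a `str` already counts code points. The helper name `check_prompt` is hypothetical and not part of the API.

```python
MAX_PROMPT_CODE_POINTS = 32000  # limit stated in the documentation above


def check_prompt(prompt: str) -> int:
    """Return the prompt length in Unicode code points, or raise if it is too long."""
    length = len(prompt)  # Python's len() counts code points, not UTF-8 bytes
    if length > MAX_PROMPT_CODE_POINTS:
        raise ValueError(
            f"prompt has {length} code points, limit is {MAX_PROMPT_CODE_POINTS}"
        )
    return length
```

A two-character CJK prompt counts as 2 here even though it occupies 6 bytes in UTF-8.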
Reference image URL list for image-to-image and image editing functions
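Note:
- Number of images: 1~16
- Maximum size per image: 50MB
- Supported formats: .jpeg, .jpg, .png, .webp
- URLs should end with an image file extension (.png, .jpg)
Example:
[
"https://example.com/image1.png",
"https://example.com/image2.png"
]

Size of the generated image. Supports both ratio format and explicit pixel format; defaults to auto.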
① Ratio format (recommended, 15 options)
- 1:1: Square
- 1:2 / 2:1: Extreme portrait / landscape
- 1:3 / 3:1: Ultra portrait / landscape (3:1 limit)
- 2:3 / 3:2: Standard portrait / landscape
- 3:4 / 4:3: Classic portrait / landscape
- 4:5 / 5:4: Common social media
- 9:16 / 16:9: Mobile / desktop widescreen
- 9:21 / 21:9: Ultra-wide

② Explicit pixel format: WxH (or W×H), e.g. 1024x1024, 1536x1024, 3840×2160
- Width and height must each be a multiple of 16
- Each edge must be within [16, 3840]
- Total pixels: 655,360 ≤ width × height ≤ 8,294,400 (about 0.65 MP ~ 8.29 MP)
- Aspect ratio ≤ 3:1

③ auto: The model decides the size automatically (resolution does not apply in this mode)
Out-of-range handling:
- When a ratio and resolution combination exceeds the pixel budget, dimensions are automatically scaled down proportionally (e.g. 4K 1:1 → 2880×2880)
Example: "auto"
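The explicit pixel constraints above (multiples of 16, edges within [16, 3840], the pixel budget, and the 3:1 aspect limit) can be checked before sending a request. The following is a minimal sketch; `parse_explicit_size` is a hypothetical helper, and how the service itself responds to an invalid explicit size is not stated here.

```python
import re

MIN_EDGE, MAX_EDGE = 16, 3840
MIN_PIXELS, MAX_PIXELS = 655_360, 8_294_400


def parse_explicit_size(size: str) -> tuple[int, int]:
    """Parse 'WxH' or 'W×H' and check it against the documented constraints."""
    match = re.fullmatch(r"(\d+)[x×](\d+)", size.strip())
    if match is None:
        raise ValueError(f"not an explicit pixel size: {size!r}")
    width, height = int(match.group(1)), int(match.group(2))
    for edge in (width, height):
        if edge % 16 != 0:
            raise ValueError(f"edge {edge} is not a multiple of 16")
        if not MIN_EDGE <= edge <= MAX_EDGE:
            raise ValueError(f"edge {edge} is outside [{MIN_EDGE}, {MAX_EDGE}]")
    if not MIN_PIXELS <= width * height <= MAX_PIXELS:
        raise ValueError(f"{width * height} pixels is outside the pixel budget")
    if max(width, height) > 3 * min(width, height):
        raise ValueError("aspect ratio exceeds 3:1")
    return width, height
```

For example, `parse_explicit_size("3840×2160")` passes because 3840 × 2160 equals the upper pixel bound exactly, while `"512x512"` is rejected for falling below the lower bound.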
Resolution tier shortcut, only effective when size is a ratio; ignored in explicit pixel mode
Anchor-edge rules (the other edge is auto-derived from the size ratio and aligned to multiples of 16):
- 1K: Short edge locked at 1024
- 2K: Long edge locked at 2048
- 4K: Long edge locked at 3840

Landscape / square output dimensions (portrait dimensions are the landscape width/height swapped, e.g. 2:3 = 3:2 reversed):

| Ratio | 1K | 2K | 4K |
|---|---|---|---|
| 1:1 | 1024×1024 | 2048×2048 | 2880×2880 * |
| 2:1 | 2048×1024 | 2048×1024 | 3840×1920 |
| 3:1 | 3072×1024 | 2048×688 | 3840×1280 |
| 3:2 | 1536×1024 | 2048×1360 | 3520×2336 * |
| 4:3 | 1360×1024 | 2048×1536 | 3312×2480 * |
| 5:4 | 1280×1024 | 2048×1632 | 3216×2560 * |
| 16:9 | 1824×1024 | 2048×1152 | 3840×2160 (UHD) |
| 21:9 | 2384×1024 | 2048×880 | 3840×1648 |
* Marks combinations that are auto-downscaled to fit the pixel budget. Values are case-insensitive.
Available options: 1K, 2K, 4K
Example: "1K"
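The anchor-edge rules and the table above can be reproduced with a short calculation. This is a sketch under assumptions: the rounding behaviour (nearest multiple of 16 for the derived edge, rounding down to a multiple of 16 after a proportional downscale) is inferred from the table values, not from a documented algorithm, and `resolve_dimensions` is a hypothetical helper name.

```python
import math

MAX_PIXELS = 8_294_400              # upper pixel budget from the size rules
LONG_EDGE = {"2K": 2048, "4K": 3840}  # tiers that lock the long edge


def _nearest16(value: float) -> int:
    # Assumption: the derived edge snaps to the nearest multiple of 16.
    return int(round(value / 16)) * 16


def _floor16(value: float) -> int:
    # Assumption: downscaled edges are rounded down to a multiple of 16.
    return int(value // 16) * 16


def resolve_dimensions(ratio: str, resolution: str = "1K") -> tuple[int, int]:
    """Return (width, height) for a ratio such as '16:9' and a tier such as '4K'."""
    w_part, h_part = (int(part) for part in ratio.split(":"))
    long_part, short_part = max(w_part, h_part), min(w_part, h_part)
    tier = resolution.upper()  # tier values are case-insensitive
    if tier == "1K":
        short = 1024
        long = _nearest16(short * long_part / short_part)
    elif tier in LONG_EDGE:
        long = LONG_EDGE[tier]
        short = _nearest16(long * short_part / long_part)
    else:
        raise ValueError(f"unknown resolution tier: {resolution!r}")
    if long * short > MAX_PIXELS:
        # Scale both edges by the same factor so the area fits the budget.
        scale = math.sqrt(MAX_PIXELS / (long * short))
        long, short = _floor16(long * scale), _floor16(short * scale)
    # Portrait ratios are the landscape result with width and height swapped.
    return (short, long) if h_part > w_part else (long, short)
```

Under these assumptions, `resolve_dimensions("3:2", "4K")` yields `(3520, 2336)` and `resolve_dimensions("2:3", "1K")` yields `(1024, 1536)`, matching the table.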
Rendering quality that controls the model's "reasoning depth", directly affecting output token count and cost. Defaults to medium
| Value | Tile base | Relative cost (1024²) |
|---|---|---|
| low | 16 | ~0.11× |
| medium | 48 | 1.0× |
| high | 96 | ~4.0× |
Available options: low, medium, high
Example: "medium"
Number of images to generate, each billed independently
Note:
Required range: 1 <= x <= 10
Example: 1
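Since each image is billed independently, a rough relative cost for a request is the quality multiplier times the image count. The sketch below uses the approximate multipliers from the quality table, which are stated for 1024² output only; how cost scales with other sizes is not given here, and `relative_cost` is a hypothetical helper.

```python
# Approximate multipliers from the quality table (relative to medium at 1024²).
QUALITY_FACTOR = {"low": 0.11, "medium": 1.0, "high": 4.0}


def relative_cost(quality: str = "medium", n: int = 1) -> float:
    """Estimate cost relative to one medium-quality 1024² image."""
    if quality not in QUALITY_FACTOR:
        raise ValueError(f"unknown quality: {quality!r}")
    if not 1 <= n <= 10:
        raise ValueError("number of images must be between 1 and 10")
    return QUALITY_FACTOR[quality] * n
```

For instance, three high-quality images come to roughly 12 times the cost of a single medium-quality image at 1024².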
HTTPS callback address after task completion
Callback Timing:
Security Restrictions:
- URL length limit: 2048 characters
Callback Mechanism:
- Timeout: 10 seconds
- 3 retries on failure (retries after 1 second/2 seconds/4 seconds)
Example: "https://your-domain.com/webhooks/image-task-completed"
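To tie the parameters together, here is a minimal sketch that assembles the request headers and JSON body without sending anything. The field names `model`, `prompt`, `quality`, `n`, and `callback_url`, the JSON content type, and the helper `build_request` are assumptions made for illustration; only `size` and `resolution` are named in the text above, and the endpoint URL is not given here, so none is shown.

```python
import json
from urllib.parse import urlparse


def build_request(api_key, prompt, *, size="auto", resolution=None,
                  quality="medium", n=1, callback_url=None):
    """Return (headers, body) for an image generation request (field names assumed)."""
    if not 1 <= n <= 10:
        raise ValueError("number of images must be between 1 and 10")
    headers = {
        "Authorization": f"Bearer {api_key}",  # Bearer Token authentication
        "Content-Type": "application/json",    # assumption: JSON request body
    }
    payload = {
        "model": "gpt-image-2",
        "prompt": prompt,
        "size": size,
        "quality": quality,
        "n": n,
    }
    if resolution is not None:
        # Only effective when size is a ratio; ignored in explicit pixel mode.
        payload["resolution"] = resolution
    if callback_url is not None:
        if urlparse(callback_url).scheme != "https":
            raise ValueError("callback address must use HTTPS")
        if len(callback_url) > 2048:
            raise ValueError("callback address exceeds 2048 characters")
        payload["callback_url"] = callback_url
    return headers, json.dumps(payload, ensure_ascii=False)
```

The returned pair can be passed to any HTTP client. Because the callback is attempted with a 10 second timeout and up to three retries, the receiving endpoint should respond quickly and tolerate duplicate deliveries.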
Image generation task created successfully
Task creation timestamp
1757156493
Task ID
"task-unified-1757156493-imcg5zqt"
Actual model name used
"gpt-image-2"
Specific task type
Available options: image.generation.task
Task progress percentage (0-100)
Required range: 0 <= x <= 100
Example: 0
Task status
Available options: pending, processing, completed, failed
Example: "pending"
Asynchronous task information
Task output type
Available options: text, image, audio, video
Example: "image"
Usage and billing information