Available Model List
OpenAI Models
- gpt-image-1.5 - OpenAI’s latest image generation model, featuring improved instruction adherence and prompt execution capabilities.
- gpt-image-1 - A multimodal language model that accepts text and image inputs, generating image outputs.
- gpt-image-1-mini - An economical version of gpt-image-1.
- dall-e-3 - Capable of creating realistic images and artworks based on natural language descriptions, supporting the creation of new images in specified dimensions based on prompts.
Imagen Models
- imagen-4.0-ultra-generate-001
- imagen-4.0-generate-001
- imagen-4.0-fast-generate-001
- imagen-4.0-fast-generate-preview-06-06
- imagen-3.0-generate-002
Qwen Models
- qwen-image - A foundational image generation model in the Qwen series that has made significant progress in complex text rendering and precise image editing.
- qwen-image-edit - Qwen image editing model.
Doubao Models
- doubao-seedream-4-5 - ByteDance’s latest multimodal image model, integrating capabilities for text-to-image, image-to-image, and composite image outputs.
- doubao-seedream-4-0 - ByteDance’s SOTA multimodal image creation model, supporting diverse functionalities such as multi-image fusion creation, image editing, and composite image generation.
Flux Models
- flux-2-flex - Designed for real-world creative production workflows, generating high-quality images while maintaining character and style consistency among multiple references.
- flux-2-pro - An image generation and editing model developed by Black Forest Labs, supporting complex text reading and writing.
- FLUX.1-Kontext-pro - A multimodal flow matching model developed by Black Forest Labs, supporting both text-to-image generation and image editing in context.
iRAG Models
- irag-1.0 - Baidu’s self-developed image generation model capable of producing ultra-realistic images.
- ernie-irag-edit - Baidu’s self-developed image editing model, supporting operations like erase, repaint, and variation based on images.
Ideogram Models
- V3 - Ideogram AI’s image generation model, capable of quickly generating images.
API Interface
Request URL
Request Headers
Request Parameters
General Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Prompt words |
| size | string | No | Image size, supports 1K (not supported in Doubao-4-5 series), 2K, 4K, auto (default). Qwen series supports: 512*1024, 768*512, 768*1024, 1024*576, 576*1024, 1024*1024 (default) |
| image | string | No | Reference image path |
| n | integer | No | Number of images to generate, supports 1-10, default is 1 |
| quality | string | No | Rendering quality, supports low, medium, and high; higher quality takes longer time |
OpenAI Model Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| input_fidelity | string | No | Fidelity, supports high and low (default) |
| moderation | string | No | Content moderation strictness, supports auto (default, standard filtering) and low (less filtering), not supported in image-to-image mode |
| output_format | string | No | Output image format, supports png, jpeg |
Flux Model Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| safety_tolerance | integer | No | Audit leniency, higher values indicate more leniency. Range 0<=x<=5, default value is 2 |
| aspect_ratio | string | No | Desired aspect ratio for the image, supports 16:9 (default), 1:1, 4:3 |
| seed | integer | No | Random number seed |
| raw | boolean | No | Whether to enable raw mode for a more natural visual effect, default false |
Qwen Model Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| watermark | boolean | No | Whether to add a watermark to the generated image, supports true (default) and false |
| seed | integer | No | Random number seed, range [0,2147483647] |
Doubao Model Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| sequential_image_generation | string | No | Control whether to disable composite image functionality |
| sequential_image_generation_options | object | No | Composite image configuration, effective only when sequential_image_generation is auto. Currently supports max_images, range [1, 15], default value is 15 |
| watermark | boolean | No | Whether to add a watermark to the generated image, supports true (default) and false |
| seed | integer | No | Random number seed, range [-1, 2147483647], default value is -1 |
| response_format | string | No | Return format for the generated image, supports url (link valid for 24 hours after image generation, please download promptly) or base64_json |
Usage Examples
OpenAI
Google Imagen
Qwen
Doubao
Flux
flux-2-flexandflux-2-proare asynchronous endpoints, requiring a two-step request. This series is extremely fast, almost instantaneous.FLUX.1-Kontext-prouses a general interface, providing a one-step process.
FLUX One-Step Drawing
Flux Asynchronous Drawing
Step 1: Send Drawing RequestGet Image
Ideogram V3
- Versions below V3 (V_2, V_1, etc.) are old interfaces and are not supported.
- The returned links require a proxy network to access.