Gemini Omni Flash Image-to-Video

Authorizations

Authorization

string

header

required

##All endpoints require authentication using a Bearer Token##

Get your API Key:

Visit the API Key management page to obtain your API Key

Add it to the request header:

Authorization: Bearer YOUR_API_KEY

Body

application/json

model

enum<string>

required

Model name, fixed to gemini-omni-flash-image-to-video

Available options:

gemini-omni-flash-image-to-video

Example:

"gemini-omni-flash-image-to-video"

prompt

string

required

Text prompt for video generation, supports both English and Chinese

Usage tips:

Describe the subject's actions, camera movement, mood changes, etc.; the more specific, the more stable the result
Write negative requirements directly into the prompt (e.g. No dialogue, no text on screen); this model does not provide a separate negative prompt parameter

Example:

"Have the person in the image slowly turn their head and smile while the leaves in the background sway gently in the breeze"

image_urls

string[]

Array of input images; currently only 1 is supported

Supported forms:

HTTP/HTTPS image URL
Data URL in the form data:image/...;base64,...
Plain base64 image string

Format requirements: png, jpeg, webp are supported

Required array length: 1 element

Example:

["https://example.com/portrait.jpg"]

duration

default:10

Video duration (seconds), default 10

Value notes:

Integer: range 3 ~ 10 seconds
auto: the model decides the output duration

Billing note: The actual charge is based on the usage of the generated video

Required range: 3 <= x <= 10

Example:

6

aspect_ratio

enum<string>

default:16:9

Video aspect ratio, default 16:9

Value notes:

16:9: landscape
9:16: portrait
auto: the model decides the aspect ratio

Available options:

16:9,

9:16,

auto

Example:

"16:9"

callback_url

string<uri>

HTTPS callback URL to notify when the task completes

Callback timing:

Triggered when the task completes (completed), fails (failed), or is cancelled (cancelled)
Sent after billing is confirmed

Security restrictions:

HTTPS protocol only
Callbacks to internal IP addresses are forbidden (127.0.0.1, 10.x.x.x, 172.16-31.x.x, 192.168.x.x, etc.)
URL length must not exceed 2048 characters

Callback mechanism:

Timeout: 10 seconds
Up to 3 retries on failure (retried 1s / 2s / 4s after each failure respectively)
The callback body format matches the response of the task query endpoint
A 2xx status code from the callback URL is treated as success; other status codes trigger a retry

Example:

"https://your-domain.com/webhooks/video-task-completed"

Response

Video task created successfully

created

integer

Task creation timestamp

Example:

1757169743

string

Task ID

Example:

"task-unified-1757169743-7cvnl5zw"

model

string

The model name actually used

Example:

"gemini-omni-flash-image-to-video"

object

enum<string>

The specific type of the task

Available options:

video.generation.task

progress

integer

Task progress percentage (0-100)

Required range: 0 <= x <= 100

Example:

0

status

enum<string>

Task status

Available options:

pending,

processing,

completed,

failed

Example:

"pending"

task_info

object

Detailed video task information

Show child attributes

type

enum<string>

The output type of the task

Available options:

text,

image,

audio,

video

Example:

"video"

usage

object

Usage and billing information

Show child attributes