> ## Documentation Index
> Fetch the complete documentation index at: https://docs.evolink.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Gemini Omni Flash Image-to-Video

> - The Gemini Omni Flash (gemini-omni-flash-image-to-video) model supports image-to-video mode, producing a video with native audio from an input image and a text prompt
- **Image input:** Provided via `image_urls`; currently only 1 image is supported
- **Duration control:** Set an integer duration of `3~10` seconds via `duration`, or pass `auto` to let the model decide
- **Aspect ratio:** Choose `16:9`, `9:16`, or `auto` via `aspect_ratio`
- **Native audio:** The model automatically generates synchronized audio for the video, no extra parameters required
- **Negative description:** Write it directly into `prompt` (e.g. `No dialogue`); this model does not provide a separate negative prompt parameter
- Asynchronous processing mode; use the returned task ID to [query the result](/en/api-manual/task-management/get-task-detail)
- The generated video link is valid for 24 hours, please save it promptly


## OpenAPI

````yaml en/api-manual/video-series/gemini-omni-flash/gemini-omni-flash-image-to-video.json POST /v1/videos/generations
openapi: 3.1.0
info:
  title: gemini-omni-flash-image-to-video API
  description: >-
    Use the Gemini Omni Flash model for image-to-video generation, producing a
    video with native audio from an input image and a text prompt
  license:
    name: MIT
  version: 1.0.0
servers:
  - url: https://api.evolink.ai
    description: Production environment
security:
  - bearerAuth: []
paths:
  /v1/videos/generations:
    post:
      tags:
        - Video Generation
      summary: gemini-omni-flash-image-to-video API
      description: >-
        - The Gemini Omni Flash (gemini-omni-flash-image-to-video) model
        supports image-to-video mode, producing a video with native audio from
        an input image and a text prompt

        - **Image input:** Provided via `image_urls`; currently only 1 image is
        supported

        - **Duration control:** Set an integer duration of `3~10` seconds via
        `duration`, or pass `auto` to let the model decide

        - **Aspect ratio:** Choose `16:9`, `9:16`, or `auto` via `aspect_ratio`

        - **Native audio:** The model automatically generates synchronized audio
        for the video, no extra parameters required

        - **Negative description:** Write it directly into `prompt` (e.g. `No
        dialogue`); this model does not provide a separate negative prompt
        parameter

        - Asynchronous processing mode; use the returned task ID to [query the
        result](/en/api-manual/task-management/get-task-detail)

        - The generated video link is valid for 24 hours, please save it
        promptly
      operationId: createGeminiOmniFlashImageToVideoGeneration
      requestBody:
        required: true
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/GeminiOmniFlashImageToVideoRequest'
            examples:
              image_to_video:
                summary: Image-to-video
                value:
                  model: gemini-omni-flash-image-to-video
                  prompt: >-
                    Have the person in the image slowly turn their head and
                    smile while the leaves in the background sway gently in the
                    breeze
                  image_urls:
                    - https://example.com/portrait.jpg
                  aspect_ratio: '16:9'
                  duration: 10
      responses:
        '200':
          description: Video task created successfully
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/VideoGenerationResponse'
        '400':
          description: Invalid request parameters
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: invalid_request
                  message: Invalid request parameters
                  type: invalid_request_error
        '401':
          description: Not authenticated, token invalid or expired
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: unauthorized
                  message: Invalid or expired token
                  type: authentication_error
        '402':
          description: Insufficient quota, top-up required
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: insufficient_quota
                  message: Insufficient quota. Please top up your account.
                  type: insufficient_quota
        '403':
          description: Access denied
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: model_access_denied
                  message: >-
                    Token does not have access to model:
                    gemini-omni-flash-image-to-video
                  type: invalid_request_error
        '429':
          description: Rate limit exceeded
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: rate_limit_exceeded
                  message: Too many requests, please try again later
                  type: rate_limit_error
        '500':
          description: Internal server error
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: internal_error
                  message: Internal server error
                  type: api_error
components:
  schemas:
    GeminiOmniFlashImageToVideoRequest:
      required:
        - model
        - prompt
      type: object
      properties:
        model:
          type: string
          description: Model name, fixed to `gemini-omni-flash-image-to-video`
          enum:
            - gemini-omni-flash-image-to-video
          example: gemini-omni-flash-image-to-video
        prompt:
          type: string
          description: >-
            Text prompt for video generation, supports both English and Chinese


            **Usage tips:**

            - Describe the subject's actions, camera movement, mood changes,
            etc.; the more specific, the more stable the result

            - Write negative requirements directly into the prompt (e.g. `No
            dialogue`, `no text on screen`); this model does not provide a
            separate negative prompt parameter
          example: >-
            Have the person in the image slowly turn their head and smile while
            the leaves in the background sway gently in the breeze
        image_urls:
          type: array
          items:
            type: string
          minItems: 1
          maxItems: 1
          description: |-
            Array of input images; currently only 1 is supported

            **Supported forms:**
            - HTTP/HTTPS image URL
            - Data URL in the form `data:image/...;base64,...`
            - Plain base64 image string

            **Format requirements:** `png`, `jpeg`, `webp` are supported
          example:
            - https://example.com/portrait.jpg
        duration:
          description: >-
            Video duration (seconds), default `10`


            **Value notes:**

            - Integer: range `3 ~ 10` seconds

            - `auto`: the model decides the output duration


            **Billing note:** The actual charge is based on the usage of the
            generated video
          oneOf:
            - type: integer
              minimum: 3
              maximum: 10
            - type: string
              enum:
                - auto
          default: 10
          example: 6
        aspect_ratio:
          type: string
          description: |-
            Video aspect ratio, default `16:9`

            **Value notes:**
            - `16:9`: landscape
            - `9:16`: portrait
            - `auto`: the model decides the aspect ratio
          enum:
            - '16:9'
            - '9:16'
            - auto
          default: '16:9'
          example: '16:9'
        callback_url:
          type: string
          description: >-
            HTTPS callback URL to notify when the task completes


            **Callback timing:**

            - Triggered when the task completes (completed), fails (failed), or
            is cancelled (cancelled)

            - Sent after billing is confirmed


            **Security restrictions:**

            - HTTPS protocol only

            - Callbacks to internal IP addresses are forbidden (127.0.0.1,
            10.x.x.x, 172.16-31.x.x, 192.168.x.x, etc.)

            - URL length must not exceed `2048` characters


            **Callback mechanism:**

            - Timeout: `10` seconds

            - Up to `3` retries on failure (retried `1`s / `2`s / `4`s after
            each failure respectively)

            - The callback body format matches the response of the task query
            endpoint

            - A 2xx status code from the callback URL is treated as success;
            other status codes trigger a retry
          format: uri
          example: https://your-domain.com/webhooks/video-task-completed
    VideoGenerationResponse:
      type: object
      properties:
        created:
          type: integer
          description: Task creation timestamp
          example: 1757169743
        id:
          type: string
          description: Task ID
          example: task-unified-1757169743-7cvnl5zw
        model:
          type: string
          description: The model name actually used
          example: gemini-omni-flash-image-to-video
        object:
          type: string
          enum:
            - video.generation.task
          description: The specific type of the task
        progress:
          type: integer
          description: Task progress percentage (0-100)
          minimum: 0
          maximum: 100
          example: 0
        status:
          type: string
          description: Task status
          enum:
            - pending
            - processing
            - completed
            - failed
          example: pending
        task_info:
          $ref: '#/components/schemas/VideoTaskInfo'
          description: Detailed video task information
        type:
          type: string
          enum:
            - text
            - image
            - audio
            - video
          description: The output type of the task
          example: video
        usage:
          $ref: '#/components/schemas/Usage'
          description: Usage and billing information
    ErrorResponse:
      type: object
      properties:
        error:
          type: object
          properties:
            code:
              type: string
              description: Error code identifier
            message:
              type: string
              description: Error description message
            type:
              type: string
              description: Error type
    VideoTaskInfo:
      type: object
      properties:
        can_cancel:
          type: boolean
          description: Whether the task can be cancelled
          example: false
        estimated_time:
          type: integer
          description: Estimated completion time (seconds)
          minimum: 0
          example: 60
    Usage:
      type: object
      description: Usage and billing information
      properties:
        billing_rule:
          type: string
          description: Billing rule
          enum:
            - per_call
            - per_token
            - per_second
          example: per_token
        credits_reserved:
          type: number
          description: Estimated credits to be consumed
          minimum: 0
          example: 102.34
        user_group:
          type: string
          description: User group category
          enum:
            - default
            - vip
          example: default
  securitySchemes:
    bearerAuth:
      type: http
      scheme: bearer
      description: >-
        ##All endpoints require authentication using a Bearer Token##


        **Get your API Key:**


        Visit the [API Key management page](https://evolink.ai/dashboard/keys)
        to obtain your API Key


        **Add it to the request header:**

        ```

        Authorization: Bearer YOUR_API_KEY

        ```

````