curl --request POST \
  --url https://direct.evolink.ai/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "gemini-3.1-flash-lite-preview",
  "messages": [
    {
      "role": "user",
      "content": "Please introduce yourself"
    }
  ]
}
'

{
  "id": "chatcmpl-20251010015944503180122WJNB8Eid",
  "model": "gemini-3.1-flash-lite-preview",
  "object": "chat.completion",
  "created": 1760032810,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! I'm pleased to introduce myself.\n\nI'm a Large Language Model, trained and developed by Google.\n\nSimply put, you can think of me as a \"smart brain\" that has been trained on massive amounts of text data and is able to understand and generate human language. My core capability is processing and generating text. Specifically, I can do the following:\n\n**1. Information Query & Knowledge Answering**\nI can act like a \"talking encyclopedia,\" answering various questions, whether they're about scientific knowledge, historical events, or everyday facts.\n\n**2. Creative Writing & Text Generation**\nI can create various types of text based on your requirements, such as:\n*   **Writing**: Poetry, stories, scripts, emails, speeches, advertising copy, etc.\n*   **Planning**: Travel plans, study outlines, event proposals, etc.\n*   **Brainstorming**: Working with you to generate new ideas and spark creativity.\n\n**3. Translation & Language Processing**\nI'm proficient in multiple languages and can provide fast, fluent translation services. I can also help you polish, proofread, summarize, or rewrite text to make your expression clearer and more professional.\n\n**4. Programming & Code Assistance**\nI can write code snippets, explain code logic, debug errors, or \"translate\" code from one programming language to another, making me a helpful companion for programmers.\n\n**5. Logical Analysis & Reasoning**\nI can help you analyze complex problems, organize logical chains, and make inferences and summaries based on the information you provide.\n\n---\n\n**In summary**, my goal is to be a powerful and useful tool that helps you obtain information more efficiently, complete tasks, and spark creativity through natural language communication.\n\n**Remember:** I'm an artificial intelligence, my knowledge comes from the data I've learned, and it may not be the most up-to-date. Sometimes I may also make mistakes, so for very important information, I recommend you verify it again.",
        "tool_calls": [
          {
            "id": "<string>",
            "function": {
              "name": "<string>",
              "arguments": "<string>"
            }
          }
        ]
      },
      "logprobs": {
        "content": [
          {
            "token": "<string>",
            "logprob": 123,
            "bytes": [
              123
            ],
            "top_logprobs": [
              {
                "token": "<string>",
                "logprob": 123,
                "bytes": [
                  123
                ]
              }
            ]
          }
        ]
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 13,
    "completion_tokens": 1891,
    "total_tokens": 1904,
    "prompt_tokens_details": {
      "cached_tokens": 0,
      "text_tokens": 13,
      "audio_tokens": 0,
      "image_tokens": 0
    },
    "completion_tokens_details": {
      "text_tokens": 0,
      "audio_tokens": 0,
      "reasoning_tokens": 1480
    },
    "input_tokens": 0,
    "output_tokens": 0,
    "input_tokens_details": null
  }
}

OpenAI SDK Format

Gemini 3.1 Flash Lite - OpenAI SDK - Full Reference

Call Gemini-3.1-flash-lite-preview model using OpenAI SDK format
Synchronous processing mode, returns conversation content in real-time
Plain text conversation: Single-turn or multi-turn contextual dialogue, see simple_text and multi_turn examples in code samples
System prompt: Customize AI role and behavior, see system_prompt example in code samples
Multimodal input: Supports text + image mixed input, see vision and multi_image examples in code samples

POST

chat

completions

curl --request POST \
  --url https://direct.evolink.ai/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "gemini-3.1-flash-lite-preview",
  "messages": [
    {
      "role": "user",
      "content": "Please introduce yourself"
    }
  ]
}
'

{
  "id": "chatcmpl-20251010015944503180122WJNB8Eid",
  "model": "gemini-3.1-flash-lite-preview",
  "object": "chat.completion",
  "created": 1760032810,
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! I'm pleased to introduce myself.\n\nI'm a Large Language Model, trained and developed by Google.\n\nSimply put, you can think of me as a \"smart brain\" that has been trained on massive amounts of text data and is able to understand and generate human language. My core capability is processing and generating text. Specifically, I can do the following:\n\n**1. Information Query & Knowledge Answering**\nI can act like a \"talking encyclopedia,\" answering various questions, whether they're about scientific knowledge, historical events, or everyday facts.\n\n**2. Creative Writing & Text Generation**\nI can create various types of text based on your requirements, such as:\n*   **Writing**: Poetry, stories, scripts, emails, speeches, advertising copy, etc.\n*   **Planning**: Travel plans, study outlines, event proposals, etc.\n*   **Brainstorming**: Working with you to generate new ideas and spark creativity.\n\n**3. Translation & Language Processing**\nI'm proficient in multiple languages and can provide fast, fluent translation services. I can also help you polish, proofread, summarize, or rewrite text to make your expression clearer and more professional.\n\n**4. Programming & Code Assistance**\nI can write code snippets, explain code logic, debug errors, or \"translate\" code from one programming language to another, making me a helpful companion for programmers.\n\n**5. Logical Analysis & Reasoning**\nI can help you analyze complex problems, organize logical chains, and make inferences and summaries based on the information you provide.\n\n---\n\n**In summary**, my goal is to be a powerful and useful tool that helps you obtain information more efficiently, complete tasks, and spark creativity through natural language communication.\n\n**Remember:** I'm an artificial intelligence, my knowledge comes from the data I've learned, and it may not be the most up-to-date. Sometimes I may also make mistakes, so for very important information, I recommend you verify it again.",
        "tool_calls": [
          {
            "id": "<string>",
            "function": {
              "name": "<string>",
              "arguments": "<string>"
            }
          }
        ]
      },
      "logprobs": {
        "content": [
          {
            "token": "<string>",
            "logprob": 123,
            "bytes": [
              123
            ],
            "top_logprobs": [
              {
                "token": "<string>",
                "logprob": 123,
                "bytes": [
                  123
                ]
              }
            ]
          }
        ]
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 13,
    "completion_tokens": 1891,
    "total_tokens": 1904,
    "prompt_tokens_details": {
      "cached_tokens": 0,
      "text_tokens": 13,
      "audio_tokens": 0,
      "image_tokens": 0
    },
    "completion_tokens_details": {
      "text_tokens": 0,
      "audio_tokens": 0,
      "reasoning_tokens": 1480
    },
    "input_tokens": 0,
    "output_tokens": 0,
    "input_tokens_details": null
  }
}

BaseURL: The default BaseURL is https://direct.evolink.ai, which has better support for text models and long-lived connections. https://api.evolink.ai is the primary endpoint for multimodal services and serves as a fallback address for text models.

Authorizations

Authorization

string

header

required

##All APIs require Bearer Token authentication##

Get API Key:

Visit API Key Management Page to get your API Key

Add to request header:

Authorization: Bearer YOUR_API_KEY

Body

application/json

model

enum<string>

default:gemini-3.1-flash-lite-preview

required

Chat model name

Available options:

gemini-3.1-flash-lite-preview

Example:

"gemini-3.1-flash-lite-preview"

messages

object[]

required

List of chat messages, supports multi-turn dialogue and multimodal input

Minimum array length: 1

Show child attributes

stream

boolean

default:false

Whether to return response in streaming mode

true: Streaming return, receives content in real-time chunks
false: Returns complete response at once

Example:

false

max_completion_tokens

integer | null

Maximum number of completion tokens for the generated response, corresponding to Gemini's maxOutputTokens.

Required range: 1 <= x <= 65536

Example:

2000

max_tokens

integer

Maximum number of tokens for the generated response, compatible with the legacy OpenAI parameter.

Required range: 1 <= x <= 65536

Example:

2000

temperature

number

default:1

Sampling temperature, controls output randomness

Description:

Lower values (e.g., 0.2): More deterministic, focused output
Higher values (e.g., 1.5): More random, creative output

Required range: 0 <= x <= 2

Example:

0.7

top_p

number

default:1

Nucleus Sampling parameter

Description:

Controls sampling from tokens with cumulative probability
For example, 0.9 means selecting from tokens with cumulative probability up to 90%
Default: 1.0 (considers all tokens)

Recommendation: Do not adjust temperature and top_p simultaneously

Required range: 0 <= x <= 1

Example:

0.9

frequency_penalty

number | null

default:0

Frequency penalty coefficient. Range: -2.0 to 2.0. Corresponds to Gemini's frequencyPenalty.

Required range: -2 <= x <= 2

Example:

0

presence_penalty

number | null

default:0

Presence penalty coefficient. Range: -2.0 to 2.0. Corresponds to Gemini's presencePenalty.

Required range: -2 <= x <= 2

Example:

0

stop

Stop sequences. Supports a string or string array, corresponding to Gemini's stopSequences.

integer | null

default:1

Number of generated candidates.

Required range: x >= 1

Example:

1

reasoning_effort

enum<string> | null

default:medium

Limits reasoning effort. Gemini 3 supports low/high thinking levels; medium maps to the higher level and none is not supported.

Available options:

low,

medium,

high

Example:

"medium"

seed

integer | null

Random seed used to make output as reproducible as possible, corresponding to Gemini's seed.

Example:

12345

logprobs

boolean | null

default:false

Whether to return token logprob information, corresponding to Gemini's responseLogprobs.

Example:

true

top_logprobs

integer | null

Number of top logprob values returned for each token, corresponding to Gemini's logprobs.

Required range: 0 <= x <= 20

Example:

5

response_format

object

Response format settings, supporting JSON mode and JSON Schema, corresponding to Gemini's responseMimeType, responseSchema and responseJsonSchema.

Option 1
Option 2

Show child attributes

stream_options

object

Streaming response options. Can be set when stream is true.

Show child attributes

tools

object[] | null

List of tool definitions for Function Calling.

Show child attributes

tool_choice

Controls tool-calling behavior.

Available options:

none,

auto,

required

extra_body

object

Gemini extension parameters.

Show child attributes

Response

Chat completion generated successfully

string

Unique identifier for the chat completion

Example:

"chatcmpl-20251010015944503180122WJNB8Eid"

model

string

Model name actually used

Example:

"gemini-3.1-flash-lite-preview"

object

enum<string>

Response type

Available options:

chat.completion

Example:

"chat.completion"

created

integer

Creation timestamp

Example:

1760032810

choices

object[]

List of chat completion choices

Show child attributes

usage

object

Token usage statistics

Show child attributes

Gemini 3.1 Flash Lite - OpenAI SDK - Quick Start Gemini 3.1 Flash Lite - Native API - Quick Start