MiniMax-M3 - OpenAI-Compatible API

Authorizations

Authorization

string

header

required

##All APIs require Bearer Token authentication##

Get API Key:

Visit API Key Management Page to get your API Key

Add to request header:

Authorization: Bearer YOUR_API_KEY

Body

application/json

model

enum<string>

required

Chat model name

Available options:

MiniMax-M3

Example:

"MiniMax-M3"

messages

(System Message · object | User Message · object | Assistant Message · object | Tool Message · object)[]

required

List of conversation messages, supports multi-turn dialogue

Messages with different roles have different field structures; select the corresponding role to view

Minimum array length: 1

System Message
User Message
Assistant Message
Tool Message

Show child attributes

thinking

object

Controls deep thinking

Notes:

Defaults to adaptive: The model adaptively decides whether to engage in deep thinking based on problem difficulty
By default, thinking content is inlined in the response content (wrapped in <think>...</think> tags); to separate it into a dedicated field, use reasoning_split

Show child attributes

reasoning_split

boolean

Whether to split thinking content into a separate field

false (default): Thinking content is inlined in content, wrapped in <think>...</think> tags
true: Thinking content is split into choices[].message.reasoning_content and reasoning_details

temperature

number

default:1

Sampling temperature, controls output randomness

Notes:

Lower values (e.g. 0.2): More deterministic, focused output
Higher values (e.g. 1.5): More random, creative output
Range: [0, 2], default 1

Required range: 0 <= x <= 2

Example:

1

top_p

number

default:0.95

Nucleus Sampling parameter

Notes:

Controls sampling from tokens with cumulative probability
e.g. 0.95 means selecting from tokens reaching 95% cumulative probability
Range: [0, 1], MiniMax-M3 default 0.95

Recommendation: Do not adjust temperature and top_p simultaneously

Required range: 0 <= x <= 1

Example:

0.95

max_completion_tokens

integer

Upper limit for generated content length (in tokens)

Notes:

MiniMax-M3 recommended 131,072 (128K), maximum 524,288 (512K)
Tokens generated by thinking also count toward this limit
If generation is interrupted due to length, try increasing this value

Required range: 1 <= x <= 524288

Example:

131072

stream

boolean

default:false

Whether to return the response in streaming mode

true: Streaming response, returns content in real-time chunks via SSE (Server-Sent Events)
false: Wait for complete response before returning (default)

Example:

false

stream_options

object

Streaming response options

Only effective when stream=true

Show child attributes

tools

object[]

Tool definition list for Function Calling

Each tool requires a name, description, and parameter schema

Show child attributes

max_tokens

integer

deprecated

Legacy generation length limit parameter

Note: Deprecated, please use max_completion_tokens instead

Required range: x >= 1

Response

Chat completion successful

string

Unique identifier for the chat completion

Example:

"0668a381bdc3c0ded310e27c9a46d16e7"

model

string

Model name actually used

Example:

"MiniMax-M3"

object

enum<string>

Response type

Available options:

chat.completion

Example:

"chat.completion"

created

integer

Creation timestamp (Unix seconds)

Example:

1777026807

choices

object[]

List of chat completion choices

Show child attributes

usage

object

Token usage statistics

Show child attributes

input_sensitive

boolean

Whether the input content triggered a sensitive word filter. If the input severely violates policies, the API will return a content violation error with empty response content

input_sensitive_type

integer

Type of sensitive word triggered by input (returned when input_sensitive is true): 1 severe violation; 2 pornography; 3 advertising; 4 prohibited content; 5 abusive language; 6 violence/terrorism; 7 other

output_sensitive

boolean

Whether the output content triggered a sensitive word filter

output_sensitive_type

integer

Type of sensitive word triggered by output

base_resp

object

Status code and error details

Show child attributes

Image Series

Video Series

Audio Series

Text Series

Account Management

Task Management

File Management

MiniMax-M3 - OpenAI-Compatible API

Authorizations

Body

Response