generation_mode parameter:
- first_frame: First-frame to video. Generates a video starting from the given first frame; driving audio is optional.
- first_last_frame: First-and-last-frame to video. Generates a video by interpolating between the first and last frames; driving audio is optional.
- video_continuation: Video continuation. Continues an input video clip; an ending frame is optional, and driving audio is not supported.

generation_mode is optional (backward compatible); if omitted, an appropriate mode is selected automatically based on the materials in the request:
- image_start (first frame)
- image_start + audio_urls (first frame + driving audio)
- image_start + image_end (first + last frame)
- image_start + image_end + audio_urls (first + last frame + driving audio)
- video_urls (video continuation)
- video_urls + image_end (video continuation + last frame)

Documentation Index
Fetch the complete documentation index at: https://docs.evolink.ai/llms.txt
Use this file to discover all available pages before exploring further.
Get your API Key:
Visit the API Key management page to obtain your API Key
Add to request headers:
Authorization: Bearer YOUR_API_KEY

Model name, must be wan2.7-image-to-video
Allowed value: wan2.7-image-to-video
Example: "wan2.7-image-to-video"
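As a minimal sketch of the authentication step above, the header can be built as shown here. The helper names are hypothetical, and the JSON field name `model` is an assumption, since the page states only the required value and not the key.

```python
def build_headers(api_key):
    """Return the Authorization header in the Bearer form shown above."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    return {"Authorization": f"Bearer {api_key}"}


def base_payload():
    """Start a request body with the only model name this page accepts."""
    # Assumption: the field is called "model"; only its value is documented here.
    return {"model": "wan2.7-image-to-video"}
```

The returned dictionaries can be passed to whichever HTTP client is in use; the endpoint URL is not given in this section and is deliberately left out.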
Generation mode that determines which material combinations are valid. Explicitly specifying it is recommended
Values:
- first_frame: First-frame to video. Required: image_start. Optional: audio_urls. Not accepted: image_end, video_urls
- first_last_frame: First-and-last-frame to video. Required: image_start + image_end. Optional: audio_urls. Not accepted: video_urls
- video_continuation: Video continuation. Required: video_urls[0]. Optional: image_end (used as ending frame). Not accepted: image_start, audio_urls

Backward-compatible behavior: when generation_mode is omitted, an appropriate mode is selected automatically based on the materials in the request; explicit specification is recommended to avoid ambiguity
Allowed values: first_frame, first_last_frame, video_continuation
Example: "first_frame"
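The per-mode material rules above lend themselves to a client-side check before a request is sent. The sketch below transcribes those rules into a table and approximates the automatic mode selection; the function names are hypothetical, and the service's own selection logic may differ from this approximation.

```python
# Per-mode material rules, transcribed from the constraints listed above.
MODE_RULES = {
    "first_frame": {
        "required": {"image_start"},
        "optional": {"audio_urls"},
        "rejected": {"image_end", "video_urls"},
    },
    "first_last_frame": {
        "required": {"image_start", "image_end"},
        "optional": {"audio_urls"},
        "rejected": {"video_urls"},
    },
    "video_continuation": {
        "required": {"video_urls"},
        "optional": {"image_end"},
        "rejected": {"image_start", "audio_urls"},
    },
}

MATERIAL_FIELDS = ("image_start", "image_end", "video_urls", "audio_urls")


def present_materials(payload):
    """Return the material fields that are set to a non-empty value."""
    return {name for name in MATERIAL_FIELDS if payload.get(name)}


def infer_generation_mode(payload):
    """Guess the mode from the materials (client-side approximation only)."""
    materials = present_materials(payload)
    for mode, rules in MODE_RULES.items():
        allowed = rules["required"] | rules["optional"]
        if rules["required"] <= materials <= allowed:
            return mode
    raise ValueError(f"no mode matches materials: {sorted(materials)}")


def check_materials(payload):
    """Return a list of rule violations for the explicit or inferred mode."""
    mode = payload.get("generation_mode") or infer_generation_mode(payload)
    if mode not in MODE_RULES:
        return [f"unknown generation_mode: {mode}"]
    rules = MODE_RULES[mode]
    materials = present_materials(payload)
    problems = [f"{mode}: missing {name}"
                for name in sorted(rules["required"] - materials)]
    problems += [f"{mode}: {name} is not accepted"
                 for name in sorted(rules["rejected"] & materials)]
    return problems
```

An empty list from `check_materials` means the payload satisfies the documented combination rules; it says nothing about media limits such as file size or clip length.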
Text prompt for video generation. Supports both Chinese and English; each character/letter counts as 1, with overflow auto-truncated. Maximum length: 5000 characters
Maximum length: 5000
Example: "A cat playing piano"
Negative prompt describing what should not appear in the video. Supports both Chinese and English. Maximum length 500 characters; overflow is auto-truncated
Maximum length: 500
Example: "Blurry, low quality"
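Because overflow is truncated silently, it can help to detect or trim over-long text on the client first. The sketch below assumes that Python's `len` (one per Unicode code point) matches the "each character/letter counts as 1" rule; the helper names are hypothetical.

```python
PROMPT_MAX_CHARS = 5000           # limit for the text prompt
NEGATIVE_PROMPT_MAX_CHARS = 500   # limit for the negative prompt


def overflow_chars(text, limit):
    """Return how many characters would be dropped by truncation."""
    return max(0, len(text) - limit)


def clamp_text(text, limit):
    """Keep at most `limit` characters, counting each character as 1."""
    return text[:limit]
```

Checking `overflow_chars` before sending makes it possible to warn the user instead of losing the tail of a long prompt without notice.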
First-frame image URL
Mode constraints:
- first_frame mode: required
- first_last_frame mode: required
- video_continuation mode: not allowed

Image limits:
- Dimensions: [240, 8000] pixels
- File size: 20MB

Example: "https://example.com/first_frame.jpg"
Ending-frame image URL
Mode constraints:
- first_last_frame mode: required
- video_continuation mode: optional (acts as the ending frame for the continuation)
- first_frame mode: not allowed (use first_last_frame if both first and last frames are needed)

Image limits:
- Dimensions: [240, 8000] pixels
- File size: 20MB

Example: "https://example.com/last_frame.jpg"
Video continuation URL array. Only 1 element is supported
Mode constraints:
- video_continuation mode: required
- first_frame / first_last_frame mode: not allowed
- Cannot be combined with audio_urls

Video limits:
- Duration: 2 ~ 10 seconds (length of the input clip itself)
- Dimensions: [240, 4096] pixels
- File size: 100MB

Continuation duration rules:
- duration represents the total final output video length (input clip + model-generated continuation)
- Continuation length generated by the model: duration − input video length
- duration must be ≥ input video length
- Billing is based on the final output length (duration)

Examples:
| Input clip length | duration | Continuation generated | Final output | Billed |
|---|---|---|---|---|
| 3s | 15 | 12s | 15s | 15s |
| 5s | 10 | 5s | 10s | 10s |
| 8s | 8 | 0s (input only) | 8s | 8s |
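The arithmetic behind the table can be sketched as follows. The function name is hypothetical; the 2 ~ 10 second clip limit and the 2 ~ 15 second duration range are taken from this page, and the billed value follows the table's Billed column.

```python
def continuation_plan(input_seconds, duration):
    """Split a requested total duration into input and generated parts."""
    if not 2 <= input_seconds <= 10:
        raise ValueError("input clip must be 2 to 10 seconds long")
    if not 2 <= duration <= 15:
        raise ValueError("duration must be between 2 and 15 seconds")
    if duration < input_seconds:
        raise ValueError("duration must be >= input video length")
    return {
        "continuation": duration - input_seconds,  # seconds the model generates
        "final": duration,                         # total output length
        "billed": duration,                        # per the Billed column
    }
```

For example, `continuation_plan(3, 15)` reproduces the first table row: 12 seconds generated, 15 seconds output, 15 seconds billed.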
Example: ["https://example.com/clip.mp4"]

Driving audio URL array. Currently supports only 1 element. The model uses this audio as the driving source for video generation (e.g. lip sync, motion alignment)
Mode constraints:
- first_frame mode: optional
- first_last_frame mode: optional
- video_continuation mode: not allowed (cannot be combined with video_urls)

Format requirements:
- Format: wav, mp3
- Duration: 2 ~ 30 seconds
- File size: 15MB

Truncation handling:
- When the audio is longer than duration, the first N seconds are extracted and the rest discarded

Maximum array length: 1 element
Example: ["https://example.com/audio.mp3"]

Video quality, defaults to 720p
Options:
- 720p: Standard definition, standard price (default)
- 1080p: High definition, higher price

Allowed values: 720p, 1080p
Example: "720p"
Video duration in seconds (integer). Range 2 ~ 15, default 5
Meaning:
- first_frame / first_last_frame modes: total length of the generated video
- video_continuation mode: total length of the final output video (= original input clip + model-generated continuation)

Additional constraints in video_continuation mode:
- duration must be ≥ input video length (otherwise an error is returned)
- Continuation length generated by the model: duration − input video length
- If duration equals the input video length, no continuation is generated and the input clip is returned as-is
- See the video_urls field for details

Billing: based on the actual generated video duration
Required range: 2 <= x <= 15
Example: 5
Random seed, defaults to random
Notes:
- Range: 1 ~ 2147483647

Required range: 1 <= x <= 2147483647
Example: 42
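Since the seed defaults to random, one option is to choose it on the client so that it can be logged alongside the task. This is only a sketch: the helper name is hypothetical, and this page does not state whether reusing a seed reproduces the same video.

```python
import random

SEED_MIN = 1
SEED_MAX = 2147483647  # upper bound of the documented range


def pick_seed(seed=None):
    """Return a seed inside the documented range, drawing one if none is given."""
    if seed is None:
        return random.randint(SEED_MIN, SEED_MAX)  # both bounds inclusive
    if not isinstance(seed, int) or not SEED_MIN <= seed <= SEED_MAX:
        raise ValueError(f"seed must be an integer in [{SEED_MIN}, {SEED_MAX}]")
    return seed
```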
Whether to enable intelligent prompt rewriting. When enabled, a large model will optimize the prompt, which significantly improves results for simple or insufficiently descriptive prompts.
Note: Default is false. Omitting the field or sending false will not trigger rewriting; explicitly send true to enable.
false
HTTPS callback URL for task completion
Callback Timing:
Security Restrictions:
- URL length: 2048 characters

Callback Mechanism:
- Timeout: 10 seconds
- 3 retries after failure (retries at 1/2/4 seconds after failure)

Example: "https://your-domain.com/webhooks/video-task-completed"
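The two callback URL restrictions that survive on this page, HTTPS only and at most 2048 characters, can be checked locally before submitting a task. The sketch below uses only the standard library; the function name is hypothetical, and any further server-side restrictions are not covered.

```python
from urllib.parse import urlsplit

CALLBACK_URL_MAX_CHARS = 2048


def callback_url_problems(url):
    """Return a list of violations of the documented callback URL rules."""
    problems = []
    parts = urlsplit(url)
    if parts.scheme != "https":
        problems.append("callback URL must use HTTPS")
    if not parts.netloc:
        problems.append("callback URL has no host")
    if len(url) > CALLBACK_URL_MAX_CHARS:
        problems.append("callback URL exceeds 2048 characters")
    return problems
```

An empty list means the URL passes both documented checks. The receiving endpoint should also answer within the 10 second window mentioned above so that retries are not triggered.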
Video task created successfully
Task creation timestamp
1757169743
Task ID
"task-unified-1757169743-7cvnl5zw"
Actual model name used
"wan2.7-image-to-video"
Specific task type
video.generation.task

Task progress percentage (0-100)
Required range: 0 <= x <= 100
Example: 0
Task status
Allowed values: pending, processing, completed, failed
Example: "pending"
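A client that polls or receives callbacks needs to decide when a task is finished. The sketch below treats completed and failed as terminal, which is an inference from the status names rather than something this page states; the response keys `status` and `progress` are also assumptions, because the field names are not shown here.

```python
ACTIVE_STATUSES = {"pending", "processing"}
TERMINAL_STATUSES = {"completed", "failed"}  # assumed to be final states


def summarize_task(task):
    """Reduce a task response to done / succeeded / progress."""
    status = task["status"]  # assumed key name
    if status not in ACTIVE_STATUSES | TERMINAL_STATUSES:
        raise ValueError(f"unexpected task status: {status}")
    progress = task.get("progress", 0)  # assumed key name, documented range 0-100
    if not 0 <= progress <= 100:
        raise ValueError(f"progress out of range: {progress}")
    return {
        "done": status in TERMINAL_STATUSES,
        "succeeded": status == "completed",
        "progress": progress,
    }
```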
Detailed video task information
Task output type
Allowed values: text, image, audio, video
Example: "video"
Usage and billing information