> ## Documentation Index
> Fetch the complete documentation index at: https://docs.evolink.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Qwen Voice Design

> - Create a custom voice profile from a text description and receive the voice name and a preview audio clip
- [Qwen3 TTS VD](/en/api-manual/audio-series/qwen-tts/qwen3-tts-vd) speech synthesis **must use a voice created by this API** — system built-in voices are not supported
- Asynchronous processing mode; use the returned task ID to [query the result](/en/api-manual/task-management/get-task-detail)
- Generated audio links are valid for 24 hours — save them promptly

**Workflow:**
1. Call this API to create a voice
2. Poll the task result to obtain `result_data.voice` (the voice name)
3. Call [Qwen3 TTS VD](/en/api-manual/audio-series/qwen-tts/qwen3-tts-vd) with the `voice` parameter for speech synthesis



## OpenAPI

````yaml /en/api-manual/audio-series/qwen-tts/qwen-voice-design.json POST /v1/audios/generations
openapi: 3.1.0
info:
  title: Qwen Voice Design API
  description: >-
    Create custom voice profiles from text descriptions. Returns the voice name
    and a preview audio clip. The qwen3-tts-vd speech synthesis model requires
    voices created through this API — system built-in voices are not supported.
  license:
    name: MIT
  version: 1.0.0
servers:
  - url: https://api.evolink.ai
    description: Production environment
security:
  - bearerAuth: []
tags:
  - name: Voice Design
    description: Qwen Voice Design API endpoints
paths:
  /v1/audios/generations:
    post:
      tags:
        - Voice Design
      summary: Qwen Voice Design
      description: >-
        - Create a custom voice profile from a text description and receive the
        voice name and a preview audio clip

        - [Qwen3 TTS VD](/en/api-manual/audio-series/qwen-tts/qwen3-tts-vd)
        speech synthesis **must use a voice created by this API** — system
        built-in voices are not supported

        - Asynchronous processing mode; use the returned task ID to [query the
        result](/en/api-manual/task-management/get-task-detail)

        - Generated audio links are valid for 24 hours — save them promptly


        **Workflow:**

        1. Call this API to create a voice

        2. Poll the task result to obtain `result_data.voice` (the voice name)

        3. Call [Qwen3 TTS
        VD](/en/api-manual/audio-series/qwen-tts/qwen3-tts-vd) with the `voice`
        parameter for speech synthesis
      operationId: createQwenVoiceDesign
      requestBody:
        required: true
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/QwenVoiceDesignRequest'
            examples:
              minimal:
                summary: Minimal call
                value:
                  model: qwen-voice-design
                  voice_prompt: >-
                    A calm middle-aged male news anchor with a deep, resonant
                    voice, rich in magnetism, steady pace, and clear
                    articulation
                  preview_text: >-
                    Good evening, listeners. Welcome to the evening news
                    broadcast.
                  preferred_name: announcer
              full_params:
                summary: Full parameters
                value:
                  model: qwen-voice-design
                  voice_prompt: >-
                    A young, energetic female voice with a fast pace and
                    noticeable upward intonation, suitable for introducing
                    fashion products
                  preview_text: Hi everyone, welcome to today's fashion review!
                  preferred_name: fashion_host
                  language: en
                  sample_rate: 48000
                  response_format: mp3
      responses:
        '200':
          description: Voice design task created successfully
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/QwenVoiceDesignResponse'
        '400':
          description: Invalid request parameters
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: invalid_parameter
                  message: 'Missing required parameter: voice_prompt or preview_text'
                  type: invalid_request_error
        '401':
          description: Unauthenticated, invalid or expired token
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: unauthorized
                  message: Invalid or expired token
                  type: authentication_error
        '402':
          description: Insufficient quota, top-up required
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: insufficient_quota
                  message: Insufficient quota. Please top up your account.
                  type: insufficient_quota
        '403':
          description: Access denied
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: model_access_denied
                  message: 'Token does not have access to model: qwen-voice-design'
                  type: invalid_request_error
        '429':
          description: Rate limit exceeded
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: rate_limit_exceeded
                  message: Too many requests, please try again later
                  type: rate_limit_error
        '500':
          description: Internal server error
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/ErrorResponse'
              example:
                error:
                  code: internal_error
                  message: Internal server error
                  type: api_error
components:
  schemas:
    QwenVoiceDesignRequest:
      type: object
      required:
        - model
        - voice_prompt
        - preview_text
        - preferred_name
      properties:
        model:
          type: string
          description: Model name
          enum:
            - qwen-voice-design
          default: qwen-voice-design
          example: qwen-voice-design
        voice_prompt:
          type: string
          description: >-
            A text description of the voice characteristics used to define the
            voice profile


            **Constraints:**

            - Maximum `2048` characters

            - Supports Chinese and English only


            **Suggested dimensions:**

            - Gender: male, female, neutral

            - Age: child (5-12), teen (13-18), young adult (19-35), middle-aged
            (36-55), senior (55+)

            - Pitch: high, medium, low

            - Pace: fast, moderate, slow

            - Emotion: cheerful, calm, gentle, serious, lively, composed

            - Character: magnetic, crisp, husky, mellow, sweet, deep

            - Use case: news broadcasting, commercial narration, audiobook,
            animation character, voice assistant


            **Example descriptions:**

            - `A calm middle-aged male with a slow pace, deep magnetic voice,
            suitable for news reading or documentary narration`

            - `A cute child voice, approximately an 8-year-old girl, slightly
            childlike speech, suitable for animated character dubbing`

            - `A gentle and intellectual female, around 30 years old, calm tone,
            suitable for audiobook narration`
          maxLength: 2048
          example: >-
            A calm middle-aged male news anchor with a deep, resonant voice,
            rich in magnetism, steady pace, and clear articulation
        preview_text:
          type: string
          description: >-
            Preview text used to generate a sample audio clip


            **Constraints:**

            - Maximum `1024` characters

            - Supports 10 languages: Chinese, English, Japanese, Korean, German,
            French, Italian, Russian, Portuguese, Spanish

            - Recommended to match the `language` field
          maxLength: 1024
          example: Good evening, listeners. Welcome to the evening news broadcast.
        preferred_name:
          type: string
          description: >-
            Voice name prefix


            **Constraints:**

            - Only digits, English letters, and underscores

            - No more than `16` characters


            The generated full voice name format:
            `qwen-tts-vd-{preferred_name}-voice-{timestamp}`


            For example, passing `announcer` results in a voice name like:
            `qwen-tts-vd-announcer-voice-20260402-a1b2`
          maxLength: 16
          pattern: ^[a-zA-Z0-9_]+$
          example: announcer
        language:
          type: string
          description: >-
            Language preference for the voice profile; recommended to match
            `preview_text`


            Defaults to `zh` if not provided
          enum:
            - zh
            - en
            - ja
            - ko
            - de
            - fr
            - it
            - ru
            - pt
            - es
          example: zh
        sample_rate:
          type: integer
          description: |-
            Preview audio sample rate (Hz)

            Defaults to `24000` if not provided
          enum:
            - 8000
            - 16000
            - 24000
            - 48000
          example: 24000
        response_format:
          type: string
          description: |-
            Preview audio format

            Defaults to `wav` if not provided
          enum:
            - pcm
            - wav
            - mp3
            - opus
          example: wav
        target_model:
          type: string
          description: >-
            The TTS model that will drive the created voice


            **Important:** The `target_model` specified when creating the voice
            must match the model used in subsequent speech synthesis; otherwise
            synthesis will fail


            | Value | Description |

            |-----|------|

            | `qwen3-tts-vd-2026-01-26` | Qwen3-TTS-VD non-streaming (default) |

            | `qwen3-tts-vd-realtime-2026-01-15` | Qwen3-TTS-VD-Realtime
            bidirectional streaming (new) |

            | `qwen3-tts-vd-realtime-2025-12-16` | Qwen3-TTS-VD-Realtime
            bidirectional streaming (legacy) |


            > Currently this platform supports `qwen3-tts-vd-2026-01-26`
            (non-streaming); realtime models are not yet integrated but voices
            can be pre-created
          enum:
            - qwen3-tts-vd-2026-01-26
            - qwen3-tts-vd-realtime-2026-01-15
            - qwen3-tts-vd-realtime-2025-12-16
          default: qwen3-tts-vd-2026-01-26
          example: qwen3-tts-vd-2026-01-26
        callback_url:
          type: string
          description: >-
            HTTPS callback URL invoked when the task completes


            **Trigger conditions:**

            - Triggered when the task is completed, failed, or cancelled

            - Sent after billing confirmation


            **Security restrictions:**

            - HTTPS only

            - Internal IP addresses are blocked (127.0.0.1, 10.x.x.x,
            172.16-31.x.x, 192.168.x.x, etc.)

            - URL length must not exceed `2048` characters


            **Callback behavior:**

            - Timeout: `10` seconds

            - Up to `3` retries after failure (at 1s / 2s / 4s intervals)

            - Response body format matches the task query API response

            - A 2xx status code is considered success; other codes trigger a
            retry
          format: uri
          example: https://your-domain.com/webhooks/voice-design-completed
    QwenVoiceDesignResponse:
      type: object
      properties:
        created:
          type: integer
          description: Task creation timestamp
          example: 1775123456
        id:
          type: string
          description: Task ID
          example: task-unified-1775123456-abcd1234
        model:
          type: string
          description: Actual model name used
          example: qwen-voice-design
        object:
          type: string
          enum:
            - audio.generation.task
          description: Specific task type
        progress:
          type: integer
          description: Task progress percentage (0-100)
          minimum: 0
          maximum: 100
          example: 0
        status:
          type: string
          description: Task status
          enum:
            - pending
            - processing
            - completed
            - failed
          example: pending
        task_info:
          $ref: '#/components/schemas/AudioTaskInfo'
          description: Audio task details
        type:
          type: string
          enum:
            - audio
          description: Task output type
          example: audio
        usage:
          $ref: '#/components/schemas/AudioUsage'
          description: Usage and billing information
    ErrorResponse:
      type: object
      properties:
        error:
          type: object
          properties:
            code:
              type: string
              description: Error code identifier
            message:
              type: string
              description: Error description
            type:
              type: string
              description: Error type
    AudioTaskInfo:
      type: object
      properties:
        can_cancel:
          type: boolean
          description: Whether the task can be cancelled
          example: true
        estimated_time:
          type: integer
          description: Estimated completion time (seconds)
          minimum: 0
          example: 15
        audio_type:
          type: string
          description: Audio task type
          example: voice_design
    AudioUsage:
      type: object
      description: Usage information
      properties:
        credits_reserved:
          type: number
          description: Estimated credits to be consumed
          minimum: 0
          example: 2
  securitySchemes:
    bearerAuth:
      type: http
      scheme: bearer
      description: >-
        ##All endpoints require Bearer Token authentication##


        **Get your API Key:**


        Visit the [API Key management page](https://evolink.ai/dashboard/keys)
        to obtain your API Key


        **Add the following header to every request:**

        ```

        Authorization: Bearer YOUR_API_KEY

        ```

````