Audio Speech

The /audio/speech endpoint generates audio from text using text-to-speech models. See the OpenAI documentation for a more comprehensive guide.

HTTP Request

POST /audio/speech
POST /v1/audio/speech
Host: your-aqueduct-domain.com
Authorization: Bearer YOUR_AQUEDUCT_TOKEN
Content-Type: application/json

Request Body

The request body should be a JSON object compatible with the OpenAI SpeechCreateParams schema.

Parameter	Type	Description
`model`	string	The name of the text-to-speech model to use. You can also use a model alias.
`voice`	string	The voice to use for generation.
`input`	string	The text to generate audio for.
`instructions`	string	(Optional) Instructions for how to generate the audio.
`response_format`	string	(Optional) The format to return the audio in. Default: `mp3`.
`speed`	number	(Optional) The speed of the generated audio. Default: `1.0`.

See the OpenAI documentation for a full list of parameters.

Examples

cURL Example

curl https://your-aqueduct-domain.com/audio/speech \
  -H "Authorization: Bearer YOUR_AQUEDUCT_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
      "model": "kokoro",
      "voice": "af_alloy",
      "input": "Today is a wonderful day to build something people really like!",
      "instructions": "Speak in a cheerful and positive tone."
  }' --output speech.mp3

Python Example (OpenAI SDK)

from pathlib import Path
from openai import OpenAI

client = OpenAI(
    base_url="https://your-aqueduct-domain.com/v1",
    api_key="YOUR_AQUEDUCT_TOKEN"
)

speech_file_path = Path("/tmp") / "speech.mp3"

with client.audio.speech.with_streaming_response.create(
    model="kokoro",
    voice="af_alloy",
    input="Today is a wonderful day to build something people really like!",
    instructions="Speak in a cheerful and positive tone.",
) as response:
    response.stream_to_file(speech_file_path)

Error Responses

Same as Completions endpoint.