Passer au contenu principal
POST
/
v1
/
audio
/
speech
Create Speech
curl --request POST \
  --url https://aihubmix.com/v1/audio/speech \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "<string>",
  "input": "<string>",
  "voice": "ash"
}
'
"<string>"

Documentation Index

Fetch the complete documentation index at: https://docs.aihubmix.com/llms.txt

Use this file to discover all available pages before exploring further.

Autorisations

Authorization
string
header
requis

Gateway-issued API key, formatted as sk-gateway-xxxxxxxx. Used by OpenAI-shaped endpoints (/v1/chat/completions, etc.).

Corps

application/json
model
requis

One of the available TTS models: tts-1, tts-1-hd, gpt-4o-mini-tts, or gpt-4o-mini-tts-2025-12-15.

input
string
requis

The text to generate audio for. The maximum length is 4096 characters.

Maximum string length: 4096
voice
requis

The voice to use when generating the audio. Supported built-in voices are alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, verse, marin, and cedar. You may also provide a custom voice object with an id, for example { "id": "voice_1234" }. Previews of the voices are available in the Text to speech guide.

Exemple:

"ash"

instructions
string

Control the voice of your generated audio with additional instructions. Does not work with tts-1 or tts-1-hd.

Maximum string length: 4096
response_format
enum<string>
défaut:mp3

The format to audio in. Supported formats are mp3, opus, aac, flac, wav, and pcm.

Options disponibles:
mp3,
opus,
aac,
flac,
wav,
pcm
speed
number
défaut:1

The speed of the generated audio. Select a value from 0.25 to 4.0. 1.0 is the default.

Plage requise: 0.25 <= x <= 4
stream_format
enum<string>
défaut:audio

The format to stream the audio in. Supported formats are sse and audio. sse is not supported for tts-1 or tts-1-hd.

Options disponibles:
sse,
audio

Réponse

OK

The response is of type file.