
Audio

Endpoints for speech recognition (STT) and speech synthesis (TTS).

Speech Synthesis POST

Converts text to speech.

POST https://callxyq.xyz/v1/audio/speech

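Request parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| model | string | | TTS model, e.g. tts-1 or tts-1-hd |
| input | string | | Text to convert to speech |
| voice | string | | Voice: alloy / echo / fable / onyx / nova / shimmer |
| response_format | string | | Output format: mp3 / opus / aac / flac (default: mp3) |
| speed | number | | Speaking rate, 0.25 to 4.0 (default: 1.0) |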
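The limits listed above can be checked on the client before a request is sent. The following is a minimal sketch and not part of the callxyq API: `build_speech_payload` is a hypothetical helper, its parameter name `text` is an illustrative choice, and rejecting empty input or out-of-range values (rather than clamping them) is an assumption. It only builds the JSON body for the endpoint.

```python
import json

# Allowed values copied from the parameter list above.
VOICES = {"alloy", "echo", "fable", "onyx", "nova", "shimmer"}
FORMATS = {"mp3", "opus", "aac", "flac"}


def build_speech_payload(model, text, voice, response_format="mp3", speed=1.0):
    """Validate the arguments and return the JSON body for /v1/audio/speech.

    Hypothetical helper for illustration; the server may apply different rules.
    """
    if not text:
        # Assumption: an empty input is treated as a client-side error.
        raise ValueError("input text must not be empty")
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if not 0.25 <= speed <= 4.0:
        raise ValueError(f"speed must be between 0.25 and 4.0, got {speed}")
    return {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }


if __name__ == "__main__":
    payload = build_speech_payload(
        "tts-1", "你好,欢迎使用 callxyq。", "nova", response_format="opus", speed=1.25
    )
    # ensure_ascii=False keeps the Chinese text readable in the printed JSON.
    print(json.dumps(payload, ensure_ascii=False, indent=2))
```

The returned dictionary can be serialized with `json.dumps` and sent as the request body with a `Content-Type: application/json` header.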

Example request


bash
curl -X POST "https://callxyq.xyz/v1/audio/speech" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-xxxx" \
  -d '{
    "model": "tts-1",
    "input": "你好,欢迎使用 callxyq。",
    "voice": "alloy"
  }' \
  --output speech.mp3
python
from openai import OpenAI

client = OpenAI(
    api_key="sk-xxxx",
    base_url="https://callxyq.xyz/v1"
)

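# Stream the audio straight to disk instead of buffering it in memory.
with client.audio.speech.with_streaming_response.create(
    model="tts-1",
    voice="alloy",
    input="你好,欢迎使用 callxyq。"
) as response:
    response.stream_to_file("speech.mp3")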
javascript
import OpenAI from 'openai';
import fs from 'fs';

const client = new OpenAI({
  apiKey: 'sk-xxxx',
  baseURL: 'https://callxyq.xyz/v1',
});

const response = await client.audio.speech.create({
  model: 'tts-1',
  voice: 'alloy',
  input: '你好,欢迎使用 callxyq。',
});

// The SDK resolves to a fetch-style Response; read its bytes and save them.
const buffer = Buffer.from(await response.arrayBuffer());
fs.writeFileSync('speech.mp3', buffer);

Speech Recognition POST

Converts an audio file to text.

POST https://callxyq.xyz/v1/audio/transcriptions

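Request parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| file | file | | Audio file (mp3, mp4, wav, webm, etc.) |
| model | string | | Recognition model, e.g. whisper-1 |
| language | string | | Language code, e.g. zh |
| response_format | string | | Output format: json / text / srt / vtt |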

Example request


bash
curl -X POST "https://callxyq.xyz/v1/audio/transcriptions" \
  -H "Authorization: Bearer sk-xxxx" \
  -F file="@audio.mp3" \
  -F model="whisper-1" \
  -F language="zh"
python
from openai import OpenAI

client = OpenAI(
    api_key="sk-xxxx",
    base_url="https://callxyq.xyz/v1"
)

with open("audio.mp3", "rb") as f:
    response = client.audio.transcriptions.create(
        model="whisper-1",
        file=f,
        language="zh"
    )

print(response.text)
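
The cURL example uploads the file as multipart/form-data via `-F`. For environments without the OpenAI SDK, the same request can be assembled with the Python standard library. This is a sketch under assumptions: `encode_multipart` and `build_transcription_request` are hypothetical helpers, and sending the file part as `application/octet-stream` assumes the server identifies the audio type from the file name.

```python
import io
import urllib.request
import uuid

API_URL = "https://callxyq.xyz/v1/audio/transcriptions"


def encode_multipart(fields, filename, file_bytes):
    """Encode text fields plus one file part as multipart/form-data.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    buf = io.BytesIO()
    for name, value in fields.items():
        buf.write(f"--{boundary}\r\n".encode())
        buf.write(f'Content-Disposition: form-data; name="{name}"\r\n\r\n'.encode())
        buf.write(f"{value}\r\n".encode())
    # The audio goes in the part named "file", matching `-F file=@audio.mp3`.
    buf.write(f"--{boundary}\r\n".encode())
    buf.write(
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode()
    )
    buf.write(b"Content-Type: application/octet-stream\r\n\r\n")
    buf.write(file_bytes)
    buf.write(f"\r\n--{boundary}--\r\n".encode())
    return buf.getvalue(), f"multipart/form-data; boundary={boundary}"


def build_transcription_request(api_key, filename, file_bytes,
                                model="whisper-1", language=None):
    """Build (but do not send) the POST request for the transcription endpoint."""
    fields = {"model": model}
    if language:
        fields["language"] = language
    body, content_type = encode_multipart(fields, filename, file_bytes)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": content_type,
        },
    )
```

Passing the returned object to `urllib.request.urlopen` sends the request; the response body can then be decoded as JSON.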

Response

json
{
  "text": "你好,欢迎使用 callxyq。"
}
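
Because `response_format` changes the shape of the response body, client code may want one place that reduces any of the four formats to plain text. The sketch below is illustrative only: `transcript_text` is a hypothetical helper, JSON is assumed to be the format returned when `response_format` is omitted (as in the example response above), and the srt/vtt branch relies on the general cue layout of those subtitle formats rather than on anything specific to this API.

```python
import json
import re

# Cue timing lines, e.g. "00:00:01,000 --> 00:00:02,500" (SRT)
# or "00:00:01.000 --> 00:00:02.500" (WebVTT).
_TIMING = re.compile(r"^(\d{2}:)?\d{2}:\d{2}[.,]\d{3}\s+-->\s+")


def transcript_text(body: str, response_format: str = "json") -> str:
    """Return the plain transcript contained in a response body."""
    if response_format == "json":
        return json.loads(body)["text"]
    if response_format == "text":
        return body.strip()
    if response_format in ("srt", "vtt"):
        kept = []
        for line in body.splitlines():
            stripped = line.strip()
            # Skip blanks, the WebVTT header, cue numbers, and timing lines.
            # Caveat: a spoken line made only of digits is dropped as well.
            if (not stripped
                    or stripped.startswith("WEBVTT")
                    or stripped.isdigit()
                    or _TIMING.match(stripped)):
                continue
            kept.append(stripped)
        return " ".join(kept)
    raise ValueError(f"unsupported response_format: {response_format!r}")


if __name__ == "__main__":
    print(transcript_text('{"text": "你好,欢迎使用 callxyq。"}'))
```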

