Speech-to-Text (STT)

curl --request POST \
  --url https://api.clovastudio.go.kr/v1/audio/transcriptions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form file=@example-file \
  --form model=whisper-large-v3 \
  --form response_format=json

{
  "model": "whisper-large-v3",
  "text": "민원 처리 현황 보고드립니다. 금일 접수된 민원 건수는 총 15건이며, 이중 긴급 민원은 3건입니다.",
  "usage": {
    "audio_duration_ms": 6800
  }
}

POST /v1/audio/transcriptions

Authorizations

Authorization (string, header, required)

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
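
As a minimal sketch of the header described above, the helper below formats a token as a Bearer credential, optionally reading it from an environment variable. The function name `bearer_headers` and the variable name `CLOVA_STUDIO_TOKEN` are assumptions made for illustration; they are not defined by the API.

```python
import os


def bearer_headers(token=None, env_var="CLOVA_STUDIO_TOKEN"):
    """Return the Authorization header as a dict.

    env_var is a hypothetical variable name chosen for this sketch.
    """
    # Fall back to the environment only when no token is passed explicitly.
    token = (token or os.environ.get(env_var, "")).strip()
    if not token:
        raise ValueError("no auth token supplied")
    return {"Authorization": f"Bearer {token}"}
```

Keeping the token out of source code and passing it in at call time avoids committing credentials by accident.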

Body

multipart/form-data

file (required)

Audio file to transcribe. Supported formats: .MP3, .WAV, .M4A, .FLAC, .OGG, .WEBM, .AAC, .WMA, .OPUS.

model (string, default: whisper-large-v3)

STT model (whisper-large-v3).

Example: "whisper-large-v3"

response_format (string, default: json)

Response format (text, json, verbose_json).

Examples: "json", "text", "verbose_json"
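
The body fields above can be assembled into a multipart/form-data request without third-party libraries. The following is a sketch using only the Python standard library, not an official client: the helper name `build_transcription_request` and the client-side checks on file extension and `response_format` are assumptions layered on the documented values. It builds the request without sending it.

```python
import mimetypes
import urllib.request
import uuid
from pathlib import Path

API_URL = "https://api.clovastudio.go.kr/v1/audio/transcriptions"
# Values taken from the field descriptions above.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg",
                        ".webm", ".aac", ".wma", ".opus"}
RESPONSE_FORMATS = {"text", "json", "verbose_json"}


def build_transcription_request(filename, audio_bytes, token,
                                model="whisper-large-v3",
                                response_format="json"):
    """Build (but do not send) the multipart POST request."""
    if Path(filename).suffix.lower() not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported audio extension: {filename}")
    if response_format not in RESPONSE_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format}")

    boundary = uuid.uuid4().hex
    file_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"

    parts = []
    # Plain text form fields.
    for name, value in (("model", model), ("response_format", response_format)):
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode()
        )
    # The audio file part (assumes a simple ASCII file name).
    parts.append(
        (f'--{boundary}\r\n'
         f'Content-Disposition: form-data; name="file"; '
         f'filename="{Path(filename).name}"\r\n'
         f'Content-Type: {file_type}\r\n\r\n').encode()
        + audio_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())

    return urllib.request.Request(
        API_URL,
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To send it, pass the returned object to `urllib.request.urlopen(...)` and read the response body. Validating the extension and format locally only saves a round trip; the server remains the authority on what it accepts.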

Response

Success: returns the transcribed text.

model (string, required)

Model used.

text (string, required)

Transcribed text.

duration (number | null)

Audio duration in seconds.

usage (object | null)

API usage information (STT usage).
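
Because `duration` and `usage` may be null, client code should not assume they are present. The sketch below parses a JSON response into a small dataclass. The names `Transcription` and `parse_transcription` are hypothetical, and falling back to `usage.audio_duration_ms` (a field that appears in the sample response) when `duration` is missing is a design choice of this example, not documented behavior.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class Transcription:
    model: str
    text: str
    duration: Optional[float] = None  # seconds; may be null or absent
    usage: Optional[dict] = None      # may be null or absent

    def duration_seconds(self):
        """Prefer `duration`; otherwise derive it from usage.audio_duration_ms."""
        if self.duration is not None:
            return float(self.duration)
        ms = (self.usage or {}).get("audio_duration_ms")
        return ms / 1000 if ms is not None else None


def parse_transcription(raw):
    """Parse a JSON response body (str or bytes) into a Transcription."""
    data = json.loads(raw)
    # model and text are required; a KeyError here signals a malformed response.
    return Transcription(
        model=data["model"],
        text=data["text"],
        duration=data.get("duration"),
        usage=data.get("usage"),
    )
```

This applies to the json format; a request made with response_format=text may not return a JSON body, so check the format before parsing.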
