Create Pronunciation

POST https://app.resemble.ai/api/v2/pronunciations

Upload a single pronunciation with a reference audio file.

Request Body (multipart/form-data)

FieldTypeRequiredDescription
wordstringYesThe word or phrase (2-100 characters). Letters, accented characters, apostrophes, hyphens, and spaces only.
audiofileYesReference audio file (wav, flac, mp3, m4a, ogg, webm, aac). Duration: 200ms-10s. Max size: 10MB.
$curl -X POST https://app.resemble.ai/api/v2/pronunciations \
> -H "Authorization: Bearer YOUR_API_KEY" \
> -F "word=abemaciclib" \
> -F "audio=@abemaciclib.wav"

Response (201 Created)

1{
2 "success": true,
3 "item": {
4 "uuid": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
5 "word": "abemaciclib",
6 "status": "pending",
7 "active": true,
8 "audio_url": "https://...",
9 "created_at": "2026-03-12T00:00:00.000Z",
10 "updated_at": "2026-03-12T00:00:00.000Z"
11 }
12}

The pronunciation starts with status: "pending" while the audio is being processed. Once processing completes, the status transitions to "ready" (usually within a few seconds) or "failed" if something went wrong. Only pronunciations with status "ready" are applied during synthesis.