Streaming text-to-speech synthesis (HTTP)

Stream audio as it's generated. Returns chunked WAV data for progressive playback.

Authentication

AuthorizationBearer
API token from https://app.resemble.ai/account/api

Request

This endpoint expects an object.
voice_uuidstringRequired
Voice UUID to use for synthesis
datastringRequired

Text or SSML to synthesize (max 2000 characters)

project_uuidstringOptional
Optional project UUID to store the clip
precisionenumOptionalDefaults to PCM_32
Audio precision
Allowed values:
sample_rateenumOptional
Audio sample rate in Hz
use_hdbooleanOptionalDefaults to false

Enable HD synthesis with small latency trade-off

Response

Streaming audio response (chunked WAV)

Errors