Text-to-Speech

Turn text into natural, production-ready speech. Resemble supports multiple synthesis modes tuned for different latency and integration needs.

Synthesis Modes

Synchronous

Request-based synthesis that returns a complete audio file in a single response.

Best suited for:

  • Alerts and notifications
  • Short-form content
  • Workflows that require the entire clip before progressing

Streaming over HTTP

Receive audio chunks progressively via chunked HTTP responses.

Best suited for:

  • Longer scripts
  • Progressive playback experiences
  • Reducing perceived latency without persistent sockets
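Progressive playback comes down to handing each chunk to the player as soon as it arrives instead of waiting for the whole body. The sketch below illustrates that pattern only: it assumes the streamed response is exposed as a file-like object with a `read()` method, and it uses an in-memory buffer in place of a real HTTP response. The function names, the chunk size, and the stand-in data are all hypothetical and are not part of the Resemble API.

```python
import io
from typing import BinaryIO, Iterator


def iter_audio_chunks(body: BinaryIO, chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield successive chunks from a streamed body until it is exhausted."""
    while True:
        chunk = body.read(chunk_size)
        if not chunk:  # an empty read means the stream has ended
            break
        yield chunk


def play_progressively(body: BinaryIO, sink: BinaryIO, chunk_size: int = 4096) -> int:
    """Forward each chunk to the sink as it arrives; return total bytes written."""
    total = 0
    for chunk in iter_audio_chunks(body, chunk_size):
        sink.write(chunk)  # a real player would start decoding here
        total += len(chunk)
    return total


# Stand-in for a streamed HTTP response body (assumption: 10,000 bytes of audio).
fake_body = io.BytesIO(b"\x00" * 10000)
sink = io.BytesIO()
total_bytes = play_progressively(fake_body, sink)
```

Because the sink receives data chunk by chunk, playback can begin after the first read rather than after the last, which is where the reduction in perceived latency comes from.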

Streaming over WebSocket

Maintain a persistent WebSocket connection to receive the lowest-latency audio stream, with metadata attached to each chunk.

Best suited for:

  • Conversational agents
  • Interactive assistants
  • Real-time media experiences where milliseconds matter (Business plan and above)

Next Steps

  1. Generate an API token from the dashboard.
  2. Pick the synthesis mode that fits your UX.
  3. Follow the dedicated page for request/response formats and implementation tips.
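Step 2 can be sketched as a small decision helper that encodes the "best suited for" guidance above. This is one possible reading of that guidance, not an official rule set: the enum values, the function name, and the three boolean inputs are hypothetical, and `websocket_available` is assumed to cover both plan access and the ability to keep a socket open.

```python
from enum import Enum


class SynthesisMode(Enum):
    # Labels are illustrative only, not API identifiers.
    SYNCHRONOUS = "synchronous"
    HTTP_STREAMING = "streaming-http"
    WEBSOCKET_STREAMING = "streaming-websocket"


def pick_mode(
    needs_full_clip: bool,
    interactive: bool,
    websocket_available: bool,
    short_text: bool = False,
) -> SynthesisMode:
    """Choose a synthesis mode from coarse UX requirements."""
    # Workflows that need the entire clip before progressing, and short-form
    # content, fit the single-response mode.
    if needs_full_clip or short_text:
        return SynthesisMode.SYNCHRONOUS
    # Conversational or real-time experiences benefit from a persistent socket,
    # provided one can be used.
    if interactive and websocket_available:
        return SynthesisMode.WEBSOCKET_STREAMING
    # Longer scripts and progressive playback without persistent sockets.
    return SynthesisMode.HTTP_STREAMING
```

For example, `pick_mode(needs_full_clip=False, interactive=True, websocket_available=False)` falls back to HTTP streaming, since an interactive experience without socket access still benefits from progressive delivery.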