Speech-to-Text
Speech-to-Text
Transcribe audio or video and ask follow-up questions using Intelligence queries.
Key Capabilities
- Accepts file uploads, signed tokens, or remote URLs
- Returns transcripts with speaker labels and word-level timestamps
- Supports Intelligence queries for summaries and insights
- Delivers results to your server via webhook callbacks (
callback_url) - Offers a zero-retention mode that permanently deletes all media and transcript content after delivery
- Handles files up to 500 MB and 20 minutes in duration
Workflow
- Create a job – Upload content and optionally include an initial Intelligence query.
- Check status – Poll the job or list active submissions, or pass a
callback_urlto be notified. - Retrieve results – Fetch the completed transcript and Intelligence answers.
- Run additional queries – Ask further questions using the job UUID.
Zero Retention
For privacy-sensitive workloads, pass zero_retention_mode=true (with a mandatory callback_url): the uploaded media and any temporary processing copies are permanently deleted as soon as transcription finishes, the transcript is delivered once to your callback, and the content is then purged from Resemble entirely — only a content-free audit stub remains. Zero retention is a plan feature; see Create Transcript Job for details.
Access Requirements
Quick Start
Platform-wide rate limits apply. Contact support for higher throughput or dedicated capacity.
