For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Status
OverviewProductsManageAPI ReferenceTutorialsClient Libraries
OverviewProductsManageAPI ReferenceTutorialsClient Libraries
  • Voice Generation
    • Speech-to-Speech
  • Voice Creation
  • Voice Tools
      • Overview
      • Create Transcript Job
      • Get Transcript Job
      • List Transcript Jobs
      • Intelligence Queries
  • Real-Time Agents
    • Overview
  • Safety & Detection
Status
LogoLogo
On this page
  • Key Capabilities
  • Workflow
  • Access Requirements
  • Quick Start
Voice ToolsSpeech to Text

Speech-to-Text

Was this page helpful?
Previous

Create Transcript Job

Next
Built with

Transcribe audio or video and ask follow-up questions using Intelligence queries.

Key Capabilities

  • Accepts file uploads, signed tokens, or remote URLs
  • Returns transcripts with speaker labels and word-level timestamps
  • Supports Intelligence queries for summaries and insights
  • Handles files up to 500 MB and 20 minutes in duration

Workflow

  1. Create a job – Upload content and optionally include an initial Intelligence query.
  2. Check status – Poll the job or list active submissions.
  3. Retrieve results – Fetch the completed transcript and Intelligence answers.
  4. Run additional queries – Ask further questions using the job UUID.

Access Requirements

RequirementDetails
Team scopeOperates within the authenticated user’s team.
AuthenticationResemble session or API token.

Quick Start

$curl --request POST 'https://app.resemble.ai/api/v2/speech-to-text' \
> -H 'Authorization: Bearer YOUR_API_TOKEN' \
> -H 'Content-Type: multipart/form-data' \
> -F 'file=@/path/to/audio.mp3' \
> -F 'query=What are the key points discussed?'

Platform-wide rate limits apply. Contact support for higher throughput or dedicated capacity.