Agents

The Agents API provides a comprehensive interface for creating and managing voice AI agents on the Resemble AI platform. Voice agents combine automatic speech recognition (ASR), text-to-speech (TTS), large language models (LLM), and turn-taking capabilities to power interactive voice conversations.

What are Voice Agents?

Voice agents are AI-powered conversational systems that can:

  • Listen: Use ASR to transcribe user speech in real-time
  • Think: Process conversations using LLMs with custom prompts and tools
  • Speak: Generate natural responses using Resemble’s TTS voices
  • Take Turns: Manage conversation flow with intelligent turn-taking

Key Features

Configurable Components

Each agent consists of four main components:

  • ASR Configuration: Speech recognition settings including provider, model, and audio format
  • TTS Configuration: Voice selection for speech synthesis
  • LLM Configuration: Language model settings including prompt, provider, model, and temperature
  • Turn Configuration: Conversation flow settings including timeouts and detection modes

Dynamic Variables

Agents support dynamic variables for runtime context:

  • Populate variables via pre-call webhooks
  • Reference variables in prompts and tools using {{variable_name}} syntax
  • Update variables during conversations using tool assignments
  • Maximum 64 characters for keys, 2048 characters for values

Extensibility

Extend agent capabilities with:

  • Webhook Tools: Call external APIs during conversations
  • Client Tools: Trigger client-side actions
  • System Tools: Use platform-provided functionality (end call, transfer, voicemail detection)
  • Knowledge Base: Enable RAG (Retrieval Augmented Generation) for context-aware responses
  • Pre-Call Webhooks: Fetch data before conversations start
  • Post-Call Webhooks: Log analytics after conversations end

Phone Integration

Connect agents to phone numbers for:

  • Inbound Calls: Answer incoming calls automatically
  • Outbound Calls: Make calls programmatically
  • Twilio Integration: Automatic trunk configuration and management

Getting Started

To create a voice agent, you’ll need:

  1. A Resemble voice (get via Voices API)
  2. An API token for authentication
  3. Optionally, a phone number for inbound/outbound calling

API Resources

The Agents API is organized into five main resource types:

Agents

Core agent management including creation, configuration, and lifecycle operations.

Phone Numbers

Manage phone numbers for inbound and outbound calling.

Agent Tools

Configure tools that extend agent capabilities.

Agent Webhooks

Set up webhooks for pre-call and post-call integrations.

Agent Knowledge Base

Enable RAG support by attaching knowledge items to agents.

Base URL

https://app.resemble.ai/api/v2

Authentication

All requests require Bearer token authentication using your API key:

$Authorization: Bearer YOUR_API_TOKEN

Get your API key from https://app.resemble.ai/account/api