Agents

The Agents API provides a comprehensive interface for creating and managing voice AI agents on the Resemble AI platform. Voice agents combine automatic speech recognition (ASR), text-to-speech (TTS), large language models (LLM), and turn-taking capabilities to power interactive voice conversations.

What are Voice Agents?

Voice agents are AI-powered conversational systems that can:

Listen: Use ASR to transcribe user speech in real-time
Think: Process conversations using LLMs with custom prompts and tools
Speak: Generate natural responses using Resemble’s TTS voices
Take Turns: Manage conversation flow with intelligent turn-taking

Key Features

Configurable Components

Each agent consists of four main components:

ASR Configuration: Speech recognition settings including provider, model, and audio format
TTS Configuration: Voice selection for speech synthesis
LLM Configuration: Language model settings including prompt, provider, model, and temperature
Turn Configuration: Conversation flow settings including timeouts and detection modes

Dynamic Variables

Agents support dynamic variables for runtime context:

Populate variables via pre-call webhooks
Reference variables in prompts and tools using {{variable_name}} syntax
Update variables during conversations using tool assignments
Maximum 64 characters for keys, 2048 characters for values

Extensibility

Extend agent capabilities with:

Webhook Tools: Call external APIs during conversations
Client Tools: Trigger client-side actions
System Tools: Use platform-provided functionality (end call, transfer, voicemail detection)
Knowledge Base: Enable RAG (Retrieval Augmented Generation) for context-aware responses
Pre-Call Webhooks: Fetch data before conversations start
Post-Call Webhooks: Log analytics after conversations end

Phone Integration

Connect agents to phone numbers for:

Inbound Calls: Answer incoming calls automatically
Outbound Calls: Make calls programmatically
Twilio Integration: Automatic trunk configuration and management

Getting Started

To create a voice agent, you’ll need:

A Resemble voice (get via Voices API)
An API token for authentication
Optionally, a phone number for inbound/outbound calling

API Resources

The Agents API is organized into five main resource types:

Core agent management including creation, configuration, and lifecycle operations.

Phone Numbers

Manage phone numbers for inbound and outbound calling.

Agent Tools

Configure tools that extend agent capabilities.

Agent Webhooks

Set up webhooks for pre-call and post-call integrations.

Agent Knowledge Base

Enable RAG support by attaching knowledge items to agents.

Base URL

https://app.resemble.ai/api/v2

Authentication

All requests require Bearer token authentication using your API key:

$ Authorization: Bearer YOUR_API_TOKEN

Get your API key from https://app.resemble.ai/account/api