LiveKit Voice Agent Prompt Builder
Create production-ready prompts for LiveKit voice agents that work seamlessly with text-to-speech systems.
Quick Start
For most use cases, follow this simple workflow:
- •Choose a template from
assets/templates/that matches your use case - •Customize the template with your specific information
- •Validate using
scripts/validate_prompt.py - •Test with your LiveKit agent
- •Refine based on real conversations
Workflow
Step 1: Determine Your Agent Type
Select the template that best matches your needs:
General Purpose:
- •
Basic Assistant (
basic-assistant.txt) - Simple conversational agent- •Use for: General chat, demos, minimal functionality
- •Complexity: Low
- •Tools: Optional
- •
Tool-Enabled Agent (
tool-enabled-agent.txt) - Agent with function calling- •Use for: Weather, search, data lookup, single-domain assistants
- •Complexity: Medium
- •Tools: 1-5 tools
- •
Customer Service (
customer-service.txt) - Support agent with escalation- •Use for: Help desks, support lines, troubleshooting
- •Complexity: Medium
- •Tools: Lookup, search, escalation
- •
Specialist Agent (
specialist-agent.txt) - Task-specific agent (reservations, orders, etc.)- •Use for: Booking systems, order taking, appointments
- •Complexity: Medium-High
- •Tools: Action tools (create, update)
- •
Multi-Agent Coordinator (
multi-agent-coordinator.txt) - Routes to other agents- •Use for: Main entry point in multi-agent systems
- •Complexity: Medium
- •Tools: Transfer/handoff tools
Industry-Specific:
- •
Front Desk Receptionist (
front-desk-receptionist.txt) - Professional call routing- •Use for: Office reception, call centers, front desk operations
- •Complexity: Low-Medium
- •Tools: Transfer, routing, message taking
- •
Debt Collection (
debt-collection-agent.txt) - Compliant recovery agent- •Use for: Debt recovery, collections, payment arrangements
- •Complexity: High
- •Tools: Payment processing, sentiment analysis
- •Note: Includes FDCPA/TCPA/CFPB compliance requirements
Step 2: Customize Your Template
Open your selected template and fill in the placeholders:
Example - Customizing basic-assistant.txt:
Template:
You are [NAME], a friendly voice assistant. You are [TRAIT_1] and [TRAIT_2], with [TRAIT_3].
Customized:
You are Casey, a friendly voice assistant. You are helpful and patient, with a sense of humor.
Key customization areas:
- •Identity (name, role, personality)
- •Domain context (business info, hours, policies)
- •Tool descriptions (when and how to use each tool)
- •Workflow steps (information collection sequence)
- •Boundaries (what agent can't do)
Step 3: Apply Voice Optimization
Ensure your prompt includes these critical voice optimization rules:
Required elements:
Keep your responses brief - 1-3 sentences. Respond in plain text only - no emojis, asterisks, markdown, or special formatting. Ask one question at a time.
For agents that mention numbers, prices, or contact info:
Spell out numbers ("fifteen" not "15"). Spell phone numbers digit by digit ("5-5-5, 1-2-3-4"). For URLs, omit "https://" and say "dot" for periods.
For agents with tools:
If a tool call fails, explain the issue simply and suggest a fallback. Summarize tool results in conversational language - don't recite technical IDs or raw data.
Step 4: Validate Your Prompt
Run the validation script to check for common issues:
python scripts/validate_prompt.py your_prompt.txt
The validator checks for:
- •Missing identity statement
- •Lack of voice optimization rules
- •TTS antipatterns (emojis, markdown, special characters)
- •Excessive length (>500 words)
- •Missing tool error handling
- •Number formatting guidance
Fix any errors before deploying your prompt.
Step 5: Test with LiveKit
Deploy your prompt in a LiveKit agent and test with real conversations:
Test scenarios:
- •Happy path - User provides information smoothly
- •Interruptions - User changes topic or interrupts
- •Tool failures - Simulate tool errors
- •Edge cases - Ambiguous requests, out-of-scope questions
- •Voice quality - Listen to TTS output for formatting issues
Step 6: Refine
Based on testing, refine your prompt:
Common refinements:
- •Add specific guidance for observed failure modes
- •Adjust tone or personality descriptors
- •Add examples for complex tool usage
- •Tighten response length constraints if agent is too verbose
- •Add boundaries for out-of-scope requests
Key Concepts
Voice-First Design
Voice agents differ from text chatbots:
Critical differences:
- •No visual formatting (users can't see bold, lists, etc.)
- •Limited user patience (long responses cause frustration)
- •TTS limitations (special characters cause problems)
- •No "scroll back" (users can't review previous responses)
Implications for prompts:
- •Always specify plain text only
- •Enforce brevity (1-3 sentences typical)
- •Spell out numbers and abbreviations
- •Ask one question at a time
- •Avoid complex structures (lists, tables)
The Three Essential Sections
Every voice agent prompt should have:
- •Identity - "You are [name/role]. You are [personality]."
- •Voice Rules - Plain text, brief responses, number formatting
- •Core Functionality - What the agent does, tools it uses, workflows it follows
Additional sections (domain context, guardrails, handoffs) add as needed.
Template vs. Custom Prompts
Use templates when:
- •Starting a new agent
- •Following common patterns (support, booking, etc.)
- •Want quick setup with best practices
Write custom when:
- •Unique use case not covered by templates
- •Very specialized domain
- •Complex multi-step workflows
Even with custom prompts, reference the templates for voice optimization patterns.
Progressive Disclosure: When to Load References
Load reference documentation based on your specific needs:
For all users:
- •Start with templates and this SKILL.md
Load prompt-components.md when:
- •Building complex prompts from scratch
- •Need detailed guidance on specific sections
- •Want to understand the "why" behind best practices
- •Customizing beyond simple template fill-in
Load voice-optimization.md when:
- •Encountering TTS formatting issues
- •Agent responses sound awkward or unnatural
- •Need deep understanding of voice-specific constraints
- •Working with multiple TTS providers
- •Debugging number, URL, or email pronunciation
Load tool-integration.md when:
- •Integrating function calling/tools
- •Building multi-agent systems with handoffs
- •Need patterns for error handling or parameter collection
- •Tools aren't being used correctly
- •Working with RAG or external data sources
Load examples.md when:
- •Want to see complete, real-world prompts
- •Need inspiration for similar use cases
- •Comparing complexity levels
- •Learning from production patterns
Load industry-best-practices.md when:
- •Building agents for regulated industries (healthcare, banking, debt collection)
- •Need compliance guidance (HIPAA, FDCPA, TCPA, etc.)
- •Working on industry-specific use cases (front desk, customer service, sales)
- •Require specialized tone and approach patterns
- •Need security or privacy considerations
Load iterative-improvement.md when:
- •Improving existing prompts based on feedback
- •Analyzing test results and conversation data
- •Conducting A/B tests on prompt variations
- •Establishing metrics and measurement frameworks
- •Need systematic approach to prompt refinement
- •Want to preserve prompt structure while making improvements
Strategy: Start minimal (templates + SKILL.md). Load references only when you hit specific challenges or need deeper guidance.
Multi-Agent Systems
For systems with multiple agents, design each agent separately:
Pattern:
- •
Coordinator/Greeter - Uses
multi-agent-coordinator.txt- •Simple routing logic
- •Minimal conversation
- •Quick transfers
- •
Specialist Agents - Use
specialist-agent.txt- •Each handles one specific task
- •Collects required information
- •Takes action or transfers to next agent
Key considerations:
- •Each agent should have a single, clear responsibility
- •Define handoff criteria explicitly
- •Decide what context to pass between agents (full chat history vs structured data)
- •Test agent transitions for smoothness
Common Patterns
Pattern 1: Simple Conversational Agent
Template: basic-assistant.txt
Use for: Chat, demos, general assistance
Prompt structure:
- •Identity + personality
- •Voice optimization rules
- •Optional: specific topics or capabilities
Example use cases:
- •Demo voice assistant
- •Companion chatbot
- •General information assistant
Pattern 2: Information Retrieval Agent
Template: tool-enabled-agent.txt
Use for: Looking up data, answering questions from external sources
Prompt structure:
- •Identity + domain
- •Voice optimization
- •Tool descriptions (when to use each tool)
- •Result summarization guidance
Example use cases:
- •Weather agent
- •Product lookup
- •Knowledge base search
- •RAG-enabled assistants
Pattern 3: Action-Taking Agent
Template: specialist-agent.txt
Use for: Booking, ordering, scheduling, transactions
Prompt structure:
- •Identity + role
- •Voice optimization
- •Information collection sequence
- •Confirmation before action
- •Tool usage for state changes
Example use cases:
- •Reservation agent
- •Order taking
- •Appointment scheduling
- •Registration
Pattern 4: Support Agent with Escalation
Template: customer-service.txt
Use for: Help desk, customer support, troubleshooting
Prompt structure:
- •Identity + tone (empathetic, patient)
- •Voice optimization
- •Capabilities and tools
- •Clear escalation criteria
- •Error handling and fallbacks
Example use cases:
- •Customer support
- •Technical troubleshooting
- •Account assistance
- •Order status lookup
Troubleshooting
Issue: Agent uses markdown or special formatting
Symptoms: TTS says "asterisk asterisk hello" or skips characters
Solution: Ensure prompt includes:
Respond in plain text only - no emojis, asterisks, markdown, or special formatting.
Prevention: Run validation script
Issue: Responses too long
Symptoms: Users interrupt frequently, seem impatient
Solution: Add stricter constraints:
Keep responses very brief - 1-2 sentences maximum. Ask one question at a time.
Prevention: Test with real users early
Issue: Numbers sound awkward
Symptoms: "Dollar sign forty-nine point nine nine" or unclear prices/phone numbers
Solution: Add number formatting rules:
Spell out numbers ("forty-nine ninety-nine" for prices). Spell phone numbers digit by digit ("5-5-5, 1-2-3-4").
Issue: Agent doesn't use tools correctly
Symptoms: Tools called at wrong time, missing parameters, no error handling
Solution:
- •Add explicit tool usage instructions in prompt
- •Specify when to use each tool
- •Define parameter collection flow
- •Add error handling guidance
Check: Load tool-integration.md for detailed patterns
Issue: Agent goes off-topic or out-of-scope
Symptoms: Agent tries to handle requests beyond capabilities
Solution: Add boundaries and guardrails:
You can only help with [specific domain]. For questions about [out-of-scope topics], direct users to [alternative resource].
Issue: Multi-agent transfers feel awkward
Symptoms: Abrupt handoffs, lost context, repeated questions
Solution:
- •Add transition messages before transfer
- •Define what context passes between agents
- •Ensure previous agent collects minimum required info
Check: See multi-agent examples in examples.md
Advanced Topics
Emotional Expression
For agents that should convey emotion, use descriptive language:
Good:
When users share good news, respond warmly and enthusiastically. When they express frustration, respond with empathy and patience.
Avoid: Trying to use emojis, asterisks, or formatting to convey emotion.
Trust TTS: Modern TTS systems convey emotion through intonation based on word choice.
Multiple TTS Providers
Different TTS providers have different characteristics:
- •OpenAI TTS - Natural, handles most punctuation well
- •Cartesia Sonic - Fast, low-latency, clear pronunciation
- •Deepgram - Strong with accents and dialects
- •ElevenLabs - Highly expressive, personality-driven
Universal best practices work across all providers:
- •Plain text only
- •Brief responses
- •Spell out numbers
- •Avoid special characters
Test your specific provider to identify any quirks.
Handling Sensitive Contexts
For healthcare, finance, or other regulated industries:
Add explicit constraints:
Never: - Share customer information without verification - Provide medical/legal/financial advice - Process transactions without explicit confirmation
Be security-conscious:
For security, you'll need to verify identity before providing account information.
Escalate appropriately:
For [sensitive issue], immediately transfer to [appropriate specialist].
See healthcare and banking examples in examples.md.
Best Practices Summary
- •Start with templates - They encode best practices
- •Keep prompts concise - Under 300 words when possible
- •Always include voice rules - Plain text, brevity, number formatting
- •Test with real TTS - Listen to actual output
- •Validate before deploying - Use the validation script
- •One responsibility per agent - In multi-agent systems
- •Iterate based on real usage - Refine after testing
- •Error handling is critical - Always include fallback guidance
- •Confirm before actions - Especially for state changes
- •Boundaries prevent confusion - Clearly define scope
Resources
Templates:
General Purpose:
- •
assets/templates/basic-assistant.txt- Simple conversational agent - •
assets/templates/tool-enabled-agent.txt- Agent with function calling - •
assets/templates/customer-service.txt- Support with escalation - •
assets/templates/specialist-agent.txt- Task-specific workflows - •
assets/templates/multi-agent-coordinator.txt- Router/greeter agent
Industry-Specific:
- •
assets/templates/front-desk-receptionist.txt- Professional call routing and reception - •
assets/templates/debt-collection-agent.txt- Compliant debt recovery (FDCPA/TCPA)
Reference Documentation:
- •
references/prompt-components.md- Detailed component guide - •
references/voice-optimization.md- TTS formatting deep dive - •
references/tool-integration.md- Function calling patterns - •
references/examples.md- Complete real-world prompts - •
references/industry-best-practices.md- Industry-specific patterns and compliance - •
references/iterative-improvement.md- Systematic prompt refinement and testing
Tools:
- •
scripts/validate_prompt.py- Validation script
External Resources:
- •LiveKit Agents Documentation: https://docs.livekit.io/agents/
- •LiveKit Prompting Guide: https://docs.livekit.io/agents/build/prompting/
- •LiveKit Examples: https://github.com/livekit/agents/tree/main/examples
Example: Complete Workflow
Scenario: Build a pizza ordering agent
Step 1: Choose template
- •Task-specific with order taking →
specialist-agent.txt
Step 2: Customize
You are Marco, an order specialist at Mario's Pizza. You are friendly, efficient, and helpful. Keep responses very brief - 1-2 sentences. Plain text only, no formatting. Ask one question at a time. Our menu: - Margherita: twelve dollars - Pepperoni: fourteen dollars - Veggie Supreme: fifteen dollars Collect in order: 1. Pizza selection - "What pizza would you like?" 2. Size - "What size: small, medium, or large?" 3. Any modifications - "Any special requests or modifications?" 4. Name - "Name for the order?" 5. Phone - "Phone number?" Confirm: "Let me confirm: [size] [pizza] for [name] at [phone]. Total is [price]. Is that correct?" Use create_order(pizza, size, modifications, name, phone) after confirmation. Spell out prices: "fourteen dollars" not "dollar sign fourteen". Spell phone numbers: "5-5-5, 1-2-3-4".
Step 3: Validate
python scripts/validate_prompt.py pizza_prompt.txt
Step 4: Deploy and test with LiveKit
Step 5: Refine based on feedback
- •If users often ask about delivery time → Add delivery info to domain context
- •If tool fails occasionally → Add error handling for out-of-stock items
- •If users interrupt with questions → Add guidance for handling interruptions
Iterative Improvement
Voice agent prompts improve through continuous refinement based on real-world usage:
Quick Improvement Workflow
- •Collect feedback - Gather test results, user feedback, conversation metrics
- •Identify patterns - Look for common issues across multiple conversations
- •Make targeted changes - Modify specific sections while preserving structure
- •Test changes - Validate improvements with test scenarios
- •Measure impact - Compare before/after metrics
- •Iterate - Repeat the cycle
Key Principles
- •Structure preservation - Make minimal, targeted changes to address specific issues
- •Data-driven - Use evidence from tests and conversations, not guesses
- •Version control - Track changes and document reasons
- •A/B testing - When possible, run old and new versions in parallel
- •Regression testing - Ensure changes don't break existing functionality
For detailed guidance: See references/iterative-improvement.md
Industry-Specific Guidance
Different industries require specialized approaches:
Debt Collection:
- •Empathetic but professional tone
- •FDCPA/TCPA/CFPB compliance built-in
- •Sentiment-aware escalation
- •Payment arrangement workflows
Healthcare:
- •HIPAA compliance and privacy protection
- •Emergency detection and routing
- •Professional, caring tone
- •Limited scope (no medical advice)
Banking/Financial:
- •Security-first verification
- •Fraud detection awareness
- •Regulatory compliance
- •Transaction limitations
Customer Service:
- •Empathetic problem-solving
- •Clear escalation criteria
- •Knowledge base integration
- •Omnichannel awareness
Front Desk/Reception:
- •Professional greeting frameworks
- •Efficient call routing
- •Message taking protocols
- •After-hours handling
For detailed patterns: See references/industry-best-practices.md
Getting Started Now
- •Pick the closest template for your use case
- •Copy it and fill in your information
- •Run the validator to catch common issues
- •Test it in your LiveKit agent
- •Collect feedback and iterate based on real usage
- •Come back to this skill when you need to refine or troubleshoot
The templates handle 80% of the work. Focus your customization on domain-specific information, tools, and workflows. Then continuously improve based on real-world performance.