
VoxReception Launch: Building an AI Voice Receptionist for the Middle East

The story behind VoxReception—how I built a bilingual AI receptionist that understands Gulf Arabic dialects and serves thousands of businesses.

Vikas Thakur
The mind behind VoxReception
February 28, 2025
12 min read
Tags: VoxReception, AI, Voice, Product Launch, Arabic
  • Active Customers: 1,047
  • MRR Growth: 147%
  • Call Success Rate: 98.3%
  • Avg Response Time: < 200ms

Four months ago, a hotel owner in Muscat told me: "I lose 30% of bookings because no one answers after hours." That conversation sparked VoxReception.

The Problem Was Clear

Middle Eastern businesses face unique challenges:

  • Language complexity: Customers switch between Arabic and English mid-sentence
  • Dialect variations: Gulf Arabic differs significantly from Egyptian or Levantine
  • Cultural nuances: Formality levels, greeting customs, prayer time considerations
  • Technical barriers: Poor internet, preference for phone over apps

Building the Solution

Weeks 1-2: Research and Validation

I interviewed 50 business owners across Oman, the UAE, and Saudi Arabia. Key findings:

  • 67% of missed calls happen after 6 PM
  • 45% of callers prefer Arabic but switch to English for technical terms
  • Average hold time tolerance: 30 seconds (vs 2 minutes in the US)

Weeks 3-4: Technical Architecture

The stack I chose and why:

  • Twilio: Reliable PSTN connectivity in MENA
  • ElevenLabs: Best Arabic voice synthesis
  • OpenAI Whisper: Superior Arabic transcription
  • Custom LLM: Fine-tuned on Gulf business conversations
  • Supabase: Real-time call logs and analytics

Weeks 5-8: The Hard Parts

Arabic dialect handling was brutal. Here's what worked:

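# Dialect detection pipeline (extract_phonetic_features, classify_dialect,
# and adapt_response_style are internal helpers not shown here)
def detect_dialect(audio_chunk):
    """Classify the caller's dialect and return the matching response style."""
    features = extract_phonetic_features(audio_chunk)  # phonetic cues from raw audio
    dialect = classify_dialect(features)  # e.g. Gulf vs. Egyptian vs. Levantine
    return adapt_response_style(dialect)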

"The first time VoxReception answered in proper Omani dialect, I knew this would change everything." - Mohammed Al-Rashidi, First Customer

Weeks 9-12: Launch and Scale

Signups and customers after launch:

  • Day 1: 10 signups
  • Week 1: 87 signups
  • Month 1: 412 customers
  • Month 3: 1,047 customers

The Technology That Powers It

Voice Processing Pipeline

  1. Audio ingestion via SIP/WebRTC
  2. Real-time transcription in 50ms chunks
  3. Intent classification + entity extraction
  4. Response generation with context
  5. TTS with emotion and emphasis
  6. Target: sub-200ms total latency across all steps
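
One way to reason about the sub-200ms target is as a per-stage latency budget. The sketch below is illustrative only: the stage names mirror the pipeline list, but the millisecond figures and the `check_latency` helper are placeholder assumptions, not measured VoxReception numbers.

```python
from dataclasses import dataclass

# Hypothetical per-stage budgets in milliseconds. Stage names follow the
# pipeline list; the numbers are placeholders, not measurements.
STAGE_BUDGET_MS = {
    "audio_ingestion": 20,
    "transcription": 50,
    "intent_and_entities": 30,
    "response_generation": 60,
    "tts": 35,
}
TOTAL_BUDGET_MS = 200


@dataclass
class BudgetReport:
    total_ms: float
    within_budget: bool
    over_budget_stages: list[str]


def check_latency(measured_ms: dict[str, float]) -> BudgetReport:
    """Compare measured stage timings against the per-stage and total budgets."""
    # A stage with no budget entry is treated as having a budget of 0 ms.
    over = [
        stage
        for stage, elapsed in measured_ms.items()
        if elapsed > STAGE_BUDGET_MS.get(stage, 0)
    ]
    total = sum(measured_ms.values())
    return BudgetReport(total, total < TOTAL_BUDGET_MS, over)
```

A report like this makes it visible when one stage (response generation, for example) overruns its share even though the call as a whole still lands under the total.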

The Secret: Context Windows

Most AI receptionists fail because they lack context. VoxReception maintains:

  • Business knowledge graph
  • Caller history
  • Conversation state
  • Cultural context markers
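
To make these four kinds of context concrete, here is a minimal sketch of how they might be bundled for a single call. The `CallContext` class and all of its field names are hypothetical, and a plain dictionary stands in for the business knowledge graph.

```python
from dataclasses import dataclass, field


@dataclass
class CallContext:
    """Hypothetical per-call context bundle; all field names are illustrative."""
    business_facts: dict[str, str]  # stand-in for the business knowledge graph
    caller_history: list[str] = field(default_factory=list)  # summaries of past calls
    turns: list[tuple[str, str]] = field(default_factory=list)  # (speaker, text) pairs
    cultural_markers: dict[str, str] = field(default_factory=dict)  # e.g. formality
    max_turns: int = 10  # cap on how many turns are retained

    def add_turn(self, speaker: str, text: str) -> None:
        """Append a turn and drop the oldest ones beyond max_turns."""
        self.turns.append((speaker, text))
        excess = len(self.turns) - self.max_turns
        if excess > 0:
            del self.turns[:excess]

    def to_prompt(self) -> str:
        """Flatten the context into text that could be prepended to an LLM prompt."""
        lines = [f"{k}: {v}" for k, v in self.business_facts.items()]
        lines += [f"history: {h}" for h in self.caller_history]
        lines += [f"marker {k}: {v}" for k, v in self.cultural_markers.items()]
        lines += [f"{speaker}: {text}" for speaker, text in self.turns]
        return "\n".join(lines)
```

Capping the retained turns keeps the prompt size bounded on long calls while the business facts and caller history stay available throughout.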

Customer Success Stories

Muscat Grand Hotel: 45% reduction in missed bookings

Al-Fahad Clinic: 24/7 appointment scheduling, 3x appointment volume

Dubai Auto Parts: Handles 500+ daily inventory queries in Arabic/English

"VoxReception doesn't just answer calls—it understands our business." - Fatima Al-Zaabi, Clinic Manager

Challenges and Solutions

Challenge 1: Accent Variations

Solution: Trained on 10,000 hours of Gulf Arabic conversations

Challenge 2: Code-Switching

Solution: Dual-language model with seamless transition detection
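
The post doesn't describe how transition detection works internally. As a rough text-level illustration only, a transcript can be split into Arabic-script and Latin-script runs. The function names below are hypothetical, and a production system would likely work from audio or model outputs rather than character ranges.

```python
import re

# Basic Arabic Unicode block (U+0600 to U+06FF) and ASCII Latin letters.
_ARABIC = re.compile(r"[\u0600-\u06FF]")
_LATIN = re.compile(r"[A-Za-z]")


def token_language(token: str) -> str:
    """Label a token 'ar', 'en', or 'other' by the script of its characters."""
    if _ARABIC.search(token):
        return "ar"
    if _LATIN.search(token):
        return "en"
    return "other"


def split_by_language(transcript: str) -> list[tuple[str, str]]:
    """Group consecutive same-language tokens into (language, text) segments."""
    segments: list[tuple[str, list[str]]] = []
    for token in transcript.split():
        lang = token_language(token)
        if lang == "other" and segments:
            # Digits and punctuation attach to the preceding segment.
            segments[-1][1].append(token)
        elif segments and segments[-1][0] == lang:
            segments[-1][1].append(token)
        else:
            segments.append((lang, [token]))
    return [(lang, " ".join(tokens)) for lang, tokens in segments]
```

Each boundary between segments marks a code-switch point, which is where a dual-language system would need to hand off between languages without losing the thread of the request.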

Challenge 3: Cultural Appropriateness

Solution: Region-specific response templates and greeting patterns

The Business Model

Pricing strategy:

  • Starter: $99/month (100 calls)
  • Professional: $299/month (500 calls)
  • Enterprise: Custom pricing

Why this works:

  • Average missed call cost: $50-200
  • ROI in first month for most customers
  • No hardware, instant setup
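
The ROI claim can be checked with simple arithmetic. This sketch uses only the plan prices and the $50-200 missed-call range quoted here, and assumes (my simplification) that every recovered call is worth the full missed-call cost.

```python
import math

# Plan prices and included calls from the pricing list.
PLANS = {
    "Starter": {"price_usd": 99, "included_calls": 100},
    "Professional": {"price_usd": 299, "included_calls": 500},
}
# Missed-call cost range quoted in the post (USD).
MISSED_CALL_COST_LOW = 50
MISSED_CALL_COST_HIGH = 200


def break_even_calls(price_usd: float, value_per_call_usd: float) -> int:
    """Smallest number of recovered calls whose value covers the monthly price."""
    return math.ceil(price_usd / value_per_call_usd)


for name, plan in PLANS.items():
    worst = break_even_calls(plan["price_usd"], MISSED_CALL_COST_LOW)
    best = break_even_calls(plan["price_usd"], MISSED_CALL_COST_HIGH)
    per_call = plan["price_usd"] / plan["included_calls"]
    print(f"{name}: break-even at {best}-{worst} recovered calls, "
          f"${per_call:.2f} per included call")
```

Under these assumptions the Starter plan pays for itself after one or two recovered calls, and the Professional plan after two to six.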

Growth Metrics

Month-over-month:

  • MRR Growth: 147%
  • Churn Rate: 2.1%
  • NPS Score: 72
  • Customer LTV: $4,200

Technical Innovations

Three innovations I'm most proud of:

  1. Adaptive Personality Engine: VoxReception adjusts formality based on caller cues
  2. Prayer Time Handling: Automatic message adjustment during prayer times
  3. Multilingual SMS Follow-up: Sends confirmations in the caller's detected preferred language
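
As an illustration of the prayer time handling idea, the sketch below switches the greeting when a call arrives shortly after a prayer start time. Everything here is an assumption for the example: the 20-minute window, the function names, and the message wording. Prayer start times are passed in by the caller because they vary by date and location.

```python
from datetime import datetime, timedelta

# Assumed length of the window after each prayer start during which the
# message is adjusted; the real value is not stated in the post.
PRAYER_WINDOW = timedelta(minutes=20)


def in_prayer_window(now: datetime, prayer_starts: list[datetime]) -> bool:
    """True if `now` falls within PRAYER_WINDOW after any prayer start.

    All datetimes are expected to be in the same (local) timezone.
    """
    return any(start <= now < start + PRAYER_WINDOW for start in prayer_starts)


def pick_message(now: datetime, prayer_starts: list[datetime]) -> str:
    """Choose between the regular greeting and a prayer-time variant."""
    if in_prayer_window(now, prayer_starts):
        return "Our staff are currently at prayer. I can take a message or book a callback."
    return "Thank you for calling. How can I help you today?"
```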

Lessons Learned

  1. Localization is everything: Generic solutions don't work in MENA
  2. Voice quality matters more than features: We invested heavily in natural-sounding Arabic
  3. Onboarding determines success: We call every customer personally
  4. Support in Arabic is non-negotiable: We built an Arabic-first support team

What's Next

The roadmap:

  • Q2 2025: WhatsApp integration
  • Q3 2025: Egyptian and Levantine dialects
  • Q4 2025: AI-powered sales coaching
  • 2026: Expansion to APAC

The Impact

Beyond metrics, VoxReception is:

  • Helping businesses serve customers in their preferred language
  • Enabling 24/7 availability for SMBs
  • Creating jobs (we've hired 12 people locally)
  • Building AI capability in the Middle East

"This isn't just a product—it's infrastructure for the region's digital transformation." - Regional tech journalist

Technical Deep-Dive

For the engineers reading, here's how we handle the trickiest part—real-time dialect adaptation:

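// Sketch: ConversationContext and the helper methods (transcribe, detectDialect,
// classifyIntent, generateResponse, synthesize, loadBusinessRules) are defined
// elsewhere and not shown here.
class DialectAdapter {
  // Rolling history of prior turns; initialized so the first call sees an empty context
  private contextWindow: ConversationContext[] = [];

  async processUtterance(audio: AudioBuffer): Promise<Response> {
    const transcript = await this.transcribe(audio);
    const dialect = await this.detectDialect(transcript);
    const intent = await this.classifyIntent(transcript, dialect);

    // Critical: Maintain context across code-switches
    const response = await this.generateResponse({
      intent,
      dialect,
      context: this.contextWindow,
      businessRules: this.loadBusinessRules(),
    });

    return this.synthesize(response, dialect);
  }
}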
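
The `detectDialect` step runs on the transcript, and its implementation isn't shown. As a purely illustrative stand-in (in Python, to match the earlier pipeline snippet), a lexical-marker heuristic could look like the sketch below. The marker lists are a small hand-picked assumption and `guess_dialect` is a hypothetical name; this is not VoxReception's classifier, and a real system would use a trained model.

```python
from collections import Counter

# A few common dialect marker words, hand-picked for illustration only.
DIALECT_MARKERS = {
    "gulf": {"وايد", "الحين", "شلونك", "ابي"},
    "egyptian": {"عايز", "دلوقتي", "ازيك", "اوي"},
    "levantine": {"بدي", "هلق", "كيفك", "كتير"},
}


def guess_dialect(transcript: str, default: str = "unknown") -> str:
    """Return the dialect whose marker words occur most often in the transcript."""
    counts: Counter = Counter()
    for token in transcript.split():
        for dialect, markers in DIALECT_MARKERS.items():
            if token in markers:
                counts[dialect] += 1
    if not counts:
        # No marker word matched, so make no claim about the dialect.
        return default
    return counts.most_common(1)[0][0]
```

A heuristic like this breaks down quickly on short utterances and spelling variants, which is part of why dialect handling needs real training data.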

Join the Journey

VoxReception is more than a product—it's a mission to make AI accessible for MENA businesses.

Want to be part of this? We're hiring engineers and dialect specialists.


Transform your customer service with VoxReception. Get started today.

Thank You

Special thanks to:

  • Early customers who trusted us
  • The Twilio team for MENA support
  • ElevenLabs for Arabic voice work
  • My co-founder Ahmed
