AethexAI, a startup founded in 2025, launched a voice AI platform tailored for African and Middle Eastern markets and is now handling more than 17,000 calls per day. The company built its own small language models and orchestration layer from scratch rather than relying on existing tools like Vapi and LiveKit.

The platform is designed to handle localized dialects of English, French, and Arabic spoken across Africa and the Middle East. AethexAI developed its Kora series of small language models with parameters ranging from 300 million to 1.7 billion. To train these models, the company used anonymized recordings from a call center partner and shipped hard drives to radio stations across Africa to collect additional audio data. It also built a contributor network of university students to annotate data and pronounce local names, reducing training costs.

Ayooluwa Odemuyiwa, CTO of AethexAI, said the decision to build small models was driven by regional infrastructure constraints. "The latency and jitter that we saw on automated calls in this region were outrageous. If we had become orchestrators, we might have had to use large models that were hosted outside the region, resulting in higher latency. We realized that in order for this to work, we have to use very small models and cut latency at every step."

The company raised $3 million in pre-seed funding led by 4DX Ventures, with participation from Enza Capital, Dorm Room Fund, Mojo Ventures, and Stanford GSB 26 Fund. Individual investors include Stanford faculty, telecom executives, and AI researchers from Anthropic. AethexAI was co-founded by Mariama Diallo, formerly of Goldman Sachs and YC-backed ModelML, and Odemuyiwa, a Caltech graduate and former Meta engineer who enrolled at Stanford Business School before starting the company.

AethexAI’s current use cases include debt collection, customer activation, and KYC verification. The company offers onsite demos and workshops to help clients identify high-impact starting points. "We always tell customers that we cannot be everything for everybody right now. We're small. When we start talking to a company, we ask them to pick one use case that is the most important to them to start [with]."

Walter Baddoo, co-founder and managing partner of 4DX Ventures, noted regional demand differences. "Enterprises in Africa and the Middle East process roughly three times the call volume of their Western counterparts, as voice is still the dominant channel for customer interaction." He added, "Incumbent systems were built for Western markets characterized by high-end GPU infrastructure, standard English and European speech environments, and enterprise workflows common in the U.S. and Europe. That creates real gaps when enterprises need systems that handle dialects, code-switching, and informal speech patterns, and that work within their existing telephony infrastructure and their actual price points."