Startup bets big on voice AI that actually listens

Emily Chen May 2, 2026 0 comments 6 min read

Tolan has built a voice-first AI companion that ditches text altogether, relying instead on OpenAI's GPT-5.1 to power fluid, natural conversations without the typical lag that makes voice assistants feel robotic.

The company's approach hinges on three technical pillars. Low-latency responses mean the AI talks back almost instantly, eliminating the dead air that breaks conversational flow. Real-time context reconstruction lets the system understand what's happening around the user and adjust on the fly. Memory-driven personalities give each AI instance a consistent character that develops over time rather than resetting with every interaction.

What sets Tolan apart is treating voice as the primary interface rather than bolting it onto a text-based system. By building around GPT-5.1 from the ground up, the company avoids the awkward pauses and misunderstandings that plague most voice AI today. The result feels less like talking to a chatbot and more like actual conversation.

The memory layer is particularly interesting. Instead of treating each voice session as isolated, Tolan's system builds on previous interactions, giving the AI a sense of continuity. A user might reference something from days ago and the companion understands without explicit reminding.

As generative AI moves beyond text, voice stands as the next frontier. Tolan is banking that combining faster models with smarter memory architecture can finally deliver on the promise of AI that talks like humans talk.

Author Emily Chen: "This could matter if they've actually solved the latency problem, because that's where most voice AI falls apart in practice."

Comments