Reference implementation

Homai

A Bashkir-speaking smart speaker.
A real-time voice agent: on-device wake word → ASR/TTS workers → an LLM agent with actions and RAG over user data.
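
A minimal sketch of that flow, not Homai's actual code: every class, function, and stub below is a hypothetical stand-in. The wake word is faked with a byte prefix, ASR and TTS are plain encode/decode calls, and RAG is reduced to a keyword match over one note. It only shows the order of the stages and where the agent can choose an action instead of a spoken answer.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class AgentReply:
    """What the (hypothetical) agent returns: text to speak, or an action name."""
    text: str
    action: Optional[str] = None


@dataclass
class VoicePipeline:
    """Wake word -> ASR -> retrieval + agent -> optional action -> TTS."""
    detect_wake_word: Callable[[bytes], bool]
    transcribe: Callable[[bytes], str]
    retrieve: Callable[[str], List[str]]
    decide: Callable[[str, List[str]], AgentReply]
    synthesize: Callable[[str], bytes]
    actions: Dict[str, Callable[[], str]] = field(default_factory=dict)

    def handle(self, audio: bytes) -> Optional[bytes]:
        # Stage 1: cheap gate; nothing downstream runs without the wake word.
        if not self.detect_wake_word(audio):
            return None
        # Stage 2: speech to text (ASR worker stand-in).
        query = self.transcribe(audio)
        # Stage 3: fetch user data relevant to the query, then let the agent decide.
        context = self.retrieve(query)
        reply = self.decide(query, context)
        spoken = reply.text
        # Stage 4: if the agent picked a registered action, run it and speak its result.
        if reply.action in self.actions:
            spoken = self.actions[reply.action]()
        # Stage 5: text to speech (TTS worker stand-in).
        return self.synthesize(spoken)


def build_demo_pipeline() -> VoicePipeline:
    """Wire the pipeline with toy stubs so the flow can be run end to end."""
    notes = ["The user's favorite tea is black tea with milk."]
    prefix = b"WAKE "

    def retrieve(query: str) -> List[str]:
        # Toy retrieval: keep notes that share at least one word with the query.
        words = set(query.lower().split())
        return [n for n in notes if words & set(n.lower().rstrip(".").split())]

    def decide(query: str, context: List[str]) -> AgentReply:
        # Toy agent: a rule instead of an LLM call.
        if "light" in query:
            return AgentReply(text="", action="light_on")
        if context:
            return AgentReply(text=context[0])
        return AgentReply(text="Sorry, I do not know.")

    return VoicePipeline(
        detect_wake_word=lambda audio: audio.startswith(prefix),
        transcribe=lambda audio: audio[len(prefix):].decode("utf-8"),
        retrieve=retrieve,
        decide=decide,
        synthesize=lambda text: text.encode("utf-8"),
        actions={"light_on": lambda: "Done: light on"},
    )


if __name__ == "__main__":
    pipeline = build_demo_pipeline()
    print(pipeline.handle(b"WAKE turn on the light"))
```

Passing each stage in as a callable keeps the orchestration independent of any particular model or vendor, so a stub can be swapped for a real worker without touching `handle`.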

I’m the founder of Homai.

What matters here

Architecture (high-level)

Links

Manufacturing & quality control

We manufacture devices in-house. Our SMD assembly machine runs software we wrote; the pipeline uses ML models and a vision LLM to improve assembly quality and reduce errors.

Media

Want to build a similar voice agent?

Send 3–5 lines of context and I’ll suggest the next step.