Voice that listens, thinks, and replies in 300 ms.
A real-time voice stack for products that talk back. Multimodal Gemma 4 Live for understanding, OmniVoice for synthesis, Whisper for transcription — all behind one API. Fluent in Arabic, multilingual by default.
Sub-300ms latency
p50 turn time-to-first-byte under 300 ms on the Conversational pillar SLO. Every hop is instrumented in docs/SLOs.md.
Privacy-first
Routes that never leave the device for users who opt in. The on-device path uses Whisper + Gemma 4 quantized; switch via a single flag.
Persistent memory
Your agents remember user preferences, in-progress tasks, and interrupted conversations across sessions — region-pinned per user.
Build premium voice products faster
Move from prototype to production with low-latency APIs, real-time streaming, and SDKs for the teams shipping voice into apps, workflows, and customer experiences.
- 01Streaming-ready APIsSub-200ms first chunk latency for real-time experiences
- 02Python & Node SDKsTyped clients that mirror the REST API you already use
- 03Batch & real-timeProcess millions of characters or stream live conversation
- 04Webhook supportAsync processing with reliable delivery guarantees
Available SDKs
Build voice experiences
that truly resonate
صوتك للعالم — بالعربية وأكثر من ٣٠ لغة
Join thousands of creators and developers building the future of Arabic voice with Nur.
No credit card required · 10,000 characters free every month