
On-device speech AI for Mac, Windows, Linux & Android
Build voice agents on a complete on-device speech stack: ASR (NVIDIA Nemotron, multilingual + streaming), TTS, voice cloning, diarization, denoising, and full-duplex speech-to-speech (NVIDIA PersonaPlex) — plus a voice agent pipeline for turn-taking, interruptions, and queuing. Runs on Mac, Windows, Linux, and mobile (iPhone + Android), with NPU-optimized inference (CoreML, NNAPI). Swift, Kotlin, and C++ APIs. Plus Speech Studio, a desktop voice-cloning app for creators. Apache 2.0, on-device.
Soniqo Speech is an open-source on-device speech AI platform that supports multiple operating systems, including Mac, Windows, Linux, and Android. It offers a comprehensive suite of features for building voice agents, including ASR, TTS, voice cloning, and more, with APIs available in Swift, Kotlin, and C++.