- Vosk is an offline open source speech recognition toolkit. It enables speech recognition models for 17 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino.
- Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification.
- Speech recognition bindings implemented for various programming languages like Python, Java, Node.JS, C#, C++ and others.
- Vosk works on Raspberry Pi3 and Pi4 but it also scales from mobile phones to big callcenter cluster. Vosk can also create subtitles for movies, transcription for lectures and interviews.
In this topic I will post some news about Vosk on Raspberry Pi, setup configurations. You are also welcome to ask me anything about Vosk or speech technology in general.