Voice-to-Text: Xeoma’s Intellectual Module for Speech Recognition

The AI-powered Voice-to-Text module of the Xeoma video surveillance software ‘listens’ to the audio stream from a camera or a separate microphone, hears speech, and saves the transcript of it in a CSV report or overlays it on the preview as text. Alternatively, you can set it to react to certain words or phrases. It can also work with .mp3 audio files – recordings of conversations, training videos, etc. – transcribing speech and providing it as text.

Working with Xeoma’s Voice-to-Text does not require specialized equipment: the sound stream from any camera or a separate microphone as well as regular off-the-shelf computers and video graphics cards are suitable.

More about the module and its capabilities in our video:

Check and comment on Youtube