Implementing Speech to Text Translation

Explore the implementation of speech to text translation using Azure Speech Cognitive Services. Understand how to convert audio files and real-time microphone input into text using Python, leveraging synchronous and asynchronous methods. Gain practical skills to apply speech recognition in cloud-based AI applications.

Introduction

In this lesson, we’ll explore speech-to-text conversion using the Azure Speech service. Speech-to-text, also referred to as STT, generates real-time text transcriptions from audio data. We can provide the audio as a file, as real-time streaming data, or directly from a microphone.
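As a rough illustration of these three input options, the sketch below maps each source to the keyword argument that the Speech SDK's AudioConfig class accepts for it (filename, use_default_microphone, or stream). The helper names audio_config_kwargs and build_audio_config, and the file name sample.wav, are hypothetical and chosen only for this example. The SDK import is deferred so that the mapping can be inspected without the azure-cognitiveservices-speech package installed.

```python
from typing import Any, Dict, Optional


def audio_config_kwargs(source: str, path: Optional[str] = None,
                        stream: Any = None) -> Dict[str, Any]:
    """Map an input source name to keyword arguments for AudioConfig.

    Hypothetical helper for illustration; it is not part of the SDK.
    """
    if source == "file":
        if not path:
            raise ValueError("a file source needs a path to an audio file")
        return {"filename": path}
    if source == "microphone":
        return {"use_default_microphone": True}
    if source == "stream":
        if stream is None:
            raise ValueError("a stream source needs an audio input stream")
        return {"stream": stream}
    raise ValueError(f"unknown audio source: {source!r}")


def build_audio_config(source: str, **options: Any):
    """Create an AudioConfig for the chosen source.

    Assumes the azure-cognitiveservices-speech package is installed.
    """
    # Imported lazily so the mapping above works without the SDK.
    import azure.cognitiveservices.speech as speechsdk

    return speechsdk.audio.AudioConfig(**audio_config_kwargs(source, **options))
```

For instance, audio_config_kwargs("file", path="sample.wav") yields the arguments for file input, while build_audio_config("microphone") would create a configuration that reads from the default microphone.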

The model that converts the audio into text behind the scenes is the same one Microsoft uses in its Office products and Cortana. It can perform speech-to-text conversion in more than 100 languages. For the full list, refer to Microsoft’s Language Support Documentation.

Dependencies

To work with this ...