Implementing Speech to Text Translation

Explore the implementation of speech to text translation using Azure Speech Cognitive Services. Understand how to convert audio files and real-time microphone input into text using Python, leveraging synchronous and asynchronous methods. Gain practical skills to apply speech recognition in cloud-based AI applications.

We'll cover the following...

Introduction
Dependencies
Implementation

Speech to text from an audio file
Speech to text using the microphone

Introduction

In this lesson, we’re going to explore the speech to text conversion using the Azure speech service. The Speech-to-Text—also referred to as STT—helps to generate real-time text transcriptions from audio data. We can provide the audio in a file format, from real-time streaming data, or directly from a microphone.

The model that is being used behind the scenes to convert the audio into text is the one Microsoft themselves is using in its Office products and Cortana. The model is capable of performing the speech to text translation in more than 100 languages. You can refer to the list from ... ...

1.Overview of the Course

2.Refresher to FastAPI - Python Web Framework

3.Introduction to Cloud and Microsoft Azure

4.Azure Vision Cognitive Services: Computer Vision

5.Azure Vision Cognitive Services: Custom Vision

6.Azure Vision Cognitive Services: Face API

Mini Project

Assessment

7.Azure Language Cognitive Services: LUIS

Mini Project

8.Azure Language Cognitive Services: QnA Maker

Mini Project

9.Azure Language Cognitive Services: Text Analytics

10.Azure Language Cognitive Services: Translator

Assessment

11.Azure Decision Cognitive Services: Anomaly Detection

12.Azure Decision Cognitive Services: Content Moderator

13.Azure Decision Cognitive Services: Personalizer

Assessment

14.Azure Speech Cognitive Services

15.Azure Bing Search Services

Assessment

16.Conclusion

17.Appendix

Implementing Speech to Text Translation

Introduction