Prerequisites and Required Libraries

In this chapter, we’ll build a web app that can transcribe audio using AssemblyAI and Streamlit.

Transcription

Transcription is the process of converting audio to text. Although we can implement a machine learning model to get the text from audio, it is extremely inconvenient. If we try to train a machine learning model for transcription, we’ll have to deal with the following:

  • Extensive knowledge of audio signal processing is needed to extract features from an audio signal.
  • A large amount of data will have to be mined/scraped from various sources.
  • Knowledge of machine learning libraries such as PyTorch or TensorFlow is required.

Fortunately, AssemblyAI has a free tier version available that lets us transcribe audio by making a few requests.

Create a free account to view this lesson.

By signing up, you agree to Educative's Terms of Service and Privacy Policy