Trusted answers to developer questions

Related Tags

ai

What is Speech-to-Text (STT)?

Nouman Abbasi

Speech-to-Text, or STT, is the conversion of spoken audio into written text. This answer looks at Google Cloud’s Speech-to-Text, an easy-to-use API powered by Google’s AI technologies.

Because Speech-to-Text is powered by Google’s deep learning models, it aims for state-of-the-art accuracy. You can also customize speech recognition for domain-specific terms and rare words by providing hints that boost the transcription accuracy of specific words or phrases.

Speech-to-Text can use one of several machine learning models to transcribe your audio file. The API currently offers speech recognition in more than 125 languages and variants.
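
The hints, model choice, and language selection described above correspond to fields in the recognition request. The following is a minimal sketch that only builds a JSON request body and sends nothing over the network. The helper name `build_recognize_request` is hypothetical, and the field names (`languageCode`, `model`, `speechContexts`, `phrases`, `boost`) and the fixed `LINEAR16` / 16 kHz audio settings are assumptions based on the v1 REST `speech:recognize` request format, so check them against the current API reference before relying on them.

```python
import base64
import json


def build_recognize_request(audio_bytes, language_code="en-US",
                            model="default", phrases=None, boost=None):
    """Build a JSON-serializable body for a recognition request.

    Hypothetical helper: field names are assumed from the v1 REST API.
    """
    config = {
        "encoding": "LINEAR16",         # assumed: raw 16-bit PCM audio
        "sampleRateHertz": 16000,       # assumed sample rate
        "languageCode": language_code,  # one of the supported languages/variants
        "model": model,                 # which transcription model to use
    }
    if phrases:
        # Hints for domain-specific terms and rare words.
        context = {"phrases": list(phrases)}
        if boost is not None:
            context["boost"] = boost    # assumed: larger value = stronger bias
        config["speechContexts"] = [context]
    return {
        "config": config,
        # Audio bytes are base64-encoded so they can travel inside JSON.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


if __name__ == "__main__":
    silence = b"\x00\x00" * 16000  # one second of silent 16-bit audio
    body = build_recognize_request(silence, phrases=["STT"], boost=10.0)
    print(json.dumps(body["config"], indent=2))
```

In a real program, this body would be sent to the service with valid credentials, or the official Python client library would be used instead of hand-built JSON.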

Other than the above-mentioned features, the STT API allows you to:

  • Transcribe your content in real-time or from stored files
  • Deliver a better user experience in products through voice commands
  • Gain insights from customer interactions to improve your service

Speech-to-Text is priced based on the amount of audio that is successfully processed by the service each month, measured in increments rounded up to 15 seconds. However, you can use this service for free if your audio duration does not exceed 60 minutes per month.
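
The rounding rule is easier to see with a small calculation. The sketch below assumes that each request is rounded up separately to the next 15-second increment and that the first 60 minutes of rounded audio per month are free. Both are interpretations of the description above, not confirmed billing rules. The function names and the `price_per_increment` parameter are hypothetical, and no real price is implied.

```python
import math

INCREMENT_SECONDS = 15             # billing increment from the text above
FREE_SECONDS_PER_MONTH = 60 * 60   # 60 free minutes per month


def billable_seconds(duration_seconds):
    """Round one request's audio duration up to the next 15-second increment."""
    if duration_seconds <= 0:
        return 0
    return math.ceil(duration_seconds / INCREMENT_SECONDS) * INCREMENT_SECONDS


def monthly_cost(durations, price_per_increment):
    """Estimate a month's cost for a list of request durations (in seconds).

    Assumption: rounding is applied per request, then the free allowance
    is subtracted from the monthly total.
    """
    total = sum(billable_seconds(d) for d in durations)
    chargeable = max(0, total - FREE_SECONDS_PER_MONTH)
    return (chargeable // INCREMENT_SECONDS) * price_per_increment


if __name__ == "__main__":
    # A 16-second clip is billed as 30 seconds under this rounding rule.
    print(billable_seconds(16))
    # One hour of audio stays within the free allowance.
    print(monthly_cost([3600], price_per_increment=1))
```

Under these assumptions, many very short requests cost more than one long request of the same total length, because each short request is rounded up on its own.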

If you’re interested in how to incorporate Speech-to-Text in your program, check out the course Google Cloud: AI Speech-to-Text with Python 3.

Copyright ©2022 Educative, Inc. All rights reserved
