
Building an OCR script for Images using Read API

Explore how to build an OCR script using Azure Computer Vision's Read API. Learn to authenticate the client, extract text from images, process JSON results, and draw bounding boxes around detected text. Gain practical experience with image input methods and handling API responses.

Introduction

We are going to build an OCR script that uses Azure Computer Vision’s Read API to perform OCR on some sample images.

If you want to execute the code snippets in this chapter on your local machine, visit the Appendix section, which walks through installing the dependencies (Python packages).

Implementing OCR

Importing the required packages

Let us first import all the packages that we need to complete our OCR functionality.

Python
from azure.cognitiveservices.vision.computervision import ComputerVisionClient
from azure.cognitiveservices.vision.computervision.models import OperationStatusCodes
from msrest.authentication import CognitiveServicesCredentials
from PIL import Image, ImageDraw
import requests
from io import BytesIO
import time

Authenticating and calling the Read API

Once we have imported all the required packages, we need to authenticate the Computer Vision client using our subscription key and endpoint. After authentication, we can call the Read API. Here is the code:

Python
client = ComputerVisionClient(
    computer_vision_endpoint,
    CognitiveServicesCredentials(computer_vision_key)
)
image_url = "https://cdn.pixabay.com/photo/2016/04/07/19/08/motivational-1314505__340.jpg"
read_response = client.read(image_url, raw=True)
  • First, we create an instance of the ComputerVisionClient class, passing the endpoint of our Azure Computer Vision resource and its subscription key (wrapped in CognitiveServicesCredentials) to the constructor.

  • Next, we define image_url, the URL of the image to analyze (any publicly accessible image URL works here).

  • Finally, we call the read() function on the client object, passing the image URL as its parameter. The raw=True argument makes the SDK return the raw HTTP response, which we will need later to read the Operation-Location header.

Here, we use an image URL with the read() function to extract text from the image. If the image is stored locally instead, use the read_in_stream() function. In the next lesson, we will use that function to read a PDF file and extract text from it; the same approach works for local images.
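Sketching that alternative here: read_local_image below is a hypothetical helper (not part of the SDK) that wraps read_in_stream(). It assumes a client authenticated as shown above.

```python
def read_local_image(client, image_path):
    # read_in_stream() mirrors read(), but takes a file-like object
    # (opened in binary mode) instead of a URL.
    with open(image_path, "rb") as image_stream:
        return client.read_in_stream(image_stream, raw=True)

# Usage (assumes an authenticated `client` as created earlier):
# read_response = read_local_image(client, "sample_image.jpg")
```

As with read(), the raw=True argument gives access to the Operation-Location header needed to poll for results.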

Fetching the results from the Read API

After calling the Read API, the next step is to process the JSON response and extract all text from the image. We’ll also draw bounding boxes around the detected text.

Python
from azure.cognitiveservices.vision.computervision import ComputerVisionClient
from azure.cognitiveservices.vision.computervision.models import OperationStatusCodes
from msrest.authentication import CognitiveServicesCredentials
from PIL import Image, ImageDraw
import time
import requests
from io import BytesIO

# Authenticate
client = ComputerVisionClient(
    computer_vision_endpoint,
    CognitiveServicesCredentials(computer_vision_key)
)

# Call the Read API on the image URL
image_url = "https://cdn.pixabay.com/photo/2016/04/07/19/08/motivational-1314505__340.jpg"
read_op = client.read(image_url, raw=True)
operation_location = read_op.headers["Operation-Location"]
operation_id = operation_location.split("/")[-1]

# Wait for the asynchronous operation to complete
while True:
    result = client.get_read_result(operation_id)
    if result.status not in ["notStarted", "running"]:
        break
    time.sleep(1)

# Download and open the image for drawing bounding boxes
response = requests.get(image_url)
img = Image.open(BytesIO(response.content))
draw = ImageDraw.Draw(img)

# Convert the API's bounding box (four corners, clockwise from top-left)
# into the (top_left_x, top_left_y, bottom_right_x, bottom_right_y)
# format that ImageDraw.rectangle() expects
def get_coordinates(bbox):
    return [bbox[0], bbox[1], bbox[4], bbox[5]]

# Extract and draw all text
if result.status == OperationStatusCodes.succeeded:
    for page in result.analyze_result.read_results:
        for line in page.lines:
            print(line.text)  # print the extracted text
            coords = get_coordinates(line.bounding_box)
            draw.rectangle(coords, outline="red", width=2)

# Save the image with bounding boxes
img.save("output_with_boxes.png")
print("Text extraction complete. Image saved as output_with_boxes.png")
  • We begin by importing all the required packages: ComputerVisionClient, OperationStatusCodes, CognitiveServicesCredentials, PIL’s Image and ImageDraw, time, requests, and BytesIO.

  • We then authenticate the Computer Vision client using our subscription key and endpoint, creating a client object.

  • We call the Read API on the image URL using the read() function and fetch the Operation-Location header from the raw response. The last segment of this URL is the operation ID, which we need in order to retrieve the results.

  • Because the Read API is asynchronous, we run a while loop to wait for the operation to complete:

    • On each iteration, we call get_read_result() with the operation ID.

    • If the operation status is no longer "notStarted" or "running", we break out of the loop.

    • Otherwise, we pause for one second between requests to avoid overwhelming the service.

  • Once the operation finishes, we download the image from the URL, open it with PIL, and create a draw object using ImageDraw.Draw(img) so that we can draw rectangles over the text.

  • The helper function get_coordinates() converts the bounding box returned by the Computer Vision service into the order expected by ImageDraw.

The bounding box from the API contains eight values: the four corner points of the detected rectangle, listed clockwise starting from the top-left corner:

(top_left_x, top_left_y, top_right_x, top_right_y, bottom_right_x, bottom_right_y, bottom_left_x, bottom_left_y)

ImageDraw.rectangle() instead requires the coordinates in this order:

(top_left_x, top_left_y, bottom_right_x, bottom_right_y)

so the function picks out the top-left and bottom-right values and returns them in the order needed for drawing rectangles.

  • Finally, we iterate over all pages and lines in result.analyze_result.read_results to:

    • Print each extracted line of text.

    • Convert its bounding box with get_coordinates().

    • Draw a rectangle around the text using draw.rectangle().

  • We save the final image, with all bounding boxes drawn, as output_with_boxes.png.

So, using this approach, the Read API extracts all text from the image and visually highlights it, ensuring nothing is missed.
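Because the full script needs a live Azure resource and credentials, the polling and drawing logic can also be exercised offline with stand-in objects. Everything below — FakeClient, the sample text line, and the box coordinates — is invented for illustration; only the polling loop and the drawing calls mirror the script above. The get_coordinates() variant here takes the min/max of the corner coordinates, which yields an upright box even if the detected text is slightly rotated:

```python
from types import SimpleNamespace
from PIL import Image, ImageDraw

class FakeClient:
    """Stand-in for ComputerVisionClient: reports "running" twice,
    then returns a succeeded result containing one line of text."""
    def __init__(self):
        self.calls = 0
        line = SimpleNamespace(text="OCR", bounding_box=[10, 20, 110, 20, 110, 50, 10, 50])
        page = SimpleNamespace(lines=[line])
        self.final = SimpleNamespace(
            status="succeeded",
            analyze_result=SimpleNamespace(read_results=[page]),
        )

    def get_read_result(self, operation_id):
        self.calls += 1
        if self.calls < 3:
            return SimpleNamespace(status="running")
        return self.final

client = FakeClient()

# Same polling loop as the script above (sleep omitted for the fake client)
while True:
    result = client.get_read_result("fake-operation-id")
    if result.status not in ["notStarted", "running"]:
        break

# min/max variant: an upright box regardless of corner order or rotation
def get_coordinates(bbox):
    xs, ys = bbox[0::2], bbox[1::2]
    return [min(xs), min(ys), max(xs), max(ys)]

# Draw on a blank canvas instead of a downloaded photo
img = Image.new("RGB", (200, 100), "white")
draw = ImageDraw.Draw(img)
for page in result.analyze_result.read_results:
    for line in page.lines:
        coords = get_coordinates(line.bounding_box)
        draw.rectangle(coords, outline="red", width=2)

print(result.status, client.calls)  # succeeded 3
```

Swapping FakeClient for a real, authenticated ComputerVisionClient (and the blank canvas for the downloaded image) recovers the behaviour of the full script.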