Introduction to OCR using Computer Vision's Read API

Explore the fundamentals of Optical Character Recognition using Azure Computer Vision's Read API. Understand the steps involved in extracting text from images and multi-page PDFs, including preprocessing, text localization, and post-processing. Gain practical insights into handling text extraction through structured JSON output and learn the API's limitations to implement effective OCR solutions.

We'll cover the following...

Introduction to OCR
The Read API
Limitations of the Read API

1.Overview of the Course

2.Refresher to FastAPI - Python Web Framework

3.Introduction to Cloud and Microsoft Azure

4.Azure Vision Cognitive Services: Computer Vision

5.Azure Vision Cognitive Services: Custom Vision

6.Azure Vision Cognitive Services: Face API

Mini Project

Assessment

7.Azure Language Cognitive Services: LUIS

Mini Project

8.Azure Language Cognitive Services: QnA Maker

Mini Project

9.Azure Language Cognitive Services: Text Analytics

10.Azure Language Cognitive Services: Translator

Assessment

11.Azure Decision Cognitive Services: Anomaly Detection

12.Azure Decision Cognitive Services: Content Moderator

13.Azure Decision Cognitive Services: Personalizer

Assessment

14.Azure Speech Cognitive Services

15.Azure Bing Search Services

Assessment

16.Conclusion

17.Appendix

Introduction to OCR using Computer Vision's Read API

Introduction to OCR