Building an OCR script for Documents using Read API
Understand how to implement Optical Character Recognition (OCR) for PDF documents and images using Azure Computer Vision's Read API. Learn to authenticate, process document streams, and extract text along with bounding-box coordinates. Practice with Python to build scripts capable of reading text from local files for intelligent applications.
We'll cover the following...
Introduction
In the previous lesson, we saw how to extract text from an image. Now we'll look at how to extract text from a PDF document.
You can download the sample PDF used in this lesson below:
Implementation for Documents
Now that you have the sample PDF, we can move on to implementing this functionality.
In lines 1 to 4, we import the required packages. ...
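For orientation, here is a minimal sketch of how a Read API call on a local PDF might be structured. It is not the lesson's own listing, so the line numbers mentioned above do not apply to it. It assumes the azure-cognitiveservices-vision-computervision package is installed; the function names (read_pdf, collect_lines, operation_id_from_location) and the endpoint, key, and poll_seconds parameters are illustrative choices rather than anything prescribed by the lesson.

```python
import time


def operation_id_from_location(operation_location):
    """Return the last path segment of the Operation-Location URL (the operation ID)."""
    return operation_location.rstrip("/").split("/")[-1]


def collect_lines(read_results):
    """Flatten per-page results into (page_number, text, bounding_box) tuples."""
    return [
        (page.page, line.text, line.bounding_box)
        for page in read_results
        for line in page.lines
    ]


def read_pdf(path, endpoint, key, poll_seconds=1.0):
    """Submit a local PDF to the Read API and return its recognized lines.

    Assumption: endpoint and key come from your own Computer Vision resource.
    """
    # Imported here so the helpers above can be used without the SDK installed.
    from azure.cognitiveservices.vision.computervision import ComputerVisionClient
    from azure.cognitiveservices.vision.computervision.models import OperationStatusCodes
    from msrest.authentication import CognitiveServicesCredentials

    client = ComputerVisionClient(endpoint, CognitiveServicesCredentials(key))

    # The Read operation is asynchronous: the first call only starts it.
    with open(path, "rb") as stream:
        response = client.read_in_stream(stream, raw=True)

    operation_id = operation_id_from_location(response.headers["Operation-Location"])

    # Poll until the service reports a final status.
    while True:
        result = client.get_read_result(operation_id)
        if result.status not in (OperationStatusCodes.not_started, OperationStatusCodes.running):
            break
        time.sleep(poll_seconds)

    if result.status != OperationStatusCodes.succeeded:
        raise RuntimeError(f"Read operation ended with status {result.status}")

    return collect_lines(result.analyze_result.read_results)
```

Calling read_pdf("sample.pdf", endpoint, key) with your own resource's values would return one tuple per recognized line, where the bounding box is the list of coordinates reported by the service for that line. Keeping the two helpers separate from the network call makes them easy to check without an Azure subscription.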