SUBSCRIPTION_KEY

ENDPOINT

Integrating Microsoft Computer Vision API with Python.png

Dockerfile.tar.gz

Azure

Django

In this course, you will learn to integrate Microsoft Computer Vision API. This API provides multiple functionalities such as extracting text from images, describing the image in human-understandable language, and moderating the content of an image.

More specifically, you will dive into Optical Character Recognition (OCR) and image analysis, effectively giving you a toolset to analyze any image you want.

Integrating Microsoft Computer Vision API with Python

## Read printed and handwritten text
Using the OCR service, we can read the visible text in an image and convert it to a character stream. There are two steps to successfully using the OCR service:
1. Call the Read API
2. Get the read results

Let’s see both in detail.
### Call the Read API
First, we’ll call the API with the image URL. We will use the `read` method as follows:
```python
read(read_image_url, language = None, pages = None, raw=True)
```
It will start an asynchronous process to read the image and will return an operation ID. 

Other than the image URL, we can also specify the following parameters:

|  Name |  Description |
| - | - |
|`language` (optional):| There are a lot of languages you can use. Supported languages for handwritten and print text are different. See the [Language Support](https://aka.ms/ocr-languages) page of the documentation for a list of supported languages.|
|`pages` (optional):| This option is only used for multi-page PDF and TIFF documents.|


Accepted inputs for `pages` include:


* **Single pages:** `1, 2` will process pages 1 and 2.
* **Finite:** `2-5` will process pages from 2 to 5.
* **Open-ended ranges:** `5-` will process all the pages beginning from page 5. Similarly, `-10` will process pages from 1 to 10.

The code for the `read` method is on **lines 13–16** in the code below.

# Read printed and handwritten text
Using the OCR service, we can read the visible text in an image and convert it to a character stream. There are two steps to successfully using the OCR service:
1. Call the Read API
2. Get the read results

Let’s see both in detail.
## Call the Read API
First, we’ll call the API with the image URL. We will use the `read` method as follows:
```python
read(read_image_url, language = None, pages = None, raw=True)
```
It will start an asynchronous process to read the image and will return an operation ID. 

Other than the image URL, we can also specify the following parameters:

|  Name |  Description |
| - | - |
|`language` (optional):| There are a lot of languages you can use. Supported languages for handwritten and print text are different. See the [Language Support](https://aka.ms/ocr-languages) page of the documentation for a list of supported languages.|
|`pages` (optional):| This option is only used for multi-page PDF and TIFF documents.|


Accepted inputs for `pages` include:


* **Single pages:** `1, 2` will process pages 1 and 2.
* **Finite:** `2-5` will process pages from 2 to 5.
* **Open-ended ranges:** `5-` will process all the pages beginning from page 5. Similarly, `-10` will process pages from 1 to 10.

The code for the `read` method is on **lines 13–16** in the code below.

Learn to read text from images and PDFs using OCR.

Get Started

Optical Character Recognition

Image Analysis

Sample Application

Conclusion

Read Printed and Handwritten Text

Read printed and handwritten text

Call the Read API

Name	Description
`language` (optional):	There are a lot of languages you can use. Supported languages for handwritten and print text are different. See the Language Support page of the documentation for a list of supported languages.
`pages` (optional):	This option is only used for multi-page PDF and TIFF documents.