Mastering Artificial Intelligence with Azure Cognitive Services/

...

Building an OCR script for Documents using Read API

Learn to extract text from PDF documents using Computer Vision's Read API.

We'll cover the following...

Introduction
Implementation for Documents
Implementation for Images

Press + to interact

C++

import time
from azure.cognitiveservices.vision.computervision import ComputerVisionClient
from msrest.authentication import CognitiveServicesCredentials
from azure.cognitiveservices.vision.computervision.models import OperationStatusCodes
client = ComputerVisionClient(
                      computer_vision_endpoint, 
                      CognitiveServicesCredentials(computer_vision_key)
                    )
def pdf_to_text():
  filepath = open('CourseAssets/printed_handwritten.pdf','rb')
  response = client.read_in_stream(filepath, raw=True)
  filepath.close()
  operation_location = response.headers["Operation-Location"]
  operation_id = operation_location.split("/")[-1]
  while True:
    result = client.get_read_result(operation_id)
    if result.status.lower () not in ['notstarted', 'running']:
      break
    time.sleep(10)
  return result
result = pdf_to_text()
if result.status == OperationStatusCodes.succeeded:
  for readResult in result.analyze_result.read_results:
    for line in readResult.lines:
      print(line.text)
      print(line.bounding_box)

Overview of the Course

Refresher to FastAPI - Python Web Framework

Introduction to Cloud and Microsoft Azure

Azure Vision Cognitive Services: Computer Vision

Azure Vision Cognitive Services: Custom Vision

Azure Vision Cognitive Services: Face API

Capstone Project 1: Building a Face Mask Classifier

Azure Vision Cognitive Services: Assessment

Azure Language Cognitive Services: LUIS

Capstone Project 2: Building a Weather Application Using LUIS

Azure Language Cognitive Services: QnA Maker

Capstone Project 3: Building a Chatbot Using Azure QnA Maker

Azure Language Cognitive Services: Text Analytics

Azure Language Cognitive Services: Translator

Azure Language Cognitive Services: Assessment

Azure Decision Cognitive Services: Anomaly Detection

Azure Decision Cognitive Services: Content Moderator

Azure Decision Cognitive Services: Personalizer

Azure Decision Cognitive Services: Assessment

Azure Speech Cognitive Services

Azure Bing Search Services

Azure Speech and Bing Search Services: Assessment

Conclusion

Appendix

Building an OCR script for Documents using Read API

Introduction

Implementation for Documents