How to Extract Information from Images

About

Use the Microsoft Computer Vision OCR component to extract OCR information from images and PDF files using the Microsoft Computer Vision READ API.

Supported file formats:

The maximum size that is currently supported for scanned PDFs is 4 MB.

All component triggers are set using the same method and instructions. See How to Classify Images.

Configure the Trigger so that only those documents to be OCR'd are passed to the API.

Passing unsupported file types or documents you do not desire to OCR results count as API requests.

This results in additional cost and time/performance.

Tip: Caching saves costs by reducing the API calls to prevent reprocessing of processed documents.

When caching is enabled the stage does not make a request to the API if both:

Clear Cache on Configuration Change
- If enabled, any change to this stage removes all items from the cache
Expiration
- Never: Cache never expires
- Sliding: Cache expires x days after the item is cached.
- Absolute: Cache expires after a set date.
Caching is enabled
- Enables caching of documents

Endpoint url: Enter the URL used to make calls to the Computer Vision API. The "computer vision" resource in the Azure Portal contains your endpoint URL. See the example below:
Api Key: Enter API key obtained when configuring Microsoft Computer Vision.
Total time to wait for processing: Specify the time to wait, in seconds, before retrying if API calls time out or fail.

Output property	Description
MSComputerVisionOCR	Text - Multi