How to Extract Information from Images

About

Use the Microsoft Computer Vision OCR component to extract OCR information from images and PDF files using the Microsoft Computer Vision READ API.

Supported file formats:

All component triggers are set using the same method and instructions. See How to Classify Images.

Configure the Trigger so that only those documents to be OCR'd are passed to the API.

Passing unsupported file types or documents you do not desire to OCR results count as API requests.

This results in additional cost and time/performance.

Name your component appropriately.

Check the "Component is enabled" checkbox (enabled by default).

Tip: Caching saves costs by reducing the API calls to prevent reprocessing of processed documents.

When caching is enabled the stage does not make a request to the API if both:

Clear Cache on Configuration Change
- If enabled, any change to this stage removes all items from the cache
Expiration
- Never: Cache never expires
- Sliding: Cache expires x days after the item is cached.
- Absolute: Cache expires after a set date.
Caching is enabled
- Enables caching of documents

Endpoint url
- URL used to make calls to the Computer Vision API.
- The "computer vision" resource in the Azure Portal contains your endpoint URL. See the example below:
Api Key
- API key obtained when configuring Microsoft Computer Vision
Total time to wait for processing
- Time to wait before retrying calls if API calls time out or fail

Output property	Description
MSComputerVisionOCR	Text - Multi