Text PDF Form Identification and Zonal Field Extraction

The Data Extractor component supports Text PDF Form Identification and Zonal Field Extraction.

Zonal Forms identify the Text-based PDF using a set of one or more locations (for example, Page number and PDF coordinates on the page) and a regular expression against which to match the extracted text. If the set matches, it is assumed the form is properly identified.

The Default Form Library is automatically created during installation. The Administrator manages the Forms in the Default Form Library on the Forms tab of the Data Extractor Component Properties dialog box. However, the Administrator cannot add or remove Form Libraries.

Note: At this time, we only support attachment of a single page form per job.

To set data extraction, you can use a control file and set:

prDataExtraction = Mode

ZonalForms = 8

Text PDF Form Identification and Zonal Field Extraction

See also