How to Extract Images From Documents
Use the BA Insight Image Extractor to extract images embedded in PDFs and Open XML documents such as .docx, .xlsx, and .pptx files.
How to Configure the Image Extractor
Use the following instructions to set up and configure the Image Extractor component:
On the Pipelines page, select a pipeline, and expand the New Component section.
From the Component list, select Image Extractor and provide a name for the component.
Click Add Component.
From the existing components list, select the Image Extractor component that you just created.
Specify the Triggers or alter the name of your component.
There is no extra configuration for the image extraction component.
| Input Property | Type |
|---|---|
| File | RawData |

| Output Property | Type |
|---|---|
| ExtractedImagesBinaryData | Byte Array – Multi |
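
To make the input and output properties above more concrete, here is a minimal Python sketch of how a raw file can map to a multi-valued list of byte arrays. It is not the BA Insight implementation. It relies only on the fact that Open XML files are ZIP packages that conventionally keep embedded media under `word/media/`, `xl/media/`, or `ppt/media/`. The function name `extract_images` is hypothetical, PDFs are not handled, and those folders can also hold non-image media such as audio or video.

```python
import io
import zipfile

# Folders where Open XML packages conventionally store embedded media.
MEDIA_PREFIXES = ("word/media/", "xl/media/", "ppt/media/")


def extract_images(raw_data: bytes) -> list[bytes]:
    """Hypothetical helper: return the bytes of each embedded media part.

    raw_data plays the role of the File input (RawData); the returned list
    plays the role of a multi-valued byte array output.
    """
    images = []
    with zipfile.ZipFile(io.BytesIO(raw_data)) as package:
        for name in package.namelist():
            # Skip directory entries; keep only parts inside a media folder.
            if name.startswith(MEDIA_PREFIXES) and not name.endswith("/"):
                images.append(package.read(name))
    return images
```

Calling `extract_images` with the bytes of a .docx file that contains two pictures would return a list of two byte strings, one per picture, in the order they appear in the package.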
How to Test Your Image Extractor Pipeline
BA Insight recommends that you test the Image Extractor before using it in real time.
Use the following steps to test your Image Extractor:
- Click Pipeline Testing.
- Test Target: Select one of the following:
  - Test the whole configuration: The document is processed in the same way as real documents.
  - Test a specific feature: The document is only processed by the feature that you select in the drop-down list.
  - Test a specific component: The document is only processed by the component that you select in the drop-down list.
  - You can also click Skip Trigger to test using the feature but without any of the feature's triggers.
- Select either of these choices:
  - Recorded Data: Choose your prerecorded data, if you have this data.
  - Or Paste RAW Text Data: The input data for testing can be defined here. Copy and paste the XML in the same format used to create the Recorder.
- Log Level: To change the default selection, Error, use the drop-down list to select Warning, Info, or Debug.
- Click Start Test to see your testing results.
Note: If there are no errors, the log does not return any results.