About Postprocessing Settings

Postprocessing settings control the corrections, transformations, and enhancements applied to a document after the OCR process is complete.

See the Postprocessing settings below.

Image

  • Color Control – retain the color of the original document (black and white, grayscale or color), convert to grayscale if the original document contained color, or convert to black and white if the original document contained color or grayscale.

Note: The Convert to black and white setting is similar to the Binarize setting. The difference is timing: Binarize occurs before character recognition, and Convert to black and white occurs after character recognition.

  • Quality – set a quality (compression) value for the scanned image from 0 – 100%.

Note: A higher value setting produces a better-quality image, but also a larger file size.

  • B&W Format – set the compression algorithm for black and white images to either JBIG2 or CCITT4.

  • Color & GrayScale Format - set the compression algorithm for color and grayscale images. Use the default for best results.
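The Color Control options above can be pictured with a short Python sketch. It is an illustration only: the function names, the mode names, the luma weights, and the fixed threshold of 128 are assumptions, not the product's documented behavior.

```python
# Hypothetical illustration of the three Color Control choices.
# Pixels are (R, G, B) tuples with values from 0 to 255.

def to_grayscale(pixel):
    """Convert one color pixel to a single gray level."""
    r, g, b = pixel
    # ITU-R BT.601 luma weights, a common choice for grayscale conversion.
    return round(0.299 * r + 0.587 * g + 0.114 * b)

def to_black_and_white(gray, threshold=128):
    """Map a gray level to 0 (black) or 255 (white) with a fixed threshold."""
    return 255 if gray >= threshold else 0

def apply_color_control(pixels, mode):
    """mode is 'retain', 'grayscale', or 'black_and_white' (assumed names)."""
    if mode == "retain":
        return list(pixels)
    grays = [to_grayscale(p) for p in pixels]
    if mode == "grayscale":
        return grays
    if mode == "black_and_white":
        return [to_black_and_white(g) for g in grays]
    raise ValueError(f"unknown mode: {mode}")

page = [(255, 0, 0), (250, 250, 250), (10, 10, 10)]
print(apply_color_control(page, "grayscale"))
print(apply_color_control(page, "black_and_white"))
```

Each step discards information: grayscale drops hue, and black and white drops the gray levels as well, which is why the smaller output cannot be converted back to the original.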

Settings (Paper Size)

  • Paper size – set the paper size of the document from a list of options or create a custom paper size.

See the Paper size setting descriptions below.

    • Original Size – same size as the original scanned document. This is the default.

    • Auto Fixed – automatically selects a standard paper size so that the contents of each page in the document fit. The smallest suitable paper size is selected once and applied to the entire document.

    • Auto Flexible – automatically selects a standard paper size so that the contents of each page in the document fit. The smallest suitable paper size is selected separately for each page.

    • Custom – create a custom paper size by specifying the width and height. Use only numbers and decimals for the width and height.

Note: An unusual custom paper size may cause the Compose Profile to fail.
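The difference between Auto Fixed and Auto Flexible can be sketched in Python. This is a minimal sketch under stated assumptions: the list of standard sizes (ISO A5, A4, A3 in millimetres), the function names, and the decision to ignore page orientation are all hypothetical and are not taken from the product.

```python
# Assumed list of standard sizes (name, width, height) in millimetres,
# ordered from smallest to largest. The product's real list may differ.
STANDARD_SIZES = [
    ("A5", 148, 210),
    ("A4", 210, 297),
    ("A3", 297, 420),
]

def smallest_fitting_size(width, height):
    """Return the name of the smallest standard size that fits the content."""
    for name, w, h in STANDARD_SIZES:
        if width <= w and height <= h:
            return name
    raise ValueError("content is larger than every standard size")

def auto_flexible(pages):
    """One decision per page: each page gets the smallest size that fits it."""
    return [smallest_fitting_size(w, h) for w, h in pages]

def auto_fixed(pages):
    """One decision per document: the smallest size that fits every page."""
    max_w = max(w for w, _ in pages)
    max_h = max(h for _, h in pages)
    return [smallest_fitting_size(max_w, max_h)] * len(pages)

# Content dimensions (width, height) of three pages, in millimetres.
pages = [(140, 200), (205, 290), (100, 150)]
print(auto_flexible(pages))  # ['A5', 'A4', 'A5']
print(auto_fixed(pages))     # ['A4', 'A4', 'A4']
```

In this sketch, Auto Flexible can produce a document with mixed page sizes, while Auto Fixed always produces a uniform one sized for the largest page.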

DOC, DOCX, and RTF Final Format Settings

Document Layout

  • Editable Copy - produces an editable copy of the final format. Open it in Word or WordPad and edit, if necessary. This is a default setting.

  • Exact Copy - places a text box around paragraphs to maintain the document’s page layout. Editing may be difficult because this setting attempts to preserve the document’s page layout.

Note: This setting is recommended for documents with complex layouts like promotional brochures or materials.

  • Formatted Text – preserves only font types and sizes in a paragraph. Text formatting like bold, italic, and underline is not retained.

  • Plain Text – preserves paragraph text formatted in a single column. Frames are not used, and font types and sizes are not retained.

  • Keep Line Breaks and Hyphens – keep the line breaks and hyphens used in the original document.

  • Keep Page Breaks - keep the page breaks at the locations used in the original document.
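As a rough illustration of what Keep Line Breaks and Hyphens changes, the Python sketch below reflows the lines of one paragraph. The function name and the behavior when the setting is cleared (joining lines and rejoining words split by an end-of-line hyphen) are assumptions made for this example, not a description of the product's implementation.

```python
def reflow(lines, keep_line_breaks_and_hyphens):
    """Join the recognized lines of one paragraph into output text."""
    if keep_line_breaks_and_hyphens:
        # Preserve the original line layout exactly.
        return "\n".join(lines)
    parts = []
    for line in lines:
        line = line.strip()
        if parts and parts[-1].endswith("-"):
            # Rejoin a word that was split across two lines.
            parts[-1] = parts[-1][:-1] + line
        else:
            parts.append(line)
    return " ".join(parts)

paragraph = ["Postprocessing runs after char-", "acter recognition", "is complete."]
print(reflow(paragraph, keep_line_breaks_and_hyphens=True))
print(reflow(paragraph, keep_line_breaks_and_hyphens=False))
```

A simple rule like this cannot tell a line-break hyphen from a genuine one (for example, "well-" followed by "known"), which is one reason to keep the original breaks when exact wording matters.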

PDF Final Format Settings

  • Compress Images using MRC – MRC stands for mixed raster content. Use this setting only if you are familiar with MRC and have a need for it. The output quality depends on the original document (input file). It produces smaller file sizes but can occasionally create blurry text.

  • Enable Text Sharpen Filter – use to sharpen text.

Note: Do not clear this default setting unless instructed to do so by a support representative.

  • Write HyperLinks - detects hyperlinks in the text and implements the hyperlinks in the final format of the document. Selecting the hyperlink will open your default browser or ask you which browser to use.

For PDF Compose Profiles only

Note: Some PDF readers automatically detect and write hyperlinks to the final format of a document. If you do not want hyperlinks written to your final format, you may need to clear this setting in both the PDF reader and the Compose Profile.

  • Keep Text and Background Colors – retains text and background colors used in the original document.

  • Embed Fonts – embed the fonts used in the document into the PDF so that the PDF reader does not have to guess and substitute other fonts.

  • PDFA Compliance Mode – PDF/A is a subformat of PDF designed for long-term archiving. Use this setting to ensure that everything needed to display the document, such as embedded fonts and color palettes, is available in the final format. Administrators can use this setting to select the PDF/A type and features needed for the final format. PDF/A is a standard document format for most United States courts and law firms.
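To make the Write HyperLinks description above more concrete, here is a minimal Python sketch of detecting hyperlinks in recognized text. The pattern and the function name are assumptions chosen for illustration; the product's actual detection rules are not documented here and are likely more thorough.

```python
import re

# Deliberately simple pattern: an http(s) URL runs until whitespace,
# an angle bracket, or a double quote.
URL_PATTERN = re.compile(r"https?://[^\s<>\"]+")

def find_hyperlinks(text):
    """Return (start, end, url) for each URL-like run in the text."""
    links = []
    for match in URL_PATTERN.finditer(text):
        # Drop trailing punctuation that usually belongs to the sentence.
        url = match.group().rstrip(".,;:)")
        links.append((match.start(), match.start() + len(url), url))
    return links

text = "See https://example.com/docs, or write to support."
for start, end, url in find_hyperlinks(text):
    print(start, end, url)
```

The start and end offsets mark the span of text that a writer would turn into a clickable link in the final format.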

Text Final Format Settings

  • Encoding – sets the encoding for the final format of the document. See Encoding types below.

    • Simple – encode one byte per symbol automatically based on the device settings and contents of the original document.

    • Unicode UTF-8 – encode using UTF-8.

    • Unicode UTF-16 – encode using UTF-16.

    • Auto select – determines if Unicode Transformation Format (UTF) or Simple encoding should be used. Results may not be accurate.

    • Non-Unicode – select a non-Unicode option, if necessary.

  • Insert Page Break Character As Page Break – insert special page break characters (0 - 12) between pages when multiple pages are exported to a *.txt file format.

Note: These special characters are inserted during the OCR process and used as flags by document viewers to identify where page breaks exist. They are not visible in the final format.

  • Use Blank Line As Paragraph Separator – use a blank line to separate paragraphs.
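The text export settings above can be combined in one small Python sketch. Everything named here is hypothetical: the function and parameter names, the use of character code 12 (the ASCII form feed) as the page break character, and the use of the cp1252 code page as a stand-in for the one-byte-per-symbol Simple encoding.

```python
FORM_FEED = chr(12)  # ASCII 12, assumed here as the page break character

def export_text(pages, encoding="utf-8", page_break=FORM_FEED,
                blank_line_between_paragraphs=True):
    """Encode pages as bytes. pages is a list of lists of paragraph strings."""
    paragraph_sep = "\n\n" if blank_line_between_paragraphs else "\n"
    page_texts = [paragraph_sep.join(paragraphs) for paragraphs in pages]
    # The page break character is written between pages, not after the last one.
    return page_break.join(page_texts).encode(encoding)

pages = [["Café menu", "Prices in €"], ["Page two"]]

simple = export_text(pages, "cp1252")  # one byte per symbol
utf8 = export_text(pages, "utf-8")     # one to four bytes per symbol
utf16 = export_text(pages, "utf-16")   # two or four bytes per symbol, plus a byte order mark

for name, data in [("cp1252", simple), ("utf-8", utf8), ("utf-16", utf16)]:
    print(name, len(data), "bytes")
```

The same 31 characters take a different number of bytes under each encoding, and a one-byte code page can only represent the symbols it contains, which is the usual reason to prefer a Unicode encoding for mixed-language documents.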

XLS, XLSX Final Format Settings

Document Layout

  • Formatted Text – preserves text formatting when exporting to an XLS/XLSX format.

  • Plain Text – removes text formatting when exporting to an XLS/XLSX format.

  • Background Color Mode – specifies the background color mode when exporting to an XLSX format. Only the background color in table cells is saved.

See the Background Color Mode options below.

    • Save Color for Inverted Blocks – preserves the background color only for inverted blocks of color used in a document. This is a default setting.

    • Don’t Save Color – no color is saved.

    • Save Color – saves the background color used in the original document.

    • Save Black and White – saves the background color in black and white.

  • Convert Numeric Values to Numbers – exports numeric values to an XLS/XLSX format as numbers instead of strings.

  • Ignore Text Outside Tables – exports only recognized text from table blocks to an XLS/XLSX format. This is not a default setting and must be selected, if needed.

  • Create Separate Worksheet For Each Page – creates a separate worksheet for each document page exported to an XLSX file.

  • Keep Text Color – preserves the original text colors when recognized text is exported to an XLSX file format.
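The effect of Convert Numeric Values to Numbers can be sketched in Python as follows. This is an illustration under assumptions: the function name is hypothetical, and the sketch treats "," as a thousands separator and "." as the decimal mark, which may not match the product's locale handling.

```python
import re

# An optional sign, digits, and an optional decimal part.
NUMBER = re.compile(r"[+-]?\d+(\.\d+)?")

def convert_cell(value):
    """Return an int or float for numeric text; otherwise return the text unchanged."""
    candidate = value.strip().replace(",", "")
    if not NUMBER.fullmatch(candidate):
        return value
    return float(candidate) if "." in candidate else int(candidate)

row = ["Widget", "1,250", "3.75", "N/A"]
print([convert_cell(cell) for cell in row])  # ['Widget', 1250, 3.75, 'N/A']
```

Cells stored as numbers can be summed and sorted numerically in a spreadsheet, while cells stored as strings cannot, which is the practical reason to select this setting for tabular data.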

See also

About Compose Profiles

About Preprocessing Settings

Creating a Compose Profile

Applying a Compose Profile

Compose Profiles in Action