Page Comparison

...

- Remove Garbage – Removes excess dots that are smaller than a certain size from the image during objects extraction (despeckle).
- Remove Texture – Temporarily remove the background noise during OCR which might interfere with text recognition.
- Detect Matrix Printer – If the source document was produced on a dot-matrix printer, use this option to interpret the text more accurately.
- Detect Porous Text – Detect regions of the document with “porous” text.
- Detect Text On Pictures – If the document has text on an image or colored background, use this selection to allow for OCR on the image.
- Enable Aggressive Text Extraction – Enables the OCR engine to attempt to extract as much text on the image as possible. This is useful when the image contains some low-quality text, (although it may still require manual correction).
- Fast Objects Extraction – When speed is required more than a high level of OCR accuracy, select this setting.
- Prohibit Color Image – For the OCR engine to skip text laid over an image or colored background and only scan the black-and-white text.
Specify how to reuse the text and image layers of the source PDF file by selecting from the PDF Layer Reuse Mode drop-down list. Do not use this setting if the source file contains only raster-based data, such as image-only PDF files.
- Auto – Have the OCR engine use both text and image layers. This is useful in most cases.
- Do Not Reuse – Do not reuse the text layer which exists in the PDF file.
- Content Only – For Have the OCR engine use only text layers in the PDF file, if they exist.

...

Versions Compared

Old Version 14

New Version Current

Key