...

    • Remove Garbage – Removes excess dots that are smaller than a certain size from the image during objects extraction (despeckle).

    • Remove Texture – Temporarily removes background noise during OCR that might interfere with text recognition.

    • Detect Matrix Printer – If the source document was produced on a dot-matrix printer, use this option to interpret the text more accurately.

    • Detect Porous Text – Detect regions of the document with “porous” text.

    • Detect Text On Pictures – If the document has text on an image or colored background, use this selection to allow for OCR on the image.

    • Enable Aggressive Text Extraction – Enables the OCR engine to attempt to extract as much text from the image as possible. This is useful when the image contains low-quality text, although the results may still require manual correction.

    • Fast Objects Extraction – Select this setting when speed matters more than a high level of OCR accuracy.

    • Prohibit Color Image – Have the OCR engine skip text laid over an image or colored background and scan only the black-and-white text.


  • Specify how to reuse the text and image layers of the source PDF file by selecting from the PDF Layer Reuse Mode drop-down list. Do not use this setting if the source file contains only raster-based data, such as image-only PDF files.

    • Auto – Have the OCR engine use both text and image layers. This is useful in most cases.

    • Do Not Reuse – Do not reuse the text layer that exists in the PDF file.

    • Content Only – Have the OCR engine use only the text layers in the PDF file, if they exist.

...