Tip

Please note that FTE is an invite-only, pre-release feature. If you feel the technology described here fits a particular need, please reach out to your Square 9 representative for details on how you can join the early adopters program.

Square 9 offers a number of traditional OCR options, but also has options that leverage AI and ML tooling. While more modern extraction tooling can be very good at decreasing setup time, it’s often not a complete solution. Customers may need to blend modern and traditional approaches to form a complete, all-encompassing data capture platform.

Square 9’s most recent offering in the AI extraction space, Form and Table Extraction (FTE), also referred to as ArticulateML, involves AI-assisted extraction models that are largely application, document, and form independent. FTE works off of two core constructs: forms and tables.

FTE differs from other AI-driven extraction offerings from Square 9 like TransformAI (TAI). Most notably, it is not document specific and can operate on any document type. However, like TransformAI, successful extraction outcomes do have rules. For TAI, those rules revolve around document characteristics that are common among invoices and receipts. For FTE, the rules revolve around data points being grouped into either Key / Value pairs or tables.

Key / Value Pairs

Keys and their associated values are a core construct of extraction with FTE. For the more technical audience, Key / Value pairs are a common type of data structure used in programming and scripts. In the context of a document, however, Key / Value pairs can take on new meaning.
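For readers less familiar with the data structure, the following is a minimal Python sketch of a key / value mapping. The field names and values are illustrative placeholders chosen for this example; they are not output from FTE.

```python
# A key / value pair associates a descriptive name (the key)
# with a piece of data (the value).
# Field names and values here are illustrative placeholders only.
invoice_fields = {
    "Customer Number": "C8675309",
    "Invoice Number": "INV-0001",
}

# Values are looked up by their key rather than by their position.
customer_number = invoice_fields["Customer Number"]

# Iterating over the mapping yields each key together with its value.
pairs = [f"Key: {key}, Value: {value}" for key, value in invoice_fields.items()]
```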

...

In a traditional extraction model, users are generally less concerned about keys and focus exclusively on values. It would be very simple to create an OCR template that extracted values for Customer Number, Invoice Number, Invoice Date, and Total Due. As your capture needs expand, however, this model becomes fragile. Variances in scan resolution might impact positioning, and similar documents produced by other vendors will almost certainly introduce differences in layout. Square 9’s GlobalCapture offers a number of tools to help with such discrepancies in a more traditional manner, such as Marker Zones and pattern matching. FTE takes a different approach.

Rather than using structured or semi-structured templates, FTE leverages the power of AI to make assumptions about the text on a page. Rather than requiring a user to identify via a template that “C8675309” is the customer number, FTE makes the assumptions automatically on behalf of the user. So in the AI-assisted world, the OCR result wouldn’t be an arbitrary value “C8675309” that a user has told us should be treated as Customer Number. Instead, the OCR result would resemble “Key: Customer Number, Value: C8675309”. The same pattern would hold true for all Key / Value pairs identified on the document. So in this case, you would expect to see results like:

...

While the OCR results are extremely good, success does require adherence to a pattern of some kind. In the case of FTE, each value is expected to have a descriptive key in its general vicinity. This does not mean keys and values need to be presented in a specific way visually, nor does it mean grid lines must be present in the document’s layout. It means that for each value one cares to extract, there must be a related key.
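To illustrate why a related key matters, here is a minimal Python sketch of how a downstream step might look up a value by its key. The list-of-dictionaries result shape, the `normalize` and `find_value` helpers, and the sample data are all assumptions made for this example and do not represent the actual FTE output format or API.

```python
from typing import Dict, List, Optional


def normalize(label: str) -> str:
    """Lower-case a label and replace punctuation with spaces,
    so that 'Customer Number:' matches 'customer number'."""
    cleaned = "".join(ch if ch.isalnum() else " " for ch in label.lower())
    return " ".join(cleaned.split())


def find_value(pairs: List[Dict[str, str]], wanted_key: str) -> Optional[str]:
    """Return the value whose key matches wanted_key, or None if no such key exists."""
    target = normalize(wanted_key)
    for pair in pairs:
        if normalize(pair["key"]) == target:
            return pair["value"]
    return None


# Hypothetical extraction result; the shape and values are placeholders.
extracted_pairs = [
    {"key": "Customer Number:", "value": "C8675309"},
    {"key": "INVOICE DATE", "value": "2024-01-31"},
]

customer_number = find_value(extracted_pairs, "customer number")
total_due = find_value(extracted_pairs, "Total Due")  # no matching key was extracted
```

In this sketch, a value that appeared on the page without a nearby key would never show up in `extracted_pairs`, so a lookup for it returns `None`.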

...

Despite the limitations that arise when a value has no key, FTE offers a powerful, accurate approach to semi-structured and unstructured document extraction that fits well into a large set of extraction use cases.

Tables

In addition to Key / Value extraction, FTE can be used to identify and extract tabular data from a document page. Because it is not bound to a document type, FTE offers greater flexibility with tables and their associated values when compared to a feature like TAI.

...

While the human eye might be able to quickly determine that the table marked (2) is the table of interest for this specific document, the computer cannot make such assumptions. In this image snippet, two discrete tables could be identified. Tables are most commonly and most successfully extracted when there are clearly defined rows and columns.

When FTE executes on a document page, it will return the data organized by the table it was identified in. In the case above, we would expect two tables, one with 8 rows and 2 columns (table 1) and one with 2 rows and 6 columns (table 2).
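As a rough illustration of data organized by table, the Python sketch below models each table as a list of rows and reports its dimensions. The `Table` type, the `table_shape` helper, and the placeholder cell contents are assumptions for this example; the actual structure FTE returns may differ.

```python
from typing import List, Tuple

# Assumed representation: a table is a list of rows, each row a list of cell strings.
Table = List[List[str]]


def table_shape(table: Table) -> Tuple[int, int]:
    """Return (rows, columns), using the widest row as the column count."""
    rows = len(table)
    columns = max((len(row) for row in table), default=0)
    return rows, columns


# Placeholder cells standing in for the two tables described above.
table_1 = [[f"r{r}c{c}" for c in range(2)] for r in range(8)]  # 8 rows, 2 columns
table_2 = [[f"r{r}c{c}" for c in range(6)] for r in range(2)]  # 2 rows, 6 columns

# Results are grouped per table, in the order the tables were identified.
tables = [table_1, table_2]
shapes = [table_shape(t) for t in tables]
```

Because both tables are returned, a consuming workflow would still need its own rule, such as an expected column count, to decide which table is the one of interest.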