Extraction – Kofax Getting Started with Ascent Xtrata Pro User Manual

Page 41


Chapter 2


Ascent Xtrata Pro User's Guide

Layout Classifier

The Layout Classifier analyzes the graphical representation of the document image
and automatically creates classes of similar documents. Training documents are
needed to enable layout classification for a class. The representations of these
training documents are used to train the classifier. For detailed information, see
Layout Classifier on page 43.

Adaptive Feature Classifier

The Adaptive Feature Classifier (AFC) analyzes the textual representations of
documents and automatically creates classes of similar documents. Training
documents are needed to enable the AFC for a class. The classifier is trained with the
textual representation of these training documents. For detailed information, see
Adaptive Feature Classifier on page 44.

Instruction Classifier

The Instruction Classifier searches for specified phrases in the textual representation
of a document; therefore, no training documents are needed. To enable the
Instruction Classifier, characteristic phrases (referred to as instructions) are defined.
For detailed information, see Instruction Classifier on page 45.
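
The phrase-based approach can be illustrated with a minimal sketch. It is not the product's implementation: the `INSTRUCTIONS` table, the function name, and the rule of picking the class with the most matching phrases are all assumptions made for illustration.

```python
from typing import Dict, List, Optional

# Hypothetical instruction table: class name -> characteristic phrases.
INSTRUCTIONS: Dict[str, List[str]] = {
    "Invoice": ["invoice number", "amount due"],
    "Order": ["purchase order", "order number"],
}


def classify_by_instructions(
    text: str, instructions: Dict[str, List[str]]
) -> Optional[str]:
    """Return the class whose phrases match the text most often, or None.

    Assumption: matching is a case-insensitive substring search and the
    class with the most matching phrases wins. The real matching rules
    are not described in this section.
    """
    lowered = text.lower()
    best_class: Optional[str] = None
    best_hits = 0
    for class_name, phrases in instructions.items():
        hits = sum(1 for phrase in phrases if phrase.lower() in lowered)
        if hits > best_hits:
            best_class, best_hits = class_name, hits
    return best_class
```

Because the phrases are defined by hand, nothing has to be trained. A document whose text contains none of the phrases is left unclassified (`None`) in this sketch.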

Classification based on extraction

You can define project-level fields for which extraction is performed before
classification. The extraction results for these project-level fields can be used to
classify the document. For example, you can classify a document based on a barcode.
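
As a rough sketch of the barcode example, assume the project-level extraction has already produced a field named `Barcode` and that the class is chosen from the barcode's prefix. The field name, the prefix table, and the function are hypothetical and only illustrate the idea.

```python
from typing import Dict, Optional

# Hypothetical mapping from barcode prefix to class name.
BARCODE_PREFIX_TO_CLASS: Dict[str, str] = {
    "INV": "Invoice",
    "ORD": "Order",
}


def classify_by_barcode(project_fields: Dict[str, str]) -> Optional[str]:
    """Pick a class from the value of an already extracted barcode field.

    Assumption: the project-level field is called "Barcode" and its
    prefix identifies the class. Returns None if no prefix matches or
    the field is missing.
    """
    barcode = project_fields.get("Barcode", "")
    for prefix, class_name in BARCODE_PREFIX_TO_CLASS.items():
        if barcode.startswith(prefix):
            return class_name
    return None
```

The point of the sketch is the order of operations: the field value exists before any class is assigned, so it can drive the classification.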

Reclassify Documents

The classification result can also be changed during extraction, after which extraction
is performed once again for the new class.

Extraction

Each class can be set up to contain a set of fields for storing the extracted data. These
fields can be synchronized with Ascent Capture fields. The fields are filled by agents
(referred to as locators) that search for data on the document. Locators come in
different types, distinguished by how they search. The locator types are described
in detail in Extraction.
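
The relationship between fields and locators can be sketched as follows. This is an illustration under assumptions, not the product's API: a locator is modeled as a function that searches the document text and returns a value or nothing, and `regex_locator` is a hypothetical locator type that searches with a regular expression.

```python
import re
from typing import Callable, Dict, Optional

# Assumption: a locator takes the document text and returns the located
# value, or None if nothing was found.
Locator = Callable[[str], Optional[str]]


def regex_locator(pattern: str) -> Locator:
    """Build a hypothetical locator that returns the pattern's first group."""
    compiled = re.compile(pattern, re.IGNORECASE)

    def locate(text: str) -> Optional[str]:
        match = compiled.search(text)
        return match.group(1) if match else None

    return locate


def extract_fields(text: str, locators: Dict[str, Locator]) -> Dict[str, Optional[str]]:
    """Fill each field of a class with the result of its locator."""
    return {field: locate(text) for field, locate in locators.items()}


# Hypothetical field set for an "Invoice" class.
INVOICE_LOCATORS: Dict[str, Locator] = {
    "InvoiceNumber": regex_locator(r"invoice number:\s*(\S+)"),
    "Total": regex_locator(r"total:\s*([\d.]+)"),
}
```

Other locator types would plug into the same field mapping with a different search strategy. A field whose locator finds nothing stays empty (`None`) in this sketch.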
