Classification based on extraction, Reclassification of documents – Kofax Getting Started with Ascent Xtrata Pro User Manual

Page 84

Advertising
background image

Classification

Ascent Xtrata Pro User's Guide

65

Hierarchical Evaluation and Other Classification Rules

The evaluation of classification results is primarily based on the minimum
confidence and distance defined in the project settings. But, if the class hierarchy
contains hierarchical elements, a set of hierarchical evaluation rules is automatically
applied to the classification result. This might result in a classification that does not
have the highest confidence.

The following sections provide more information about these classification rules.

Classification based on extraction

You can define fields on project level such that extraction is performed before
classification, and where those extraction results can be used for classification. For
example, it is possible to classify a document based on bar code results. In a similar
manner, it is possible to perform classification using zones. For example, using form
IDs at certain places on the document.

For example:

Private Sub

Document_AfterClassifyXDoc(pXDoc

As _

CASCADELib.CscXDocument)

If

pXDoc.Fields(0).Text = "XYZ"

Then

pXDoc.Reclassify "NewClass3"

End If

End Sub

Reclassification of Documents

The classification result can also be changed during extraction, in which case
extraction is repeated for the new class. Inside the classification script, the extraction
results for the project-level fields can be used to manually reassign the classification
result. In order to avoid loops, this sort of reclassification can only be done once per
document.

Fields, locators, and validation rules (at the project level) are available in all classes as
derived items. By default, the project-level fields and locators will not be extracted
again during any subsequent extractions. Once extraction has been performed, the
preserve-flag for these fields and locators will be set to 'TRUE'. If one of the fields or
locators needs to be extracted again, the preserve-flag must be set to 'FALSE' at the
beginning of extraction.

Advertising