Kofax INDICIUS 6.0 User Manual

Page 81

Advertising
background image

Configuration

Getting Started Guide (Classification and Separation)

71

2

Within the table, clear the “Include” check box for the Header document
type, so these documents are not used in training the classifier.

This document type will be accounted for later by configuring templated
(barcode) classification.

3

Click Build.

4

Once the classifier has been built, click Finish.

Integrate Classifier

In production, Recognition runs a Recognition script, which uses the classifier. The
Recognition script (named Document Classification.ifv) is created automatically
when the configuration is created. Two changes may be needed in this script:

ƒ

The name of the classifier

ƒ

The pages to be used by the classifier (and therefore that need to be read)

The script will, by default, use a classifier called “Document text classifier.ibc.” This
is the default name of the classifier created using the Build Document Text Classifier
tab. If the name is left unchanged, no modification is needed to the script. For
information on changing the classifier name in the script, refer to the INDICIUS Help.

The script will, by default, run the classification on all pages. For information on
changing the pages to be used, refer to the INDICIUS Help.

X

To integrate the classifier,

no modifications to the script are required for this

tutorial.

Test Classification

You will test the configuration on the Test Documents set, that is, the documents that
were not used to build the classifier. These documents require exporting from
Transformation Studio so they can be loaded into the Recognition Test Tool. You will
need to export these documents in the correct file structure for testing document
classification (a multi-page image file for each document).

You will then assign the configuration to a project in Recognition Test Tool, where it
is run on the test documents.

Note

Although all testing could be done once the configuration is finished, it is

recommended that testing is done as each classification method is implemented,
ensuring any issues are quickly found and fixed.

Advertising