Kofax INDICIUS 6.0 User Manual

Page 41

Advertising
background image

Configuring Recognition and Correction

Getting Started Guide (Fixed-Form)

35

f

If the document type is not identified correctly or alignment is not
corrected for any image, alter the registration points and retest until
satisfactory performance is achieved.

3

Define and configure data fields as follows:

a

For the first data field, define its position, set noise and background
removal schemes, configure segmentation and select suitable recognition
weights from the standard set.

b

Test the definition on a single example image.
Check that each data field is located correctly, that noise and background
features are removed without degrading the characters significantly, and
that the majority of characters which are clearly printed/written are
segmented correctly (in many cases Recognition can be configured to
separate joined characters and reconstitute broken ones). Also check that
at least 80% of characters that are segmented correctly are then
recognized correctly.

c

If there are problems in any of these areas, reconfigure and retest until
satisfactory results are achieved.

d

Retest the definition on the other example images.

e

If there are problems in any of these areas, reconfigure and retest until
satisfactory results are achieved.

Note

Problems in recognizing clearly printed/written characters may be

due to the weights file being sub-optimal for the typeface or language –
such problems should be dealt with later by tuning character recognition.

f

Repeat the above for the other data fields on the example images, using
copy and paste of fields wherever possible.

4

Tune character recognition as follows:

a

If clearly printed/written characters are not recognized well, try using
different combinations of weights files and voting configurations to
increase performance.

b

If necessary, use Recognition Trainer to tune the recognition for the
characters on the specific document set.

By using this methodology, optimum Recognition performance on a single document
type is guaranteed. For configuration of multiple document types, refer to the How
to Configure – Recognition book in the INDICIUS Help.

Advertising