Kofax Getting Started with Ascent Xtrata Pro User Manual


Extraction

Ascent Xtrata Pro User's Guide


When a dictionary is used as a keyword, it behaves as if you had manually entered a
long list of keywords. All optional keyword settings are applied to every word in the
dictionary.

Keyword dictionaries can be very useful. Consider the case where you want to
extract an invoice date, but the keyword designating it can vary. Instead of listing all
invoice date keywords individually, you can provide a dictionary containing them
(“Invoice date”, “Inv. Date”, “Date”, “Date of invoice”, “Day issued”, etc.).
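
The idea can be sketched outside the product in a few lines of Python. This is only an illustration: the function name `find_keyword`, the `ignore_case` option, and the sample keyword list are assumptions, not part of the Ascent Xtrata Pro interface. The sketch treats the dictionary as a plain list of keyword variants and applies one shared setting to all of them.

```python
import re

# Hypothetical keyword dictionary: every entry designates the invoice date.
INVOICE_DATE_KEYWORDS = [
    "Invoice date",
    "Inv. Date",
    "Date of invoice",
    "Day issued",
    "Date",
]

def find_keyword(line, keywords, ignore_case=True):
    """Return the longest dictionary entry found in the line, or None.

    The single ignore_case option is applied to every entry, mirroring how
    one set of keyword settings applies to the whole dictionary.
    """
    flags = re.IGNORECASE if ignore_case else 0
    # Try longer entries first so "Date of invoice" wins over plain "Date".
    for keyword in sorted(keywords, key=len, reverse=True):
        if re.search(re.escape(keyword), line, flags):
            return keyword
    return None
```

For example, `find_keyword("INV. DATE: 12.03.2008", INVOICE_DATE_KEYWORDS)` returns `"Inv. Date"`, while the same call with `ignore_case=False` returns `None`.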

To use a dictionary as a keyword

1. Open the properties dialog box for a Format Locator.
2. Select the Evaluation Settings tab.
3. Click the arrow next to the keyword field. A list of available dictionaries is
   displayed. If no dictionaries are available, you can select “Dictionary
   Settings” to open the Project Settings dialog box and import a dictionary.
4. Select a dictionary from the list.
5. Click Add or Modify to insert the dictionary into the list.

As an example, you can use a dictionary of city names to uniquely identify a
five-digit number as a (German) zip code.

First, specify a format such as “\d{5}”, which locates all five-digit numbers.
However, business documents usually contain several five-digit numbers.

To zero in on the zip code, the next step is to add a dictionary of city names to the
project and use it as a keyword that must have the correct geometric relationship to
the zip code (for example, to the right in the case of German addresses).

When the Test button is clicked, the results are displayed in the viewer and in the
result list with the corresponding confidences.
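
The zip code example can be approximated with a short Python sketch. It is not the product's algorithm: the `Word` type, the coordinate units, the city list, and the rule "nearest word to the right on the same line" are all assumptions chosen to illustrate the combination of a format (`\d{5}`) with a dictionary keyword in a fixed geometric relationship. The sketch also returns a plain yes/no result rather than confidences.

```python
import re
from typing import NamedTuple

class Word(NamedTuple):
    text: str
    x: int      # left edge of the word on the page (assumed units)
    line: int   # index of the text line the word sits on

CITIES = {"Freiburg", "Berlin", "Hamburg"}  # hypothetical city dictionary
FIVE_DIGITS = re.compile(r"\d{5}")

def find_zip_codes(words, cities=CITIES):
    """Return five-digit words that have a dictionary city directly to their right."""
    results = []
    for candidate in words:
        if not FIVE_DIGITS.fullmatch(candidate.text):
            continue
        # Words on the same line that start to the right of the candidate.
        right_side = [w for w in words
                      if w.line == candidate.line and w.x > candidate.x]
        if not right_side:
            continue
        nearest = min(right_side, key=lambda w: w.x)
        if nearest.text in cities:
            results.append(candidate.text)
    return results

page = [
    Word("Invoice", 10, 0), Word("No.", 80, 0), Word("48213", 120, 0),
    Word("79098", 10, 1), Word("Freiburg", 70, 1),
    Word("Customer", 10, 2), Word("10457", 100, 2),
]
print(find_zip_codes(page))  # prints ['79098']
```

The format alone matches three numbers on this sample page; only the one followed by a city name from the dictionary is kept.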
