Instruction classification, Document separation, Extraction – Kofax Getting Started with Ascent Xtrata Pro User Manual

Page 28: Locators

Advertising

Overview

Ascent Xtrata Pro User's Guide

Instruction Classification

Instruction classification uses explicit rules about a document to classify it. These
rules consist of words and phrases that can be combined using Boolean operations.
Negative instructions can be used to inhibit placing a document into a class. When
used in conjunction with the AFC, these explicit instructions can be used to handle
exceptions.

Document Separation

Ascent Xtrata Pro is capable of separating multi-page .tif images into single
documents or grouping loose pages into multi-page documents.

Although disabled by default, document separation can be enabled as a project-level
setting in Project Builder. A variety of options are available for defining how Ascent
Xtrata Pro Server handles unclassified pages. When the feature is enabled, Ascent
Xtrata Pro Server performs document separation before extraction.

For details about setting up document separation, see Project Builder.

Extraction

Extraction is the act of processing a document, usually with an OCR engine, to
identify information from an image file and preserve that information as text.

For classified documents, a class-specific extraction algorithm is applied to the index
fields for that class. Ascent Xtrata Pro provides several complementary extraction
methods for both finding relevant information in a document, and for filling the
index fields with the extracted items.

Extraction is not performed for unclassified documents.

Locators

Extraction methods, which are called locators, are available as integrated components
that can be configured for any class or at the project level.

Locators are attached to one or more fields that store the results of the locator
algorithm. Locators and fields are inherited by classes in accordance with their
position in the class tree.

Advertising