Defining the document characteristics, Font type – I.R.I.S. Readiris Corporate 12 for Windows User Guide User Manual



All punctuation symbols and special characters at the
beginning and end of words are filtered automatically.

Hyphens inside words are maintained.

E.g. Notre-Dame-de-Paris stays Notre-Dame-de-Paris

Tip: watch out for hyphenation at the end of a line when you import
text files or copy-paste words that cover two lines.

Standalone numbers are rejected. Digits that occur inside a word,
such as a product name, are kept.

E.g. FAT32 stays FAT32

Systolic 150 will become Systolic
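
The filtering rules above can be illustrated with a short Python sketch. This is not Readiris's actual implementation, only a minimal approximation under stated assumptions: words are separated by whitespace, a "special character" is anything that is not a letter or digit, and a "number" is any token that contains no letter. The function names strip_edges and filter_words are hypothetical.

```python
def strip_edges(token: str) -> str:
    """Drop non-alphanumeric characters from both ends of a token.

    Characters inside the token (for example hyphens) are left untouched.
    """
    start, end = 0, len(token)
    while start < end and not token[start].isalnum():
        start += 1
    while end > start and not token[end - 1].isalnum():
        end -= 1
    return token[start:end]


def filter_words(text: str) -> list[str]:
    """Apply the three rules to whitespace-separated tokens (assumed tokenization)."""
    kept = []
    for token in text.split():
        word = strip_edges(token)
        # Reject empty tokens and tokens without any letter (plain numbers).
        if not any(ch.isalpha() for ch in word):
            continue
        # Words that mix letters and digits, such as FAT32, are kept as-is.
        kept.append(word)
    return kept


print(filter_words('"Notre-Dame-de-Paris", FAT32; Systolic 150'))
# prints ['Notre-Dame-de-Paris', 'FAT32', 'Systolic']
```

Note that the sketch does not rejoin words hyphenated across a line break, which is why the tip above matters when importing text files.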

DEFINING THE DOCUMENT CHARACTERISTICS

Besides the document language, other document characteristics such
as the Font type and Character pitch play an important role in the
recognition process.

Font type

Readiris distinguishes between "regular" and dot matrix printed
documents. Dot matrix characters (of the 9-pin type) are made up of
isolated, separate dots.

Recognizing dot matrix documents requires special segmentation and
recognition techniques, which need to be activated.
