Chapter 9 - recognizing documents, Introduction, Hapter – I.R.I.S. Readiris Pro 12 for Windows User Guide User Manual

Page 57: Ecognizing documents

Advertising
background image

Readiris

TM

Pro 12 – User Guide

51

C

HAPTER

9

R

ECOGNIZING DOCUMENTS

I

NTRODUCTION

To recognize documents, Readiris applies linguistics during the
recognition phase. As a result, Readiris recognizes text, tables,
graphics, barcodes and handprinted text in all kinds of documents.
Readiris even copes with complex columnized documents, low-
quality documents, faxes, dot matrix printouts, badly scanned and
copied documents containing too light or dark font shapes, etc.

Readiris supports 128 languages: all American and European
languages are supported, including the Central-European, Baltic and
Cyrillic languages as well as Greek and Turkish. Optionally,
Readiris can read Arabic, Farsi and Hebrew documents and four
Asian languages - Japanese, Simplified and Traditional Chinese and
Korean. Readiris even copes with mixed alphabets: the software
detects “Western” words that occur in Greek, Cyrillic, Arabic,
Hebrew and Asian documents - many untranscribable proper
names, brand names etc. are written using the Western symbols.

Readiris is based on the most advanced recognition technologies.
Font-independent text recognition is complemented by self-learning
techniques. The system is able to learn new characters and words
through contextual and linguistic analysis. This means that the OCR
accuracy of the recognition system will improve as it goes along.

Besides that, Readiris has an (optional) user verification function.
When activated, the user verification function (Interactive
learning
) not only flags the characters the recognition system isn't

Advertising