Plain text, Character sets – Sony PEG-NZ90 User Manual

Page 4

Advertising
background image

Picsel Text File Format Support

Page

4

ePAGE supports the most popular global file formats. The formats

interpreted by ePAGE are richly expressive, containing not just text but

sophisticated layout and rendering features, rich fonts, colour, images,
tables, graphics and many other document features. Picsel is

continuously developing its file format support to eventually cover every
feature of the native file. With such a wealth of features across many

document types, this is inevitably an ongoing process, with milestone
releases of new functionality planned at periodic intervals. The

approach involves researching the feature set most commonly found in
real documents, building support early for the most frequently used

elements, and ensuring these features are reproduced with total
faithfulness to the original. The emphasis of ePAGE is on displaying rich

content rather than on reproducing the document creation facilities of
the original application.


This document describes the features supported in ePAGE. This level of

support already covers the vast majority of characteristics that occur in
day to day documents of this type, and the specific features are

described with notes where appropriate. Those features planned for
future implementation are also described, for completeness.

Plain Text

While there are many document formats which encode extra
information such as page size and pictures into a single file, the most

basic commonly used file format for text is called a plain text file. This
has no special file structure; from the first byte to the last is a seqeuence

of characters. Some of these characters do not appear visually, but
instead take new lines or indent to a tab stop, as defined in the ASCII

standard (ANSI X3.4, initially approved in 1969).

Feature Support

Notes

Plain 7-bit ASCII text

Yes

Latin1 text

Yes

Newlines Yes

Tabs

Yes

Converted to sequences of spaces

Unlimited page size

Yes

Character Sets

ePAGE can work with all characters in the Unicode specification. Text
documents are encoded in other encodings, and ePAGE handles these

by converting Latin1 to UTF-16 Unicode. (In addition to the encoding

Advertising