Data compression considerations, Intelligent data compression – Dell PowerVault LTO4-120HH User Manual

Page 11

Advertising
background image

Data Compression Considerations

In an effective data-compression method, several factors are important:

l

 

The amount of compression, which is measured by the compression ratio. This ratio compares the amount of uncompressed data to the amount of

compressed data. It is obtained by dividing the size of the uncompressed data by the size of the compressed data.

l

 

The speed with which data is compressed and decompressed relative to the host transfer rate.

l

 

The types of data to be compressed.

l

 

The data integrity of the compressed data.

The amount of compression possible in a data stream depends on factors such as:

l

 

Data pattern

l

 

Compression algorithm

l

 

Pattern repetition length

l

 

Pattern repetition frequency

l

 

Object size (block of information to be compressed)

l

 

Starting pattern chosen

The transfer rate depends on factors such as:

l

 

Compression ratio

l

 

Drive buffer size

l

 

Host computer input/output (I/O) speed

l

 

Effective disc speeds of the host computer

l

 

Record lengths that the host computer transmits

Data compression algorithms can be tailored to provide maximum compression for specific types of data. Because varying types of data are encountered in
normal day-to-day operating circumstances, however, an effective data compression method for a tape drive must serve various data types. Additionally, the
data compression method must adapt to different data types, automatically providing optimum handling for all types of data.

 

Intelligent Data Compression

The compressed capacity of the tape is maximized through the use of intelligent data compression. The intelligent data compression hardware determines the
compressibility of each record. If the size of the record is larger after a compression attempt than the native size, then the record is written in its native form.

The intelligent data compression utilizes two compression schemes:

l

 

Scheme-1 is a LZ1-based compression scheme using a history buffer to achieve data compression.

l

 

Scheme-2 is a pass-through compression scheme designed to pass uncompressible data through with minimal expansion.

There are three specific requirements for compliance with the LTO specification:

l

 

The output data stream must be decompressible following LTO rules to create the input sequence of records and file marks perfectly.

l

 

An LTO compressed data stream may not contain any of the eight reserved control symbols.

l

 

While control symbols allow switching to Scheme 2, this should never be used by operational software because this capability is only for diagnostic and

testing purposes.

Do not use software data compression because the built-in intelligent data compression of the PowerVault LTO-4-120 Half-Height tape drive is much more
efficient than software data compression.

The PowerVault LTO-4-120 Half-Height tape drive uses a derivative of ALDC-2 lossless data compression that includes additional control codes for intelligent
data compression.

Back to Contents Page

 

Advertising