Fibonacci Coding Within the Burrows-Wheeler Compression Scheme

  • R. Bastys Vilnius University

Abstract

Burrows-Wheeler data compression algorithm (BWA) is one of the most effective textual data compressors. BWA includes three main iterations: Burrows-Wheeler transform (BWT), Move-To-Front transformation (MTF) and some zeroth order entropy encoder (e.g. Huffman). The paper discusses little investigated scheme when MTF is replaced by the less popular Distance Coding (DC). Some relevant advantages and downsides of such modified scheme are indicated, the most critical being heavy DC output alphabet. It is shown that applying Fibonacci Code instead of entropy encoder elegantly deals with this technical problem. The results we obtain on the Canterbury Corpus text files are very close to the theoretical lower bounds. Our compressor outperforms the most widely used commercial zip archiver and achieves sophisticated BWA implementation bzip2 compression. Ill. 11, bibl. 14, tabl. 1 (in English; abstracts in English, Russian and Lithuanian).

Published
2015-02-19
Section
T 120 SYSTEM ENGINEERING, COMPUTER TECHNOLOGY