J4 ›› 2011, Vol. 29 ›› Issue (5): 473-476.

Previous Articles     Next Articles

Compressed Format Full-Text Index Based on Improved Huffman Code and Its Implement

 KAN Jun-Man   

  1. Department of Computer Science,Jilin Technology and Business College,Changchun 130062,China
  • Online:2011-09-24 Published:2011-11-29

Abstract:

To solve the problem of index structures compression of full-text indexes,we introduce the canonical Huffman code to encode the BWT(Burrows-Wheeler Transform) of a text. In the end, we present an efficient construction algorithm for this index, which is on-line and linear.Experimental results show that, without reducing the efficiency, the effective storage space compression is gained, which improves the effectiveness of the method.

Key words: full-text indexes, compressed, canonical Huffman code

CLC Number: 

  • TP391