J4 ›› 2009, Vol. 47 ›› Issue (05): 1013-1019.

• Computer Science •

Compressed-State Data Access Algorithms for Column-Store Databases

黄鹏, 李占山, 张永刚, 鞠奇   

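  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  • Received: 2009-02-16 Online: 2009-09-26 Published: 2009-11-03
  • Corresponding author: LI Zhanshan E-mail: zslizsli@163.com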

Retrieving Values from Compressed Data inColumnoriented Databases

HUANG Peng, LI Zhanshan, ZHANG Yonggang, JU Qi   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  • Received: 2009-02-16 Online: 2009-09-26 Published: 2009-11-03
  • Contact: LI Zhanshan E-mail: zslizsli@163.com

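Abstract:

Drawing on the data access patterns of column-store databases, we improve the mainstream integer compression formats and propose three integer compression algorithms together with the corresponding algorithms for accessing data in its compressed state. Experimental results show that accessing compressed data directly, without decompressing it, is highly efficient, which provides strong support for performing operations on compressed data.

Keywords: column-store database; data compression; compressed-state data access; compressed-state SQL operations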

Abstract:

Based on the characteristics of data access in column-oriented databases, the formats of prevalent integer compression schemes are improved, and three integer compression algorithms with corresponding algorithms for accessing compressed data are proposed. Experiments show that retrieving values directly from compressed data, without decompression, is highly efficient, which provides strong support for future computation on compressed data.
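To make the idea of reading a value without decompression concrete, here is a minimal sketch using plain fixed-width bit packing. This is a generic textbook technique chosen for illustration, not one of the three algorithms proposed in the paper; the function names `pack` and `get` and the 3-bit width are assumptions introduced here.

```python
def pack(values, width):
    """Pack non-negative integers into bytes, `width` bits per value.

    Illustrative only: a generic fixed-width format, not the paper's.
    """
    packed = 0
    for i, v in enumerate(values):
        if v < 0 or v >= (1 << width):
            raise ValueError(f"value {v} does not fit in {width} bits")
        packed |= v << (i * width)  # value i occupies bits [i*width, (i+1)*width)
    return packed.to_bytes((len(values) * width + 7) // 8, "little")


def get(buf, width, index):
    """Read the value at `index` straight from the packed bytes.

    Only the bytes that overlap the requested value are touched;
    the rest of the column is never decoded.
    """
    bit = index * width
    first, last = bit // 8, (bit + width - 1) // 8
    chunk = int.from_bytes(buf[first:last + 1], "little")
    return (chunk >> (bit % 8)) & ((1 << width) - 1)


column = [3, 0, 7, 5, 1]
buf = pack(column, 3)   # 15 bits of payload stored in 2 bytes
value = get(buf, 3, 3)  # reads position 3 without unpacking the others
```

Because every value has the same width, the position of any entry is computed arithmetically, so a lookup costs the same regardless of the column length.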

Key words: columnoriented database; data compression; , compressed data accessing; compressed SQL operation

CLC Number:

  • TP391