吉林大学学报(工学版) ›› 2016, Vol. 46 ›› Issue (1): 228-234.doi: 10.13229/j.cnki.jdxbgxb201601034

Previous Articles     Next Articles

Document compression scheme based on integer data

TE Ri-gen1, 2, 3, 4, JIANG Sheng1, 2, LI Xiong-fei3, 4, LI Jun3, 4   

  1. 1.Chang Guang Satellite Technology Co.,Ltd.,Changchun 130000,China;
    2.Changchun Institute of Optics,Fine Mechanics and Physics,Chinese academy of Science,Changchun 130033,China;
    3.Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education,Jilin University, Changchun 130012,China;
    4.College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2014-08-24 Online:2016-01-30 Published:2016-01-30

Abstract: A CSN-2 compression algorithm for integer data was proposed and applied to the compression of any documents. Moreover, the CSN-2 data compression algorithm does not need additional data support. A CSNE-2 decompression algorithm, which can properly restore the original data, was proposed by studying the CNS-2 decompression algorithm. It was proved that the results of the decompression algorithm are unique and correct in theoretical and experimental tests. Furthermore, it was demonstrated that the CSN-2 compression algorithm for the integer type of documents has a higher compression ratio, and could compress any documents compared with experiments of other compression programs.

Key words: computer software, data compression, compression, text compression, integer data

CLC Number: 

  • TP301
[1] 杨国为, 涂序彦, 庞杰. 基于虚拟信源的无损数据压缩方法研究[J]. 电子学报, 2003,31(5):728-731.
Yang Guo-wei, Tu Xu-yan, Pang Jie. The research of lossless data compression based on a virtual information source[J]. Acta Electronica Sinica, 2003,31(5):728-731.
[2] 纪震,周家锐,朱泽轩,等. 基于生物信息学特征的DNA 序列数据压缩算法[J]. 电子学报,2011,39(5): 991-995.
Ji Zhen, Zhou Jia-rui, Zhu Ze-xuan, et al. Bioinformatics features based DNA sequence data compression algorithm[J]. Acta Electronica Sinica, 2011,39(5):991-995.
[3] Chu D, Deshpande A, Hellerstein J M, et al. Approximate data collection in sensor networks using probabilistic models[C]∥ICDE '06 Proceedings of the 22nd International Conference on Data Engineering,DC, 2006:48-59.
[4] Najafi H, Lahouti F, Shiva M. AR modeling for temporal extension of correlated sensor network data[C]∥Software in Telecommunications and Computer Networks, Split, 2006:117-120.
[5] Borgne Y L, Bontempi G. Unsupervised and supervised compression with principal component analysis in wireless sensor networks[C]∥Pro of the Workshop on Knowledge Discovery from Data, 13th ACM International Conference on Knowledge Discovery and Data Mining, New York,2007: 94-103.
[6] Ganesan D,Estrin D,Heidemann J.DIMENSIONS: Why do we need a new data handling architecture for sensor networks[J].Acm Sigcomm Computer Communication Review,2003,33(1):143-148.
[7] 郑翠芳. 几种常用无损数据压缩算法研究[J]. 计算机技术与发展, 2011,21(9):73-76.
Zheng Cui-fang. Research of several common lossless data compression algorithms[J]. Computer Technology and Development, 2011,21(9):73-76.
[8] Shannon C E. A mathematical theory of communication[J]. The Bell System Technical Journal,1948,27(7):379-423.
[9] Tsang P, Liu J P, Cheung K. Modern methods for fast generation of digital holograms[J]. 3D Research, 2010,1(2):11-18.
[10] Wu J Z, Wang Y J, Ding L P, et al. Improving performance of network covert timing channel through Huffman coding[J]. Mathematical and Computer Modelling, In Press, Corrected Proof,2011,55(1):69-79.
[11] Jeong J, Jo J M. Adaptive Huffman coding of 2-D DCT coefficients for image sequence compression[J]. Signal Processing: Image Communication, 1995,7(1):1-11.
[12] Rissanen J,Langdon G G.Universal modeling and coding[J].Information Theory,1981,21(1):12-23.
[13] Miguel A,Prieto M, Adiego J. Natural language compression on Edge-Guided text preprocessing[J]. Information Sciences, 2011,181(24):5387-5411.
[14] Freschi V, Bogliolo A.A faster algorithm for the computation of string convolutions using LZ78 parsing[J]. Information Processing Letters, 2010,110(14-15):609-613.
[15] Arroyuelo D, Navarro G. Optimum string match choices in LZSS[J]. Information and Computation, 2011, 209(7):1070-1102.
[16] Lakhani G. Reducing coding redundancy in LZW[J]. Information Sciences, 2006, 176(10) : 1417-1434.
[17] Gödel K. Über formal unentscheidbare Sätze der principia mathematica und verwandter systeme[J]. Mathematics and Statistics, 1931, 38(1): 173-198.
[1] ZHAO Hong-wei, LIU Yu-qi, TE Ri-gen, CHEN Chang-zheng, ZANG Xue-bai. New compression algorithms based on finite sequence [J]. 吉林大学学报(工学版), 2018, 48(3): 882-886.
[2] MA Jian, FAN Jian-ping, LIU Feng, LI Hong-hui. The evolution model of objective-oriented software system [J]. 吉林大学学报(工学版), 2018, 48(2): 545-550.
[3] LIU Yao-hui, CHEN Qiao-xu, SONG Yu-lai, SHEN Yan-dong. Compressive behavior and mechanism of volcanic ash-SBS, rubber powder-SBS and SBS modified asphalt [J]. 吉林大学学报(工学版), 2017, 47(6): 1861-1867.
[4] LUO Yang-xia, GUO Ye. Software recognition based on features of data dependency [J]. 吉林大学学报(工学版), 2017, 47(6): 1894-1902.
[5] LIU Han-bing, ZHANG Hu-zhu, WANG Jing. Effect of dehydration on shear strength properties of compacted clayey soil [J]. 吉林大学学报(工学版), 2017, 47(2): 446-451.
[6] WANG Ke-yan, LI Yun-song, SONG Juan, LIAO Hui-lin, WU Xian-yun. Spatial-spectral lossless compression of hyperspectral images using local edge based prediction [J]. 吉林大学学报(工学版), 2017, 47(2): 677-685.
[7] YING Huan, WANG Dong-hui, WU Cheng-gang, WANG Zhe, TANG Bo-wen, LI Jian-jun. Efficient deterministic replay technique on commodity system environment [J]. 吉林大学学报(工学版), 2017, 47(1): 208-217.
[8] LI Yong, HUANG Zhi-qiu, WANG Yong, FANG Bing-wu. New approach of cross-project defect prediction based on multi-source data [J]. 吉林大学学报(工学版), 2016, 46(6): 2034-2041.
[9] LIU Li-ping, LIU Yong-bing, JI Lian-feng, CAO Zhan-yi, YANG Xiao-hong. Flow stress behavior of in situ particulate reinforced titanium atrix composite at elevated temperature [J]. 吉林大学学报(工学版), 2016, 46(4): 1197-1201.
[10] WANG Nian-bin, ZHU Guan-wen, ZHOU Lian-ke, WANG Hong-wei. Novel dataspace index for efficient processing of path query [J]. 吉林大学学报(工学版), 2016, 46(3): 911-916.
[11] CHEN Mian-shu, WANG Yuan-yuan, SANG Ai-jun, CHEN He-xin. KL transformation based the theory of multidimensional vector matrix [J]. 吉林大学学报(工学版), 2016, 46(2): 627-631.
[12] CHEN Peng-fei, TIAN Di, YANG Guang. Design and implementation of LIBS software based on MVC architecture [J]. 吉林大学学报(工学版), 2016, 46(1): 242-245.
[13] ZHANG Zhi-qiang, JIA Xiao-fei, YUAN Qiu-ju. Springback analysis of trip high strength steel based on Yoshida-Uemori model [J]. 吉林大学学报(工学版), 2015, 45(6): 1852-1856.
[14] LIU Lei, WANG Yan-yan, SHEN Chun, LI Yu-xiang, LIU Lei. Performance portable GPU parallel optimization technique on Bellman-Ford algorithm [J]. 吉林大学学报(工学版), 2015, 45(5): 1559-1564.
[15] FENG Xiao-ning, WANG Zhuo, ZHANG Xu. Formal method for routing protocol of WSN based on L-π calculus [J]. 吉林大学学报(工学版), 2015, 45(5): 1565-1571.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] LIU Song-shan, WANG Qing-nian, WANG Wei-hua, LIN Xin. Influence of inertial mass on damping and amplitude-frequency characteristic of regenerative suspension[J]. 吉林大学学报(工学版), 2013, 43(03): 557 -563 .
[2] CHU Liang, WANG Yan-bo, QI Fu-wei, ZHANG Yong-sheng. Control method of inlet valves for brake pressure fine regulation[J]. 吉林大学学报(工学版), 2013, 43(03): 564 -570 .
[3] LI Jing, WANG Zi-han, YU Chun-xian, HAN Zuo-yue, SUN Bo-hua. Design of control system to follow vehicle state with HIL test beach[J]. 吉林大学学报(工学版), 2013, 43(03): 577 -583 .
[4] HU Xing-jun, LI Teng-fei, WANG Jing-yu, YANG Bo, GUO Peng, LIAO Lei. Numerical simulation of the influence of rear-end panels on the wake flow field of a heavy-duty truck[J]. 吉林大学学报(工学版), 2013, 43(03): 595 -601 .
[5] WANG Tong-jian, CHEN Jin-shi, ZHAO Feng, ZHAO Qing-bo, LIU Xin-hui, YUAN Hua-shan. Mechanical-hydraulic co-simulation and experiment of full hydraulic steering systems[J]. 吉林大学学报(工学版), 2013, 43(03): 607 -612 .
[6] ZHANG Chun-qin, JIANG Gui-yan, WU Zheng-yan. Factors influencing motor vehicle travel departure time choice behavior[J]. 吉林大学学报(工学版), 2013, 43(03): 626 -632 .
[7] MA Wan-jing, XIE Han-zhou. Integrated control of main-signal and pre-signal on approach of intersection with double stop line[J]. 吉林大学学报(工学版), 2013, 43(03): 633 -639 .
[8] YU De-xin, TONG Qian, YANG Zhao-sheng, GAO Peng. Forecast model of emergency traffic evacuation time under major disaster[J]. 吉林大学学报(工学版), 2013, 43(03): 654 -658 .
[9] XIAO Yun, LEI Jun-qing, ZHANG Kun, LI Zhong-san. Fatigue stiffness degradation of prestressed concrete beam under multilevel amplitude cycle loading[J]. 吉林大学学报(工学版), 2013, 43(03): 665 -670 .
[10] XIAO Rui, DENG Zong-cai, LAN Ming-zhang, SHEN Chen-liang. Experiment research on proportions of reactive powder concrete without silica fume[J]. 吉林大学学报(工学版), 2013, 43(03): 671 -676 .