Journal of Jilin University (Information Science Edition) ›› 2022, Vol. 40 ›› Issue (1): 71-76.

Previous Articles     Next Articles

Hierarchical Redundancy Elimination Optimization of Cloud Storage Based on Pearson Correlation Algorithm

YANG Hui#br# #br#   

  1. College of Big Data Engineering, College Kaili University, Kaili 556000, China
  • Received:2021-04-28 Online:2022-01-25 Published:2022-01-29

Abstract: Aiming at the problems of redundancy elimination efficiency and low recall rate in traditional cloud storage, a hierarchical redundancy elimination optimization method based on Pearson correlation algorithm is proposed. According to the attribute distribution similarity measure value of redundant information in the
hierarchical structure, the distance matrix of redundant information is constructed to classify the hierarchical redundant information by calculating the similarity between redundant information. By analyzing the structure of different types of redundant information, using data dimension reduction constraints and central limit principle,the objective function of redundant information feature space compression is constructed, and the hierarchical redundant information features are extracted based on the redundancy optimization hyperplane, the distance between redundant information sample points and positive and negative hyperplanes is calculated. The fuzzy factor is defined by Pearson correlation algorithm, and then the feature effectiveness in the hierarchical structure of cloud storage is defined. The effectiveness function of redundant information features is constructed, and the redundancy optimization of cloud storage hierarchy is realized. Experimental results show that the design method improves the redundancy removal efficiency and has good performance in recall rate.

Key words: Pearson correlation algorithm, cloud storage, hierarchical, redundant information, eliminate redundant optimization

CLC Number: 

  • TP312