Journal of Jilin University Science Edition

Previous Articles     Next Articles

Uncertain Data Generation Algorithm Based on Outlier Factor

LIU Gang1, TANG Dongkai1, WANG Hongmei1, HU Ming2   

  1. 1. School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China;2. School of Computer Technology and Engineering, Changchun Institute of Technology, Changchun 130012, China
  • Received:2017-06-20 Online:2018-07-26 Published:2018-07-31
  • Contact: HU Ming E-mail:huming@ccut.edu.cn

Abstract: Based on the uncertain data representation model, we proposed an uncertain data generation algorithm ACUDGen (attribute level continuous uncertain data set generation algorithm) for attribute level uncertain data. By introducing the outlier detection algorithmLOF (local outlier factor) algorithm, the algorithm used the outlier factor of each data object as the parameter to control the perturbation range of uncertain data objects, which could well satisfy the distribution characteristics of the original data and solve the problem of lack of the distribution characteristics of the original data in the present work. The experimental results show that the uncertain data set generated by the proposed algorithm has a better clustering effect, and reduces the influence of outier on the clustering results, so that the size of each data object MBR (minimum bounding rectangle) can be adaptively changed according to its own distribution characteristics.

Key words: outlier factor; ACUDGen algorithm, uncertain data, representation model

CLC Number: 

  • TP391