J4 ›› 2012, Vol. 50 ›› Issue (06): 1214-1217.

Previous Articles     Next Articles

Outlier Detecting Algorithm Based on Clusteringand Local Information

ZHANG Qiang1, WANG Chunxia2, ZHAO Jian3, WU Longju3, LI Jingyong3   

  1. 1. School of Computer Science, |Baicheng Teachers College, Baicheng 137000, Jilin Province, China;2. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China|3. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2012-01-18 Online:2012-11-26 Published:2012-11-26
  • Contact: LI Jingyong E-mail:lijingyong8888@126.com

Abstract:

Most existing outlier detection algorithms ignore local information of data sets, they are of low accuracy. We adopted a twophase algorithm based on k-means clustering algorithm, defined a new local stray factor as the standard to judge whether data objects are outliers. We also improved the process of detecting outliers and solved the above problem. Experiments show that our algorithm overcomes the shortcomings of existing methods, ensure the algorithm has linear time complexity and is able to find outliers in data sets more accurately and effectively.

Key words: outlier detecting, k-means clustering, local outlier factor

CLC Number: 

  • TP391