吉林大学学报(工学版) ›› 2009, Vol. 39 ›› Issue (06): 1607-1611.

• paper • Previous Articles     Next Articles

Algorithm of decision trees insensitive to data distribution

SUN Tao,LI Xiong-fei,LIU Li-juan   

  1. College of Computer Science and Technology,Jilin University,Changchun 130012,China
  • Received:2008-03-17 Online:2009-11-01 Published:2009-11-01

Abstract:

Traditional decision tree algorithms are sensitive to data distribution. The predictive accuracy of minority class is often decreased when the algorithm deals with skewed datasets. There exist some algorithms which can only handle the skewed datasets with only two kinds of classes. A new decision tree algorithm called DTID is proposed, which is insensitive to data distribution. Using this algorithm new cases of each minority class are generated to adjust the data distribution of the sample set, and the predictive accuracy of each minority class is improved. By adopting the modulus of each case previously, the running time of the algorithm is reduced. Experimental results show that, compared with C4.5 algorithm, the accuracy of DTID is obviously improved and it can obtain much better result even though there are many minority classes in the sample set.

Key words: artiftcial inteleigence, decision tree, skewed datasets, adjust the data distribution, modulusof sample

CLC Number: 

  • TP911 
[1] LI Hai-jian,DONG Hong-hui,SHI Yuan-chao,JIA Li-min,GUO Wei-feng. Vehicle classification with a single magnetic sensor for urban road [J]. 吉林大学学报(工学版), 2015, 45(1): 97-103.
[2] JIANG Lin, LIU Yan-Hang, GUAN Li-Wen, CHENG Ning-Bo. Coordination control design based on decision tree and fuzzy neural network [J]. 吉林大学学报(工学版), 2010, 40(05): 1350-1354.
[3] LIU Yan-Heng, SUN Lei, TIAN Da-Xin, WU Jing. Method of anomaly detection based on fusion principal components match [J]. 吉林大学学报(工学版), 2009, 39(05): 1314-1320.
[4] Qiu Chang-bo, Shi Meng, Li Rui-lu. College of Management, Jilin University, Changchun 130022, China [J]. 吉林大学学报(工学版), 2006, 36(04): 635-638.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!