吉林大学学报(工学版) ›› 2011, Vol. 41 ›› Issue (03): 716-720.

• paper • Previous Articles     Next Articles

Novel spam filtering method based on fuzzy adaptive particle swarm optimization

WANG Gang1,2,LIU Yuan-ning1,2,ZHANG Xiao-xu1,2,ZHAO Zheng-dong3,ZHU Xiao-dong1,2,LIU Zhen1,4   

  1. 1.College of Computer Science and Technology,Jilin University,Changchun 130012,China|2.Ministry Key Laboratory of Symbol Computation and Knowledge Engineering,Jilin University,Changchun 130012,China;3Network Center,Changchun University of Science and Technology,Changchun 130022,China;4.Graduate School of Engineering,Nagasaki Institute of Applied Science,Nagasaki |851-0193,Japan
  • Received:2010-08-11 Online:2011-05-01 Published:2011-05-01

Abstract:

A Novel Spam Filtering Method (NSFM) is proposed, which removes redundant attributes from the high dimensional attributes, and selects the attributes, which contribute to the classification accuracy, thus, to improve the classification rate of spam filtering. A fuzzy adaptive particle swarm algorithm is developed, which can dynamically control the inertia weight, learning factor and particle number factor using fuzzy control. The NSFM consists of three stages, kernel feature selection, feature selection and spam filtering. In the first stage, information gain is employed to calculate the information value of each feature, and construct a kernel feature set, thereby obtaining a number of kernel feature subsets. In the second stage, according to the kernel feature subset, IFAPSO is initialized and adjusted adaptively using the fuzzy controller, thus finishing spam filtering. In the final stage, support vector machine is used to classify the optimal feature subset and finish spam filtering. In this paper, PUl, LingSpam and SpamAssassin data sets are utilized. Through many comparative experiments, it is confirmed that the proposed method is adaptable and can select better feature subsets, thereby enhancing the classification accuracy rate effectively, and building up the performance of spam filtering. The NSFM has important practical value.

Key words: artificial intelligence, feature selection, particle swarm optimization, fuzzy control, spam filtering, support vector machines

CLC Number: 

  • TP18
[1] ZHAO Dong,SUN Ming-yu,ZHU Jin-long,YU Fan-hua,LIU Guang-jie,CHEN Hui-ling. Improved moth-flame optimization method based on combination of particle swarm optimization and simplex method [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1867-1872.
[2] KUI Hai-lin, BAO Cui-zhu, LI Hong-xue, LI Ming-da. Idling time prediction method based on least square support vector machine [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1360-1365.
[3] DONG Sa, LIU Da-you, OUYANG Ruo-chuan, ZHU Yun-gang, LI Li-na. Logistic regression classification in networked data with heterophily based on second-order Markov assumption [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1571-1577.
[4] GU Hai-jun, TIAN Ya-qian, CUI Ying. Intelligent interactive agent for home service [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1578-1585.
[5] LIU Yuan-ning, LIU Shuai, ZHU Xiao-dong, CHEN Yi-hao, ZHENG Shao-ge, SHEN Chun-zhuang. LOG operator and adaptive optimization Gabor filtering for iris recognition [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1606-1613.
[6] ZANG Guo-shuai, SUN Li-jun. Method based on inertial point for setting depth to rigid layer [J]. 吉林大学学报(工学版), 2018, 48(4): 1037-1044.
[7] WANG Xu, OUYANG Ji-hong, CHEN Gui-fen. Measurement of graph similarity based on vertical dimension sequence dynamic time warping method [J]. 吉林大学学报(工学版), 2018, 48(4): 1199-1205.
[8] ZHANG Hao, ZHAN Meng-ping, GUO Liu-xiang, LI Zhi, LIU Yuan-ning, ZHANG Chun-he, CHANG Hao-wu, WANG Zhi-qiang. Human exogenous plant miRNA cross-kingdom regulatory modeling based on high-throughout data [J]. 吉林大学学报(工学版), 2018, 48(4): 1206-1213.
[9] HUANG Hui, FENG Xi-an, WEI Yan, XU Chi, CHEN Hui-ling. An intelligent system based on enhanced kernel extreme learning machine for choosing the second major [J]. 吉林大学学报(工学版), 2018, 48(4): 1224-1230.
[10] HUANG Lan, JI Lin-ying, YAO Gang, ZHAI Rui-feng, BAI Tian. Construction of disease-symptom semantic net for misdiagnosis prompt [J]. 吉林大学学报(工学版), 2018, 48(3): 859-865.
[11] LI Xiong-fei, FENG Ting-ting, LUO Shi, ZHANG Xiao-li. Automatic music composition algorithm based on recurrent neural network [J]. 吉林大学学报(工学版), 2018, 48(3): 866-873.
[12] LIU Jie, ZHANG Ping, GAO Wan-fu. Feature selection method based on conditional relevance [J]. 吉林大学学报(工学版), 2018, 48(3): 874-881.
[13] WANG Xu, OUYANG Ji-hong, CHEN Gui-fen. Heuristic algorithm of all common subsequences of multiple sequences for measuring multiple graphs similarity [J]. 吉林大学学报(工学版), 2018, 48(2): 526-532.
[14] YANG Xin, XIA Si-jun, LIU Dong-xue, FEI Shu-min, HU Yin-ji. Target tracking based on improved accelerated gradient under tracking-learning-detection framework [J]. 吉林大学学报(工学版), 2018, 48(2): 533-538.
[15] LIU Xue-juan, YUAN Jia-bin, XU Juan, DUAN Bo-jia. Quantum k-means algorithm [J]. 吉林大学学报(工学版), 2018, 48(2): 539-544.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!