吉林大学学报(工学版) ›› 2016, Vol. 46 ›› Issue (4): 1209-1215.doi: 10.13229/j.cnki.jdxbgxb201604029

• Orginal Article • Previous Articles     Next Articles

Chinese anaphora resolution based on multi-pass sieve model

ZHOU Xuan-yu1, LIU Juan1, SHAO Peng1, 2, LUO Fei1, LIU Yang1   

  1. 1.School of Computer Science, Wuhan University, Wuhan 430072, China;
    2.State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, China
  • Received:2015-03-11 Online:2016-07-20 Published:2016-07-20

Abstract: Most existing Chinese anaphora resolution models determine whether two mentions are coreferent by a binary classifier. This approach can lead to incorrect decisions as lower precision features often overwhelm the precision features. We propose a modified multi-pass sieve model for Chinese anaphora resolution to adapt to Chinese. We add a new semantic-based sieve to the original model for incorporating word sense information. The Web word sense information is imported to solve resource constraints. Furthermore, we modify the mention detection sieve based on the Chinese characters. The proposed model is evaluated on five different testing methods on the ACE2005 corpus. Results show that the proposed model outperforms two other baseline models by 4% and 9% respectively.

Key words: artificial intelligence, multi-pass sieve model, semantic information, anaphora resolution, natural language processing

CLC Number: 

  • TP391
[1] Hardmeier C,Federico M. Modelling pronominal anaphora in statistical machine translation[C]∥Proceedings of the International Workshop on Spoken Language Translation,Paris,2010:283-289.
[2] Doddington G, Mitchell A, Przybocki M. The automatic content extraction (ACE) program-tasks, data,and evaluation[DB/OL].http:∥www.comp.nus.edu.sg/rpnlpir/proceedings/lrec-2004/pdf/.pdf, 2012-05-11.
[3] Witte R, Krestel R, Bergler S. Context based mult- idocument summarization using fuzzy coreference cluster graphs[DB/OL].http:∥www. nlpir.nist.gov/projects/duc/pubs/2006.papers/20.final.pdf, 2012-05-06.
[4] ning approach to coreference resolution of noun phr- ases[J].Computational Linguistics,2001(4): 521-544.
[5] Raghunathan K, Lee H, Rangarajan S. A multipass sieve for coreference resolution[C]∥Massa-chusetts, MIT, 2010:492-501.
[6] Lee H, Peirsman Y, Chang A, et al. Stanford's multi-pass sieve coreference resolution system at the CoNLL-2011 shared task[C]∥In Proceedings of the Fifteenth Conference on Computational Natural Language Learning:Shared Task,Oregon,2011:28-34.
[7] Zhang Xiao-tian, Wu Chun-yang, Zhao Hai. Chinese coreference resolution via ordered filtering[C]∥In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning: Shared Task, Jeju,2012:95-99.
[8] 孔芳,朱巧明,周国栋,等.中英文指代消解中待消解项识别的研究[J].计算机研究与发展, 2012,49(5):1072-1085.
Kong Fang, Zhu Qiao-ming, Zhou Guo-dong,et al.Anap-horicity determination for coreference resolution in English and Chinese[J]. Journal of Computer Research and Development, 2012,49(5):1072-1085.
[9] 刘群,李素建.基于《知网》的词汇语义相似度计算[EB/OL].[2015-02-14]http:∥www.keenage.com,2013.
[10] Cilibrasi R L, Vitanyi P M. The google similarity distance[J].IEEE Transactions on Knowledge and Data Engineering,2007, 19(3): 370-383.
[11] Marc Vilain, John Burger, John Aberdeen,et al. A model theoretic coreference scoring scheme[C]∥In Proceedings of the 6th Message Understanding Conference,Stroudsburg,1995:45-52.
[12] Amit Bagga, Breck Baldwin. Algorithms for scoring coreference chains[C]∥In Proceedings of LREC,Granada,1998:563-566.
[13] Luo Xiao-qiang. On coreference resolution performance metrics[C]∥In Proceedings of HLT- EMNLP,Stroudsburg,2005:25-32.
[14] Ghosh.Handbook of Data Mining[M].Cleveland CRC Press,2001:247-277.
[15] Marta Recasens,Eduard Hovy.BLANC: Implementing the Rand Index for coreference evalu-ation[J].Natural Language Engineering,2011,17(4):485-510.
[1] DONG Sa, LIU Da-you, OUYANG Ruo-chuan, ZHU Yun-gang, LI Li-na. Logistic regression classification in networked data with heterophily based on second-order Markov assumption [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1571-1577.
[2] GU Hai-jun, TIAN Ya-qian, CUI Ying. Intelligent interactive agent for home service [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1578-1585.
[3] WANG Xu, OUYANG Ji-hong, CHEN Gui-fen. Measurement of graph similarity based on vertical dimension sequence dynamic time warping method [J]. 吉林大学学报(工学版), 2018, 48(4): 1199-1205.
[4] ZHANG Hao, ZHAN Meng-ping, GUO Liu-xiang, LI Zhi, LIU Yuan-ning, ZHANG Chun-he, CHANG Hao-wu, WANG Zhi-qiang. Human exogenous plant miRNA cross-kingdom regulatory modeling based on high-throughout data [J]. 吉林大学学报(工学版), 2018, 48(4): 1206-1213.
[5] HUANG Lan, JI Lin-ying, YAO Gang, ZHAI Rui-feng, BAI Tian. Construction of disease-symptom semantic net for misdiagnosis prompt [J]. 吉林大学学报(工学版), 2018, 48(3): 859-865.
[6] LI Xiong-fei, FENG Ting-ting, LUO Shi, ZHANG Xiao-li. Automatic music composition algorithm based on recurrent neural network [J]. 吉林大学学报(工学版), 2018, 48(3): 866-873.
[7] LIU Jie, ZHANG Ping, GAO Wan-fu. Feature selection method based on conditional relevance [J]. 吉林大学学报(工学版), 2018, 48(3): 874-881.
[8] WANG Xu, OUYANG Ji-hong, CHEN Gui-fen. Heuristic algorithm of all common subsequences of multiple sequences for measuring multiple graphs similarity [J]. 吉林大学学报(工学版), 2018, 48(2): 526-532.
[9] YANG Xin, XIA Si-jun, LIU Dong-xue, FEI Shu-min, HU Yin-ji. Target tracking based on improved accelerated gradient under tracking-learning-detection framework [J]. 吉林大学学报(工学版), 2018, 48(2): 533-538.
[10] LIU Xue-juan, YUAN Jia-bin, XU Juan, DUAN Bo-jia. Quantum k-means algorithm [J]. 吉林大学学报(工学版), 2018, 48(2): 539-544.
[11] QU Hui-yan, ZHAO Wei, QIN Ai-hong. A fast collision detection algorithm based on optimization operator [J]. 吉林大学学报(工学版), 2017, 47(5): 1598-1603.
[12] LI Jia-fei, SUN Xiao-yu. Clustering method for uncertain data based on spectral decomposition [J]. 吉林大学学报(工学版), 2017, 47(5): 1604-1611.
[13] SHAO Ke-yong, CHEN Feng, WANG Ting-ting, WANG Ji-chi, ZHOU Li-peng. Full state based adaptive control of fractional order chaotic system without equilibrium point [J]. 吉林大学学报(工学版), 2017, 47(4): 1225-1230.
[14] WANG Sheng-sheng, WANG Chuang-feng, GU Fang-ming. Spatio-temporal reasoning for OPRA direction relation network [J]. 吉林大学学报(工学版), 2017, 47(4): 1238-1243.
[15] MA Miao, LI Yi-bin. Multi-level image sequences and convolutional neural networks based human action recognition method [J]. 吉林大学学报(工学版), 2017, 47(4): 1244-1252.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] LIU Song-shan, WANG Qing-nian, WANG Wei-hua, LIN Xin. Influence of inertial mass on damping and amplitude-frequency characteristic of regenerative suspension[J]. 吉林大学学报(工学版), 2013, 43(03): 557 -563 .
[2] WANG Tong-jian, CHEN Jin-shi, ZHAO Feng, ZHAO Qing-bo, LIU Xin-hui, YUAN Hua-shan. Mechanical-hydraulic co-simulation and experiment of full hydraulic steering systems[J]. 吉林大学学报(工学版), 2013, 43(03): 607 -612 .
[3] ZHANG Chun-qin, JIANG Gui-yan, WU Zheng-yan. Factors influencing motor vehicle travel departure time choice behavior[J]. 吉林大学学报(工学版), 2013, 43(03): 626 -632 .
[4] XIAO Rui, DENG Zong-cai, LAN Ming-zhang, SHEN Chen-liang. Experiment research on proportions of reactive powder concrete without silica fume[J]. 吉林大学学报(工学版), 2013, 43(03): 671 -676 .
[5] CHEN Si-guo, JIANG Xu, WANG Jian, LIU Yan-heng, DENG Wei-wen, DENG Jun-yi. Mashup of vehicular ad-hoc network and universal mobile telecommunications system[J]. 吉林大学学报(工学版), 2013, 43(03): 706 -710 .
[6] MENG Chao, SUN Zhi-xin, LIU San-min. Multiple execution paths for virus based on cloud computing[J]. 吉林大学学报(工学版), 2013, 43(03): 718 -726 .
[7] XIAN Shu, ZHENG Jin, LU Xing, ZHANG Shi-peng. Identification approach of P2P flow based on the content redistribution model[J]. 吉林大学学报(工学版), 2013, 43(03): 727 -733 .
[8] LYU Yuan-zhi, WANG Shi-gang, YU Jue-qiong, WANG Xiao-yu, LI Xue-song. Display characteristics of one-dimensional integral imaging in virtual mode based on lenticular lens array[J]. 吉林大学学报(工学版), 2013, 43(03): 753 -757 .
[9] WANG Dan, LI Yang, NIAN Gui-jun, WANG Ke. An inhomogeneity mask for spatial watermarking[J]. 吉林大学学报(工学版), 2013, 43(03): 771 -775 .
[10] FENG Lin-han, QIAN Zhi-hong, SHANG Ke-cheng, ZHU Shuang. Improved hidden node collision avoidance strategy based on IEEE802.15.4[J]. 吉林大学学报(工学版), 2013, 43(03): 776 -780 .