Journal of Jilin University Science Edition

Previous Articles     Next Articles

Method of Recognizing Unknown Words by Building SingleWord Dictionary

YU Tong, LIU Shufen   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2014-07-11 Online:2015-03-26 Published:2015-03-24
  • Contact: LIU Shufen E-mail:liusf@jlu.edu.cn

Abstract:

Chinese word segmentation is a very important task in information processing. The present Chinese word segmentation technology mainly relies on commonword dictionary. But the dictionary has no recognition capability for unknown words. The authors brought forth a method of using doubledictionary to recognize unknown words. The process is to build a commonword dictionary and a singleword dictionary, then combine  them for  segmentation, solving the inefficiency in recognizing unknown words. As a result, the accuracy rate can reach above 90%.

Key words: singleword dictionary, unknown words, Chinese word segmentation, doubledictionary

CLC Number: 

  • TP391.12