J4

Previous Articles     Next Articles

Crossing Ambiguity Segmentation Based on Statistical Rules

ZHAI Feng-wen, HE Feng-ling, ZUO Wan-li   

  1. (College of Software, Jilin University, Changchun 130012, China)
  • Received:2005-06-20 Revised:1900-01-01 Online:2006-03-26 Published:2006-03-26
  • Contact: ZUO Wan-li

Abstract: Chinese word segmentation is a base for Chinese Information Processing, and the ambiguity problem is a nodus of Chinese word segmentati on and more then 90% of ambiguity problems are crossing ambiguity, so the solution of the crossing ambiguity problem is an important part of Chinese word segmentation. After repeated experiments and analyses, 5 rules and an algorithm based on these 5 rules were proposed to segment crossing ambiguity. From experiment results, it can be found that the accuracy of DSfenci system we developed based on these 5 rules reaches to 95.22%, which is an excellent experiment result.

Key words: crossing ambiguity, rules, statistics

CLC Number: 

  • TP391.12