Journal of Jilin University Science Edition

Previous Articles     Next Articles

RuleBased Method for Unsupervised PartofSpeech Tagging

PENG Tao1, DAI Yaokang1, ZHU Fengtong1, ZHANG Bangzuo2, LIU Lu1, YAN Zhao1, QIAN Feng1   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;2. School of Computer Science and Information Technology, Northeast Normal University, Changchun 130117, China
  • Received:2014-09-24 Online:2015-09-26 Published:2015-09-29
  • Contact: YAN Zhao E-mail:yanzhao@jlu.edu.cn

Abstract:

A rulebased tagging method for unsupervised partofspeech was proposed. More than 200 grammar rules were used to create 26 kinds of rules functions. After it was preprocessed, the initial tags of words in the input sentence were obtained, the 26 kinds of rules functions were applied to each word to attain all the tags of the input sentence. Experimental results on Brown corpus show that the accuracy of our method is up to 93.95%, thus, our rulebased method is feasible and effective, and improves the accuracy and the simplicity of English partofspeech tagging.

Key words: partofspeech tagging, rulebased, unsupervised learning, rules function

CLC Number: 

  • TP181