J4

• Computer Science

Chinese Text Chunking Based on CRF

XU Zhongyi, HU Qian, LIU Lei   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received: 2006-06-29 Revised: 1900-01-01 Online: 2007-05-26 Published: 2007-05-26
  • Contact: LIU Lei

Abstract: A new method for Chinese text chunking based on the conditional random fields (CRF) model is introduced. Chinese text chunking is transformed into the task of labeling each word with a chunk tag: a CRF model is built from a tagged corpus and then used to predict the chunk tag of each word. An F1 score of 85.5% is achieved on the evaluation dataset of the Chinese treebank of Beijing University, which is clearly better than the scores of the hidden Markov model and the maximum entropy Markov model. Experimental results show that the CRF model is an effective approach to Chinese text chunking and that it avoids both the strict independence assumption and the label bias problem.
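The reduction of chunking to per-word labeling, and the chunk-level F1 score used for evaluation, can be illustrated with a minimal sketch. The abstract does not state which tag scheme or chunk types the authors used, so the B-/I-/O scheme, the NP/VP labels, the example sentence, and all function names below are assumptions made for illustration only. The CRF itself is not implemented here; the sketch covers only the data representation and the scoring.

```python
from typing import List, Set, Tuple

# Hypothetical representation: (chunk type, words); "O" marks words outside any chunk.
Chunk = Tuple[str, List[str]]


def chunks_to_tags(chunks: List[Chunk]) -> List[Tuple[str, str]]:
    """Flatten a chunked sentence into (word, tag) pairs with B-/I-/O tags."""
    tagged = []
    for chunk_type, words in chunks:
        for i, word in enumerate(words):
            if chunk_type == "O":
                tag = "O"
            else:
                tag = ("B-" if i == 0 else "I-") + chunk_type
            tagged.append((word, tag))
    return tagged


def tags_to_spans(tags: List[str]) -> Set[Tuple[int, int, str]]:
    """Recover (start, end, type) chunk spans from tags; end is exclusive."""
    spans, start, current = set(), None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing chunk
        continues = tag.startswith("I-") and current == tag[2:]
        if current is not None and not continues:
            spans.add((start, i, current))
            start, current = None, None
        if tag.startswith("B-") or (tag.startswith("I-") and current is None):
            # A stray I- tag with no open chunk is treated as a chunk start.
            start, current = i, tag[2:]
    return spans


def chunk_f1(gold: List[str], pred: List[str]) -> float:
    """Chunk-level F1: a predicted chunk counts only if span and type both match."""
    gold_spans, pred_spans = tags_to_spans(gold), tags_to_spans(pred)
    correct = len(gold_spans & pred_spans)
    if correct == 0:
        return 0.0
    precision = correct / len(pred_spans)
    recall = correct / len(gold_spans)
    return 2 * precision * recall / (precision + recall)


# Illustrative sentence (not from the paper's corpus).
sentence = [("NP", ["他"]), ("VP", ["喜欢"]), ("NP", ["自然", "语言"]), ("O", ["。"])]
gold_tags = [tag for _, tag in chunks_to_tags(sentence)]
```

In this representation a sequence labeler such as a CRF would be trained to map the word sequence to `gold_tags`, and its predictions would be scored with `chunk_f1` against the gold tags.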

Key words: chunking, conditional random fields, feature function

CLC Number: TP391