J4
• Computer Science •
WU Fenfen1,2, LIU Lei1, XIAO Xian1
Abstract: A heuristic information extraction algorithm is presented, and an information extraction system is built on it. The system first uses the semantic and structural features of the text to determine the states that have distinctive features. Building on this result, the remaining states, which have no such features, are extracted by an algorithm that combines backward dynamic programming with a forward A* search. We tested the system on 100 computer science paper headers provided by the search engine research group at CMU in the USA. The results show that both recall and precision improve substantially over existing methods that are based on words and the traditional Viterbi algorithm. In conclusion, the heuristic algorithm outperforms the Viterbi algorithm.
Key words: heuristic algorithm, text block, A* algorithm
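The abstract does not give the details of the combined search, so the following is only a minimal sketch of one way backward dynamic programming can be paired with a forward A* search over a hidden Markov model. It assumes negative log-probabilities as costs, and it models the states already determined from text features as a `fixed` mapping from position to state. The function `decode`, its parameters, and the toy two-state model are all hypothetical and are not taken from the paper.

```python
import heapq
import math


def cost(p):
    """Negative log-probability; zero probability becomes infinite cost."""
    return -math.log(p) if p > 0 else math.inf


def decode(obs, states, start, trans, emit, fixed=None):
    """Return (total_cost, path) for the cheapest state sequence.

    obs must be non-empty. fixed maps a position to a state that was
    decided beforehand (a hypothetical stand-in for the feature-based pass).
    """
    fixed = fixed or {}
    n = len(obs)
    allowed = [[fixed[t]] if t in fixed else list(states) for t in range(n)]

    # Backward dynamic programming: h[t][s] is the exact cheapest cost of
    # finishing the sequence after being in state s at position t.
    h = [{s: math.inf for s in states} for _ in range(n)]
    for s in allowed[n - 1]:
        h[n - 1][s] = 0.0
    for t in range(n - 2, -1, -1):
        for s in allowed[t]:
            h[t][s] = min(
                cost(trans[s][nxt]) + cost(emit[nxt][obs[t + 1]]) + h[t + 1][nxt]
                for nxt in allowed[t + 1]
            )

    # Forward A*: expand partial paths in increasing order of g + h.
    heap = []
    for s in allowed[0]:
        g = cost(start[s]) + cost(emit[s][obs[0]])
        if g + h[0][s] < math.inf:
            heapq.heappush(heap, (g + h[0][s], g, 0, s, (s,)))
    closed = set()
    while heap:
        f, g, t, s, path = heapq.heappop(heap)
        if t == n - 1:
            return g, list(path)
        if (t, s) in closed:
            continue
        closed.add((t, s))
        for nxt in allowed[t + 1]:
            g2 = g + cost(trans[s][nxt]) + cost(emit[nxt][obs[t + 1]])
            if g2 + h[t + 1][nxt] < math.inf:
                heapq.heappush(
                    heap, (g2 + h[t + 1][nxt], g2, t + 1, nxt, path + (nxt,))
                )
    return math.inf, []


# Toy model with made-up probabilities, for illustration only.
states = ["title", "author"]
start = {"title": 0.9, "author": 0.1}
trans = {
    "title": {"title": 0.6, "author": 0.4},
    "author": {"title": 0.1, "author": 0.9},
}
emit = {
    "title": {"cap": 0.7, "name": 0.3},
    "author": {"cap": 0.2, "name": 0.8},
}
obs = ["cap", "cap", "name", "name"]

free_cost, free_path = decode(obs, states, start, trans, emit)
pinned_cost, pinned_path = decode(obs, states, start, trans, emit, fixed={1: "author"})
```

Because the backward pass yields an exact cost-to-go, the forward search in this sketch expands very few nodes off the optimal path. Pinning a position through `fixed` simply restricts both passes to that state at that position.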
WU Fenfen, LIU Lei, XIAO Xian. A Heuristic Information Extraction Algorithm[J]. J4, 2007, 45(01): 73-76.
URL: http://xuebao.jlu.edu.cn/lxb/EN/
http://xuebao.jlu.edu.cn/lxb/EN/Y2007/V45/I01/73