吉林大学学报(理学版)

• 计算机科学 • 上一篇    下一篇

基于P-集合和FCA的中文领域本体学习方法

侯丽鑫1, 郑山红1, 赵辉1, 董亚则2, 彭馨仪2   

  1. 1. 长春工业大学 计算机科学与工程学院, 长春 130012; 2. 长春工业大学 软件职业技术学院, 长春 130012
  • 收稿日期:2012-09-24 出版日期:2013-07-26 发布日期:2013-08-06
  • 通讯作者: 郑山红 E-mail:bioszsh2007@yahoo.cn

Chinese Domain Ontology Learning Based onP-Sets and Formal Concept Analysis

HOU Lixin1, ZHENG Shanhong1, ZHAO Hui1, DONG Yaze2, PENG Xinyi2   

  1. 1. College of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China;2. College of Software Vocational Technology, Changchun University of Technology, Changchun 130012, China
  • Received:2012-09-24 Online:2013-07-26 Published:2013-08-06
  • Contact: ZHENG Shanhong E-mail:bioszsh2007@yahoo.cn

摘要:

提出一种基于P集合和形式概念分析的中文领域本体学习方法. 该方法以非结构化中文文本为数据源, 通过引入P-集合理论获取形式背景, 在获取形式背景的基础上, 采用Godin]算法构造概念格, 并采用自定义映射规则实现概念格到中文领域本体的映射. 通过学习生物和水领域文本, 得到一个中文领域本体. 实验结果表明, 该方法能解决手工构建本体费时、 费力的问题, 且学习到的本体是形式化本体, 能被更好地共享和重用.

关键词: 领域本体学习, P-集合, 形式概念分析, 概念格

Abstract:

We provided a Chinese domain ontology learning method based on Psets and formal concept analysis which learns from unstructured Chinese texts, extracts the formal background with the P-sets. Then, it uses the Godin algorithm to construct a concept lattice based on the formal background and transforms the concept lattice into the OWL Chinese ontology with the mapping rules that we have defined. At last, we learned from the domain about biology and water via the method we proposed and got the Chinese domain ontology. The results show that the method can solve the problems of timeconsuming and energywasting. In addition, the ontology we got is a formal ontology, so it has more advantages in sharing and reusing.

Key words: domain ontology learning, P-sets, formal concept analysis, concept lattice

中图分类号: 

  • TP39