吉林大学学报(信息科学版) ›› 2014, Vol. 32 ›› Issue (6): 657-663.

• 论文 • 上一篇    下一篇

基于关联规则和语义规则的本体概念提取研究

贺海涛, 郑山红, 李万龙, 彭馨仪   

  1. 长春工业大学 计算机科学与工程学院, 长春 130012
  • 收稿日期:2014-08-26 出版日期:2014-11-25 发布日期:2015-01-09
  • 作者简介:贺海涛(1987—), 男, 湖南永州人, 长春工业大学硕士研究生, 主要从事本体、 智能系统和语义网研究, (Tel)86-18943158138(E-mail)hht12tjl@126.com;通讯作者: 郑山红(1970—), 女(朝鲜族), 长春人, 长春工业大学副教授, 博士, 硕士生导师, 主要从事智能系统与语义网的研究, (Tel)86-13756476636(E-mail)bioszsh2007@aliyun.com。
  • 基金资助:

    吉林省自然科学基金资助项目(20130101060JC); 吉林省教育厅十二五科学技术研究基金资助项目(2014131; 2014125)

Research on Domain Ontology Concept Extraction Based on Association Rules and Semantic Rules

HE Haitao, ZHENG Shanhong, LI Wanlong, PENG Xinyi   

  1. School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
  • Received:2014-08-26 Online:2014-11-25 Published:2015-01-09

摘要:

为解决基于非结构化文本的中文领域本体概念提取效率和准确率不理想的问题, 提出了一种基于关联规则和语义规则的领域本体概念提取方法。利用领域一致性和相关性检查以及关联规则分别获取候选概念和关系集合, 计算候选概念在领域术语关系中的深度和广度, 利用深度和广度信息反馈概念隶属度的思想, 定量分析术语与领域的隶属程度, 进行本体概念的领域隶属度检查, 完成领域本体概念的提取。实验结果表明, 该方法提高了领域本体概念的提取效率和准确率, 具有可行性和合理性, 领域本体概念的提取准确率提高了12%左右。

关键词: 本体概念提取, 关联规则, 语义规则, 领域隶属度检查

Abstract:

In order to solve the problems that extraction efficiency and the accuracy of Chinese domain ontology concept based on unstructured text is not ideal. We present a method of domain ontology concept extraction based on semantic rules and association rules. A set of candidate concepts and relationships are obtained by using field consistency, correlative checks and association rules, and the depth and breadth of relations of every concept in candidate concepts are computed, using the depth and breadth information to feedback the degree of membership between terminology and field, with the way of quantitative analysis to complete the extraction of domain ontology concepts. The experimental results show that this method has feasibility and rationality, the concept of domain ontology extraction accuracy increased by about 12%.

Key words: ontology concept extraction, association rules, semantic rules, domain membership checking

中图分类号: 

  • TP391