Journal of Jilin University (Science Edition)

• Computer Science •

XML Keyword Query Ranking Method Using Semantic Relevancy

LI Rui-xia, SU Shou-bao, ZHOU Xian-cun

  1. School of Information Engineering, West Anhui University, Lu’an 237012, Anhui Province, China
  • Received: 2013-04-13 Online: 2013-11-26 Published: 2013-11-21
  • Contact: LI Rui-xia E-mail: lrx0219@wxc.edu.cn

Abstract:

XML documents are semi-structured, and the traditional tf-idf method considers only how often a keyword occurs in a document while ignoring the semantic information carried by the nodes of the XML document. To address this, we use the vector space model to design a relevance ranking strategy for XML keyword query results. To improve the accuracy of node weighting, the relevance calculation takes into account how well each node distinguishes the document, how explicitly the node describes the document, and how directly the node describes the document, so that the most relevant information is returned to users. Experiments on the DBLP dataset show that the proposed method is effective.

Key words: XML query, semantics, relevance, ranking
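
The abstract does not give the weighting formulas, so the following is only a rough sketch of the general idea: a vector space model in which each term occurrence is scaled by a weight for the node that contains it, followed by cosine ranking. The `NODE_WEIGHT` table, the node labels, the sample records in `DOCS`, and the smoothed idf formula are all assumptions made for illustration. A single per-node number stands in here for the three factors named in the abstract, and this is not the authors' actual scheme.

```python
import math
from collections import Counter

# Hypothetical node weights (assumption): one number per node label stands in
# for the combined effect of the three factors described in the abstract.
NODE_WEIGHT = {"title": 1.0, "author": 0.8, "booktitle": 0.4}
DEFAULT_WEIGHT = 0.1  # assumption: fallback for unlisted node labels

# Made-up sample records, each a list of (node_label, text) pairs.
DOCS = [
    [("title", "xml keyword search"), ("booktitle", "database conference")],
    [("title", "graph mining methods"), ("booktitle", "xml workshop")],
    [("title", "image retrieval"), ("booktitle", "vision conference")],
]

def weighted_tf(doc):
    """Sum node weights per term instead of counting raw occurrences."""
    tf = Counter()
    for label, text in doc:
        weight = NODE_WEIGHT.get(label, DEFAULT_WEIGHT)
        for term in text.lower().split():
            tf[term] += weight
    return tf

def smoothed_idf(term_freqs):
    """Smoothed inverse document frequency (an assumed, common variant)."""
    n = len(term_freqs)
    df = Counter()
    for tf in term_freqs:
        df.update(tf.keys())
    return {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def rank(docs, query):
    """Return document indices ordered from most to least relevant."""
    term_freqs = [weighted_tf(d) for d in docs]
    idf = smoothed_idf(term_freqs)
    vectors = [{t: f * idf[t] for t, f in tf.items()} for tf in term_freqs]
    q = {t: idf.get(t, 0.0) for t in query.lower().split()}
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    # Sort by descending score; ties fall back to the original order.
    return [i for _, i in sorted(scored, key=lambda p: (-p[0], p[1]))]
```

With these made-up weights, `rank(DOCS, "xml")` places the record whose title contains the keyword ahead of the one that mentions it only in `booktitle`, which is the kind of distinction plain term frequency would miss.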

CLC Number:

  • TP301