Journal of Jilin University (Engineering and Technology Edition), 2021, Vol. 51, Issue (5): 1792-1797. doi: 10.13229/j.cnki.jdxbgxb20200484

A new method for link validity assessment based on linked data

Man YUAN, Yun-long JIANG, Chao HU

  1. School of Computer and Information Technology, Northeast Petroleum University, Daqing 163318, China
  • Received: 2020-06-29 Online: 2021-09-01 Published: 2021-09-16

Abstract:

To assess link validity more efficiently and accurately, this paper reviews link validity assessment methods and techniques in China and abroad and finds that link validity is only briefly addressed in parts of the literature. An algorithm for URI validity assessment is therefore proposed, and its validity and efficiency are demonstrated by theoretical analysis. Finally, open data published by DBpedia is used for experimental verification. Compared with the conventional method, the proposed method improves assessment effectiveness by 0.1% and is roughly four times as efficient.

Key words: computer software, linked data, data quality assessment, link validity, data quality

CLC Number: 

  • TP391

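Table 1

Referential symbols and meaning

| Symbol | Meaning |
|---|---|
| Count_null | Number of invalid URIs |
| Count_P | Total number of URIs assessed |
| N | Number of URIs in the dataset |
| α(Hi) | Validity of the URI network protocol; Hi is the protocol part of a URI |
| β(Ai) | Validity of the URI domain name and port; Ai is the domain name and port of a URI |
| γ(Pi) | Validity of the resource path in the URI; Pi is the resource path of a URI |
| ?(U) | URI validity of the linked dataset |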
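
The symbols in Table 1 suggest a staged check of each URI: protocol, then domain name and port, then resource path. The following is a minimal sketch of that idea, not the authors' algorithm. The allowed protocol set, the injected `beta` and `gamma` checkers, and the final ratio `1 - Count_null / Count_P` are all assumptions made for illustration; the excerpt does not state the paper's actual formula.

```python
from urllib.parse import urlsplit

# Assumption: only these protocols count as valid for a linked-data URI.
ALLOWED_SCHEMES = {"http", "https"}


def split_uri(uri):
    """Split a URI into (H, A, P): protocol, domain name and port, resource path."""
    parts = urlsplit(uri)
    return parts.scheme, parts.netloc, parts.path


def alpha(h):
    """Protocol validity: 1 if the scheme is in the allowed set, else 0."""
    return 1 if h.lower() in ALLOWED_SCHEMES else 0


def assess(uris, beta, gamma):
    """Count invalid URIs and return (count_null, count_p, validity_ratio).

    beta(a) and gamma(a, p) are hypothetical caller-supplied checkers for
    host/port validity and path validity; each returns a truthy value when
    the part is valid. Later stages run only if earlier ones pass.
    """
    count_null = 0
    count_p = 0
    for uri in uris:
        h, a, p = split_uri(uri)
        count_p += 1
        if not (alpha(h) and beta(a) and gamma(a, p)):
            count_null += 1
    # Assumed aggregate: share of assessed URIs that are not invalid.
    ratio = 1 - count_null / count_p if count_p else 0.0
    return count_null, count_p, ratio
```

Because `and` short-circuits, a URI with an unsupported protocol never reaches the host or path checks, so the cheapest test runs first. In practice `beta` and `gamma` would wrap real network probes; here they are left to the caller so the sketch runs without network access.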

Fig.1

Process of assessment

Fig.2

Result of assessment

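Table 2

Time of running methods

| Run | Conventional method assessment time/s | Proposed method assessment time/s |
|---|---|---|
| Average | 35 514.6 | 8 894.1 |
| 1 | 35 788.8 | 9 005.0 |
| 2 | 35 945.5 | 8 938.1 |
| 11 | 35 537.2 | 8 807.3 |
| 12 | 35 573.7 | 8 702.6 |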
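
The efficiency claim in the abstract can be checked against the timings in Table 2. The short sketch below divides the conventional time by the proposed method's time for each row shown; the dictionary names and the `speedup` helper are illustrative only, and only the rows present in the table are used.

```python
# Timings in seconds, copied from the rows shown in Table 2.
conventional = {
    "average": 35514.6,
    "run 1": 35788.8,
    "run 2": 35945.5,
    "run 11": 35537.2,
    "run 12": 35573.7,
}
proposed = {
    "average": 8894.1,
    "run 1": 9005.0,
    "run 2": 8938.1,
    "run 11": 8807.3,
    "run 12": 8702.6,
}


def speedup(baseline_s, new_s):
    """Ratio of baseline time to new time; values above 1 mean the new method is faster."""
    return baseline_s / new_s


ratios = {label: speedup(conventional[label], proposed[label]) for label in conventional}

for label, ratio in ratios.items():
    print(f"{label}: {ratio:.2f}x")
```

Every row gives a ratio close to 4, which is consistent with the "four times" figure reported in the abstract.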

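Table 3

Assessment level of detail

| Assessment method | Protocol verification | Server connectivity | Link reachability | Recheck | Multitasking support |
|---|---|---|---|---|---|
| Conventional method | | | | | |
| Proposed method | | | | | |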
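
Two of the criteria in Table 3, rechecking and multitasking support, can be illustrated with a short sketch. This is an assumption-laden example rather than the paper's implementation: `probe` is a hypothetical caller-supplied function that returns True when a URI is reachable, and the retry count and worker count are arbitrary defaults.

```python
from concurrent.futures import ThreadPoolExecutor


def check_with_recheck(uri, probe, retries=2):
    """Return True if probe(uri) succeeds within 1 + retries attempts.

    A failed or erroring attempt is retried, so a transient network
    problem does not immediately mark the URI as invalid.
    """
    for _ in range(1 + retries):
        try:
            if probe(uri):
                return True
        except OSError:
            pass  # treat a network error as one failed attempt
    return False


def assess_parallel(uris, probe, workers=8, retries=2):
    """Check many URIs concurrently and return {uri: is_valid}."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda u: check_with_recheck(u, probe, retries), uris))
    return dict(zip(uris, results))
```

Threads suit this workload because link checking is dominated by waiting on network responses, so several probes can overlap instead of running one after another.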