Paper The following article is Open access

Linked Data Fusion Based on Similarity Calculation and K-Nearest Neighbor

Published under licence by IOP Publishing Ltd
, , Citation Yiming Chen 2022 J. Phys.: Conf. Ser. 2221 012043 DOI 10.1088/1742-6596/2221/1/012043

1742-6596/2221/1/012043

Abstract

The development of semantic web technology supports the continuous development of linked data and its applications. In order to make effective use of the growing linked data on the web, multi-source data should be fused, which is a key step in multi-source large-scale data analysis and management. Currently, merging these data can be difficult, because various sources usually provide multiple conflict descriptions for entities in the same real world. To complete the fusion, we need to solve the problem of object conflict. This paper proposes a linked data fusion method based on similarity calculation and k-nearest neighbor. This method has two contributions. Firstly, a similarity calculation method of linked data is proposed, which can effectively integrate URI nodes and blank nodes in linked data; Secondly, a literal type node fusion strategy based on k-nearest neighbor classification method is proposed, which realizes the automation of fusion and has the independence of data source. The results show that compared with other methods, this method can improve the conciseness and consistency and precision by up to 12.9%, 30.6% and 12%.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/2221/1/012043