Title
Geographic Knowledge Graph Attribute Normalization: Improving the Accuracy by Fusing Optimal Granularity Clustering and Co-Occurrence Analysis
Abstract
Expansion of the entity attribute information of geographic knowledge graphs is essentially the fusion of the Internet's encyclopedic knowledge. However, it lacks structured attribute information, and synonymy and polysemy always exist. These reduce the quality of the knowledge graph and cause incomplete and inaccurate semantic retrieval. Therefore, we normalize the attributes of a geographic knowledge graph based on optimal granularity clustering and co-occurrence analysis, and use structure and the semantic relation of the entity attributes to identify synonymy and correlation between attributes. Specifically: (1) We design a classification system for geographic attributes, that is, using a community discovery algorithm to classify the attribute names. The optimal clustering granularity is identified by the marker target detection algorithm. (2) We complete the fine-grained identification of attribute relations by analyzing co-occurrence relations of the attributes and rule inference. (3) Finally, the performance of the system is verified by manual discrimination using the case of "landscape, forest, field, lake and grass". The results show the following: (1) The average precision of spatial relations was 0.974 and the average recall was 0.937; the average precision of data relations was 0.977 and the average recall was 0.998. (2) The average F1 for similarity results is 0.473; the average F1 for co-occurrence analysis results is 0.735; the average F1 for rule-based modification results is 0.934; the results show that the accuracy is greater than 90%. Compared to traditional methods only focusing on similarity, the accuracy of synonymous attribute recognition improves the system and we are capable of identifying near-sense attributes. Integration of our system and attribute normalization can greatly improve both the processing efficiency and accuracy.
Year
DOI
Venue
2022
10.3390/ijgi11070360
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION
Keywords
DocType
Volume
attribute normalization, geographic knowledge graph, co-occurrence analysis, optimal clustering granularity
Journal
11
Issue
ISSN
Citations 
7
2220-9964
0
PageRank 
References 
Authors
0.34
0
7
Name
Order
Citations
PageRank
Chuan Yin100.34
Binyu Zhang200.34
Wanzeng Liu301.01
Mingyi Du402.70
Nana Luo500.34
Xi Zhai600.34
Tu Ba700.34