Title
Post-analysis of Keyword-Based Search Results Using Entity Mining, Linked Data, and Link Analysis at Query Time
Abstract
The integration of the classical Web (of documents) with the emerging Web of Data is a challenging vision. In this paper we focus on an integration approach during searching which aims at enriching the responses of non-semantic search systems (e.g. professional search systems, web search engines) with semantic information, i.e. Linked Open Data (LOD), and exploiting the outcome for providing an overview of the search space and allowing the users (apart from restricting it) to explore the related LOD. We use named entities (e.g. persons, locations, etc.) as the \"glue\" for automatically connecting search hits with LOD. We consider a scenario where this entity-based integration is performed at query time with no human effort, and no a-priori indexing, which is beneficial in terms of configurability and freshness. To realize this scenario one has to tackle various challenges. One spiny issue is that the number of identified entities can be high, the same is true for the semantic information about these entities that can be fetched from the available LOD (i.e. their properties and associations with other entities). To this end, in this paper we propose a Link Analysis-based method which is used for (a) ranking (and thus selecting to show) the more important semantic information related to the search results, (b) deriving and showing top-K semantic graphs. In the sequel, we report the results of a survey regarding the marine domain with promising results, and comparative results that illustrate the effectiveness of the proposed (Page Rank-based) ranking scheme. Finally, we report experimental results regarding efficiency showing that the proposed functionality can be offered even at query time.
Year
DOI
Venue
2014
10.1109/ICSC.2014.11
Semantic Computing
Keywords
Field
DocType
Internet,data mining,information analysis,query processing,LOD,Page Rank-based ranking scheme,Web of Data,entity mining,keyword-based search results,link analysis,linked open data,named entities,nonsemantic search systems,query time,search space,semantic information,top-K semantic graphs,entity mining,link analysis,linked data,results post-analysis
Data mining,Web search query,Semantic search,Query expansion,Semantic Web Stack,Information retrieval,Computer science,Search engine indexing,Web query classification,Ranking (information retrieval),Concept search
Conference
ISSN
Citations 
PageRank 
2325-6516
8
0.49
References 
Authors
16
2
Name
Order
Citations
PageRank
Pavlos Fafalios115419.76
Yannis Tzitzikas291.18