Facet Annotation Using Reference Knowledge Bases. - Citegraph

Paper Info

Title
Facet Annotation Using Reference Knowledge Bases.

Abstract
Faceted interfaces are omnipresent on the web to support data exploration and filtering. A facet is a triple: a domain (e.g., Book), a property (e.g., author, language), and a set of property values (e.g., Austen, Beauvoir, Coelho, Dostoevsky, Eco, Kerouac, Suskind, ..., French, English, German, Italian, Portuguese, Russian, ... ). Given a property (e.g., language), selecting one or more of its values (English and Italian) returns the domain entities (of type Book) that match the given values (the books that are written in English or Italian). To implement faceted interfaces in a way that is scalable to very large datasets, it is necessary to automate facet extraction. Prior work associates a facet domain with a set of homogeneous values, but does not annotate the facet property. In this paper, we annotate the facet property with a predicate from a reference Knowledge Base (KB) so as to maximize the semantic similarity between the property and the predicate. We define semantic similarity in terms of three new metrics: specificity, coverage, and frequency. Our experimental evaluation uses the DBpedia and YAGO KBs and shows that for the facet annotation problem, we obtain better results than a state-of-the-art approach for the annotation of web tables as modified to annotate a set of values.
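The facet definition and selection behavior described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Facet` class, the `select` function, and the toy `ENTITIES` list are hypothetical names and data introduced here for illustration only. The sketch covers only faceted filtering, not the annotation method or the specificity, coverage, and frequency metrics, which the abstract does not define.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Facet:
    """A facet as a triple: domain, property, and set of property values."""
    domain: str          # e.g. "Book"
    prop: str            # e.g. "language"
    values: frozenset    # e.g. {"English", "Italian", ...}


def select(entities, facet, chosen):
    """Return the domain entities whose property matches any chosen value."""
    unknown = set(chosen) - facet.values
    if unknown:
        raise ValueError(f"values not in facet: {sorted(unknown)}")
    return [
        e for e in entities
        if e["type"] == facet.domain and e.get(facet.prop) in chosen
    ]


# Toy data (assumption, for illustration only).
ENTITIES = [
    {"type": "Book", "title": "Emma", "author": "Austen", "language": "English"},
    {"type": "Book", "title": "Il nome della rosa", "author": "Eco", "language": "Italian"},
    {"type": "Book", "title": "Das Parfum", "author": "Suskind", "language": "German"},
    {"type": "Film", "title": "On the Road", "language": "English"},
]

LANGUAGE = Facet("Book", "language", frozenset({"English", "Italian", "German"}))

# Selecting English and Italian returns the books written in either language.
matches = select(ENTITIES, LANGUAGE, {"English", "Italian"})
titles = [e["title"] for e in matches]
```

Selecting several values of one property is treated as a disjunction here, matching the abstract's "written in English or Italian" example; the film is excluded because it is not in the facet's domain.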

Year	DOI	Venue
2018	10.1145/3178876.3186020	WWW '18: The Web Conference 2018, Lyon, France, April 2018
Keywords	Field	DocType
Facet annotation, data semantics, data lifting, table annotation, faceted search, eCommerce	Semantic similarity, Data mining, Monad (category theory), Annotation, Information retrieval, Faceted search, Computer science, Facet (geometry), Knowledge base, Predicate (grammar), Scalability	Conference
ISBN	Citations	PageRank
978-1-4503-5639-8	0	0.34
References	Authors
29	3

Authors (3 rows)

Name	Order	Citations	PageRank
Riccardo Porrini	1	24	2.88
Matteo Palmonari	2	450	44.73
Isabel F. Cruz	3	1747	296.58
