Title
Tag recommendation for large-scale ontology-based information systems
Abstract
We tackle the problem of improving the relevance of automatically selected tags in large-scale ontology-based information systems. Contrary to traditional settings where tags can be chosen arbitrarily, we focus on the problem of recommending tags (e.g., concepts) directly from a collaborative, user-driven ontology. We compare the effectiveness of a series of approaches to select the best tags ranging from traditional IR techniques such as TF/IDF weighting to novel techniques based on ontological distances and latent Dirichlet allocation. All our experiments are run against a real corpus of tags and documents extracted from the ScienceWise portal, which is connected to ArXiv.org and is currently used by growing number of researchers. The datasets for the experiments are made available online for reproducibility purposes.
Year
DOI
Venue
2012
10.1007/978-3-642-35173-0_22
International Semantic Web Conference
Keywords
Field
DocType
tag recommendation,novel technique,traditional setting,available online,large-scale ontology-based information system,latent dirichlet allocation,idf weighting,traditional ir technique,ontological distance,best tag,sciencewise portal
Information system,Data mining,Ontology,Latent Dirichlet allocation,Weighting,Information retrieval,Computer science,Ranging,Probabilistic latent semantic analysis,Word-sense disambiguation,Database
Conference
Citations 
PageRank 
References 
5
0.39
11
Authors
6
Name
Order
Citations
PageRank
Roman Prokofyev1323.98
Alexey Boyarsky291.81
Oleg Ruchayskiy381.46
Karl Aberer46459662.26
Gianluca Demartini574454.56
o de troyer61708134.92