Abstract | ||
---|---|---|
The World Wide Web, the world's largest resource for information, has evolved from organizing information using controlled, top-down taxonomies to a bottom up approach that emphasizes assigning meaning to data via mechanisms such as the Social Web (Web 2.0). Tagging adds meta-data, (weak semantics) to the content available on the web. This research investigates the potential for repurposing this layer of meta-data. We propose a multi-phase approach that exploits user-defined tags to identify and extract domain-level concepts. We operationalize this approach and assess its feasibility by application to a publicly available tag repository. The paper describes insights gained from implementing and applying the heuristics contained in the approach, as well as challenges and implications of repurposing tags for extraction of domain-level concepts. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1007/978-3-642-22327-3_19 | NLDB |
Keywords | Field | DocType |
user-defined tag,top-down taxonomy,largest resource,social web,multi-phase approach,social tagging data,available tag repository,repurposing tag,weak semantics,world wide web,domain-level concept | Data science,Corporate sustainability,Metadata,World Wide Web,Social web,Repurposing,Computer science,Heuristics,Web 2.0,Operationalization,Semantics | Conference |
Volume | ISSN | Citations |
6716 | 0302-9743 | 1 |
PageRank | References | Authors |
0.37 | 11 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sandeep Purao | 1 | 792 | 124.46 |
Veda C. Storey | 2 | 2796 | 542.19 |
Vijayan Sugumaran | 3 | 975 | 92.17 |
Jordi Conesa | 4 | 165 | 28.98 |
Julià Minguillón | 5 | 92 | 17.93 |
Joan Casas | 6 | 1 | 0.37 |