Abstract |
---|
Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods remains a major hurdle to turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles ("layers") that describe semantic information about the contents of web archives. A semantic layer allows describing metadata about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities and make web archives directly exploitable by other systems and tools. To demonstrate their query capabilities, we build and query semantic layers for three different types of web archives. An experimental evaluation showed that a semantic layer can answer information needs that existing keyword-based systems are not able to sufficiently satisfy. |
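
To make the idea of a semantic layer more concrete, it can be pictured as a set of subject-predicate-object statements about archived documents. The following is a minimal pure-Python sketch under stated assumptions, not the paper's implementation: the `ex:` property names, the document and entity identifiers, and the helpers `match` and `docs_mentioning_type` are all hypothetical, and a real layer would be stored as RDF and would typically be queried with a language such as SPARQL.

```python
from typing import Iterator, List, Optional, Set, Tuple

Triple = Tuple[str, str, str]

# Hypothetical toy layer: every identifier and property name below is
# made up for illustration and is not the vocabulary used in the paper.
TRIPLES: Set[Triple] = {
    ("doc:1", "ex:capturedOn", "2012-03-04"),
    ("doc:1", "ex:mentions", "entity:Angela_Merkel"),
    ("doc:2", "ex:capturedOn", "2014-07-15"),
    ("doc:2", "ex:mentions", "entity:Barack_Obama"),
    ("doc:3", "ex:capturedOn", "2012-09-01"),
    ("doc:3", "ex:mentions", "entity:Berlin"),
    ("entity:Angela_Merkel", "ex:type", "class:Politician"),
    ("entity:Barack_Obama", "ex:type", "class:Politician"),
    ("entity:Berlin", "ex:type", "class:City"),
}


def match(s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> Iterator[Triple]:
    """Yield every triple that fits the pattern; None acts as a wildcard."""
    for ts, tp, to in TRIPLES:
        if s in (None, ts) and p in (None, tp) and o in (None, to):
            yield (ts, tp, to)


def docs_mentioning_type(cls: str, year: str) -> List[str]:
    """Return documents captured in `year` that mention an entity of type `cls`."""
    # Step 1: collect all entities that belong to the requested class.
    entities = {subj for subj, _, _ in match(p="ex:type", o=cls)}
    result = set()
    # Step 2: keep documents that mention one of those entities
    # and whose capture date falls in the requested year.
    for doc, _, entity in match(p="ex:mentions"):
        if entity not in entities:
            continue
        dates = [d for _, _, d in match(s=doc, p="ex:capturedOn")]
        if any(d.startswith(year) for d in dates):
            result.add(doc)
    return sorted(result)


if __name__ == "__main__":
    print(docs_mentioning_type("class:Politician", "2012"))
```

A query such as "documents from 2012 that mention any politician" relies on the entity type rather than on a literal search term, which is the kind of information need the abstract says keyword-based systems do not sufficiently satisfy.
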
Property | Value |
---|---|
Year | 2017 |
DOI | 10.1109/JCDL.2017.7991555 |
Venue | JCDL |
Keywords | semantic layer building, semantic layer querying, Web archiving, information source, RDF/S model, distributed framework, semantic profile building, semantic information, Web archive content, metadata information, Linked Data, structured repository, integration capability, query capability |
Field | World Wide Web, Semantic Web Stack, Information retrieval, Web archiving, Computer science, Web standards, Semantic Web, Data Web, Semantic analytics, Semantic grid, Social Semantic Web |
DocType | Conference |
ISSN | 2575-7865 |
ISBN | 978-1-5386-3861-3 |
Citations | 4 |
PageRank | 0.52 |
References | 22 |
Authors | 4 |

Name | Order | Citations | PageRank |
---|---|---|---|
Pavlos Fafalios | 1 | 154 | 19.76 |
Helge Holzmann | 2 | 70 | 11.16 |
Vaibhav Kasturia | 3 | 5 | 1.21 |
Wolfgang Nejdl | 4 | 6633 | 556.13 |