Title
Accessing Heritage Documents according to Space Criteria within Digital Libraries
Abstract
The purpose of the Virtual Itineraries in the Pyrenees (PIV - Pyrénées Itinéraires Virtuels) project consists in managing a repository of electronic versions of books, newspapers, postcards, lithographs of the XIXth and XXth century. Information is mainly textual and presents many territorial aspects of the Pyrenees (a mountain range in the south west of France) (Casenave & al., 2004). This corpus is still relatively unknown. It is accessible only in regional museums and library archives. This is why the local media library supporting this project aims at the diffusion of these resource collections: their added-value remains centred on local cultural heritage and, therefore, geographic characteristics. To complete statistical and full-text analysis approaches, we propose a more accurate semantic approach to analyze and interpret geographic information contained in such a corpus (or in a query) (Marquesuzaà & al., 2005), (Etcheverry & al., 2005), (Sallaberry & al., 2006). ABSTRACT: Local cultural heritage document collections are characterized by contents strongly attached to a territory and its associated land history. Our contribution aims at enhancing such a content retrieval process efficiently each time a query includes geographic criteria. We propose a unified model for a formal representation of geographic information. This geographic model allows space features to be described independently of their representation mode (text, graphics) in the documents. We have developed a prototype implementing geographic Information Extraction (IE) and geographic Information Retrieval (IR) processes. We process geographic IE with semantic techniques combined to classic IE approaches. Then, we implement geographic IR with intersections researching algorithms: these algorithms search for all geocoded entities in the documents collections indexes which intersect any entity in the user's query. This paper focuses on IR and Visualization proposals relying on the geospatial characteristics of cultural heritage corpora.
Year
Venue
Keywords
2008
JDIM
cultural heritage,geographic information retrieval system,non-structured documents,digital libraries,digital document management keywords: geographic model,geographic information retrieval and visualization,text analysis,digital library,indexation,unified model
Field
DocType
Volume
Geospatial analysis,Data mining,World Wide Web,Geocoding,Cultural heritage,Information retrieval,Computer science,Geographic information retrieval,Local information systems,Information extraction,Digital library,GIS and public health
Journal
6
Issue
Citations 
PageRank 
1
0
0.34
References 
Authors
23
4
Name
Order
Citations
PageRank
Christophe Marquesuzaà174.34
Patrick Etcheverry2246.85
Christian Sallaberry37519.65
Mustapha Baziz4958.17