Abstract | ||
---|---|---|
The purpose of the Virtual Itineraries in the Pyrenees (PIV - Pyrénées Itinéraires Virtuels) project consists in managing a repository of electronic versions of books, newspapers, postcards, lithographs of the XIXth and XXth century. Information is mainly textual and presents many territorial aspects of the Pyrenees (a mountain range in the south west of France) (Casenave & al., 2004). This corpus is still relatively unknown. It is accessible only in regional museums and library archives. This is why the local media library supporting this project aims at the diffusion of these resource collections: their added-value remains centred on local cultural heritage and, therefore, geographic characteristics. To complete statistical and full-text analysis approaches, we propose a more accurate semantic approach to analyze and interpret geographic information contained in such a corpus (or in a query) (Marquesuzaà & al., 2005), (Etcheverry & al., 2005), (Sallaberry & al., 2006). ABSTRACT: Local cultural heritage document collections are characterized by contents strongly attached to a territory and its associated land history. Our contribution aims at enhancing such a content retrieval process efficiently each time a query includes geographic criteria. We propose a unified model for a formal representation of geographic information. This geographic model allows space features to be described independently of their representation mode (text, graphics) in the documents. We have developed a prototype implementing geographic Information Extraction (IE) and geographic Information Retrieval (IR) processes. We process geographic IE with semantic techniques combined to classic IE approaches. Then, we implement geographic IR with intersections researching algorithms: these algorithms search for all geocoded entities in the documents collections indexes which intersect any entity in the user's query. This paper focuses on IR and Visualization proposals relying on the geospatial characteristics of cultural heritage corpora. |
Year | Venue | Keywords |
---|---|---|
2008 | JDIM | cultural heritage,geographic information retrieval system,non-structured documents,digital libraries,digital document management keywords: geographic model,geographic information retrieval and visualization,text analysis,digital library,indexation,unified model |
Field | DocType | Volume |
Geospatial analysis,Data mining,World Wide Web,Geocoding,Cultural heritage,Information retrieval,Computer science,Geographic information retrieval,Local information systems,Information extraction,Digital library,GIS and public health | Journal | 6 |
Issue | Citations | PageRank |
1 | 0 | 0.34 |
References | Authors | |
23 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Christophe Marquesuzaà | 1 | 7 | 4.34 |
Patrick Etcheverry | 2 | 24 | 6.85 |
Christian Sallaberry | 3 | 75 | 19.65 |
Mustapha Baziz | 4 | 95 | 8.17 |