Abstract | ||
---|---|---|
Linking historical datasets and making them available on the Web has increasingly become a subject of research in the field of digital humanities. In this paper, we focus on discovering links between ships from a dataset of Dutch maritime events and a historical archive of newspaper articles. We apply a heuristic-based method for finding and filtering links between ship instances; subsequently, we use machine learning for article classification to be used for enhanced filtering in combination with domain features. We evaluate the resulting links, using manually annotated samples as gold standard. The resulting links are made available as Linked Open Data, thus enriching the original data. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/978-3-319-15168-7_32 | Lecture Notes in Computer Science |
Keywords | Field | DocType |
Text classification,Machine learning,Record linkage,Entity linkage,Historical research,Digital humanities,Digital history | Data mining,Heuristic,Information retrieval,Computer science,Linked data,Newspaper,Comparative historical research,Digital history | Conference |
Volume | ISSN | Citations |
8852 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 18 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Andrea Bravo Balado | 1 | 0 | 0.34 |
Victor de Boer | 2 | 181 | 29.78 |
Guus Schreiber | 3 | 1448 | 150.58 |