Title
Disentangling the Structure of Tables in Scientific Literature.
Abstract
Within the scientific literature, tables are commonly used to present factual and statistical information in a compact way, which is easy to digest by readers. The ability to "understand" the structure of tables is key for information extraction in many domains. However, the complexity and variety of presentation layouts and value formats makes it difficult to automatically extract roles and relationships of table cells. In this paper, we present a model that structures tables in a machine readable way and a methodology to automatically disentangle and transform tables into the modelled data structure. The method was tested in the domain of clinical trials: it achieved an F-score of 94.26% for cell function identification and 94.84% for identification of inter-cell relationships.
Year
DOI
Venue
2016
10.1007/978-3-319-41754-7_14
Lecture Notes in Computer Science
Keywords
Field
DocType
Table mining,Text mining,Data management,Data modelling,Natural language processing
Data mining,Data structure,Data modeling,Scientific literature,Concept mining,Text mining,Computer science,Information extraction,Natural language processing,Artificial intelligence,Data management
Conference
Volume
ISSN
Citations 
9612
0302-9743
1
PageRank 
References 
Authors
0.35
13
4
Name
Order
Citations
PageRank
Nikola Milosevic1312.38
Cassie Gregson261.18
Robert Hernandez310.35
Goran Nenadic422813.18