Abstract | ||
---|---|---|
Densely-packed but structured scientific data are typically presented in the form of tables, which often appear in raster image form. To interpret data from scanned tables, understanding their hierarchical structure is vital. To further address the vast variability of table representations, we propose a fully automatic methodology that uses a bottom-up reasoning that is independent on the existence of representation features, such as lines. We evaluate our approach on the ICDAR 2013 dataset and demonstrate its effectiveness on detecting tables cells and their content and classifying header and data cells. For detecting the cell hierarchy, we demonstrate results on synthetic data due to lack of ground truth. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1109/ICDAR.2017.250 | 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) |
Keywords | Field | DocType |
raster image form,scanned tables,hierarchical structure,table representations,fully automatic methodology,representation features,tables cells,synthetic data,scientific data,ICDAR 2013 dataset,cell hierarchy | Computer vision,Hierarchical cell structure,Raster graphics,Markov process,Pattern recognition,Visualization,Computer science,Feature extraction,Synthetic data,Ground truth,Artificial intelligence,Header | Conference |
Volume | ISSN | ISBN |
02 | 1520-5363 | 978-1-5386-3587-2 |
Citations | PageRank | References |
0 | 0.34 | 6 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Waleed Farrukh | 1 | 2 | 0.71 |
Antonio Foncubierta-Rodriguez | 2 | 156 | 17.13 |
Anca-Nicoleta Ciubotaru | 3 | 0 | 0.68 |
Guillaume Jaume | 4 | 1 | 2.75 |
Costas Bejas | 5 | 0 | 0.34 |
Orcun Goksel | 6 | 183 | 25.45 |
Maria Gabrani | 7 | 2 | 7.92 |