Title
TableBank - Table Benchmark for Image-based Table Detection and Recognition.
Abstract
We present TableBank, a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet. Existing research for image-based table detection and recognition usually fine-tunes pre-trained models on out-of-domain data with a few thousands human labeled examples, which is difficult to generalize on real world applications. With TableBank that contains 417K high-quality labeled tables, we build several strong baselines using state-of-the-art models with deep neural networks. We make TableBank publicly available (https://github.com/doc-analysis/TableBank) and hope it will empower more deep learning approaches in the table detection and recognition task.
Year
Venue
DocType
2020
LREC
Conference
Citations 
PageRank 
References 
0
0.34
0
Authors
6
Name
Order
Citations
PageRank
Minghao Li112.12
Lizhen Cui215438.68
Shaohan Huang35710.29
Furu Wei41956107.57
Ming Zhou54262251.74
Zhoujun Li6964115.99