Title
Rethinking Table Structure Recognition Using Sequence Labeling Methods
Abstract
Table structure recognition is an important task in document analysis and attracts the attention of many researchers. However, due to the diversity of table types and the complexity of table structure, the performances of table structure recognition methods are still not well enough in practice. Row and column separators play a significant role in the two-stage table structure recognition and a better row and column separator segmentation result can improve the final recognition results. Therefore, in this paper, we present a novel deep learning model to detect row and column separators. This model contains a convolution encoder and two parallel row and column decoders. The encoder can extract the visual features by using convolution blocks; the decoder formulates the feature map as a sequence and uses a sequence labeling model, bidirectional long short-term memory networks (BiLSTM) to detect row and column separators. Experiments have been conducted on PubTabNet and the model is benchmarked on several available datasets, including Pub-TabNet, UNLV ICDAR13, ICDAR19. The results show that our model has a state-of-the-art performance than other strong models. In addition, our model shows a better generalization ability. The code is available on this site (www github.com/L597383845/row-col-table-recognition).
Year
DOI
Venue
2021
10.1007/978-3-030-86331-9_35
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II
Keywords
DocType
Volume
Table structure recognition, Encoder-decoder, Row and column separators segmentation, Sequence labeling model
Conference
12822
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
11
8
Name
Order
Citations
PageRank
Yibo Li1276.36
Yilun Huang251.78
Ziyi Zhu300.34
Lemeng Pan400.34
Yongshuai Huang500.34
Lin Du601.35
Zhi Tang725653.42
Liangcai Gao820930.92