Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation - Citegraph

Paper Info

Title
Text recognition in multimedia documents: a study of two neural-based OCRs using and avoiding character segmentation

Abstract
Text embedded in multimedia documents represents an important semantic information that helps to automatically access the content. This paper proposes two neural-based optical character recognition (OCR) systems that handle the text recognition problem in different ways. The first approach segments a text image into individual characters before recognizing them, while the second one avoids the segmentation step by integrating a multi-scale scanning scheme that allows to jointly localize and recognize characters at each position and scale. Some linguistic knowledge is also incorporated into the proposed schemes to remove errors due to recognition confusions. Both OCR systems are applied to caption texts embedded in videos and in natural scene images and provide outstanding results showing that the proposed approaches outperform the state-of-the-art methods.

Year	DOI	Venue
2014	10.1007/s10032-013-0202-7	International Journal on Document Analysis and Recognition
Keywords	Field	DocType
character segmentation,convolutional neural network,language model,ocr	Pattern recognition,Computer science,Segmentation,Convolutional neural network,Optical character recognition,Semantic information,Artificial intelligence,Natural language processing,Multimedia,Machine learning,Language model,Text recognition	Journal
Volume	Issue	ISSN
17	1	1433-2825
Citations	PageRank	References
8	0.52	41
Authors
4

Authors (4 rows)

Cited by (8 rows)

References (41 rows)

Name	Order	Citations	PageRank
Khaoula Elagouni	1	37	2.84
Christophe Garcia	2	35	3.12
Franck Mamalet	3	302	16.35
Pascale Sébillot	4	139	21.00

1