Abstract | ||
---|---|---|
In this paper we propose a novel method based on Recurrent Neural Networks (RNN) for text recognition in Arabic news video frames. In fact, embedded texts in videos represent a rich source of information for indexing and automatic processing of multimedia documents. However, text recognition in video is not trivial due to many challenges like background complexity (e.g., presence of text-like objects), unknown text size/font with various colours and degraded text quality. The proposed system presents a segmentation-free recognition technique (i.e. no prior required segmentation of words into characters) using a RNN architecture. This technique relies specifically on a Multi-Dimensional Long Short Term Memory (MDLSTM) with a Connectionist Temporal Classification (CTC) output layer. Our system has been evaluated on the public AcTiV-R dataset. The obtained results are very promising. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1109/AICCSA.2017.126 | 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA) |
Keywords | Field | DocType |
Arabic video text recognition, RNN, MDLSTM, CTC, AcTiV-R database | Arabic,Computer science,Segmentation,Font,Search engine indexing,Recurrent neural network,Image segmentation,Speech recognition,Real-time computing,Connectionism,Text recognition | Conference |
ISSN | Citations | PageRank |
2161-5322 | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Oussama Zayene | 1 | 7 | 2.82 |
Soumaya Essefi Amamou | 2 | 0 | 0.34 |
Najoua Essoukri Ben Amara | 3 | 209 | 41.48 |