Title
Temporally consistent caption detection in videos using a spatiotemporal 3D method
Abstract
Captions are text or logos superimposed on videos during a postproduction process. Caption detection in videos is useful for a variety of applications. For many applications, temporal consistency and stability is very important. Most of the prior work adopts certain post-processing procedures to smooth detected caption bounding boxes over time. Although these approaches mitigate the effect of the temporal inconsistency problem, they are unable to eliminate the problem. In this paper, we present a new caption detection algorithm that detects the 3D bounding boxes of caption regions in spatiotemporal volume space. 2D bounding boxes are then created by slicing the 3D bounding boxes. Since all the 2D bounding boxes corresponding to a caption area are sliced from one 3D bounding box, they are identical over time, thus ensuring temporal consistency of the result. The experiment results show that our new approach not only generates temporally consistent results but also results in higher detection accuracy.
Year
DOI
Venue
2009
10.1109/ICIP.2009.5413544
ICIP
Keywords
Field
DocType
caption detection,video signal processing,caption regions,experiment result,certain post-processing procedure,spatiotemporal volume space,higher detection accuracy,temporally consistent caption detection,spatiotemporal processing,caption area,temporal inconsistency problem,temporal consistency,spatiotemporal 3d method,video text detection,new caption detection algorithm,logos,logo detection,3d bounding boxes,video postproduction process,new approach,text analysis,caption region,video ocr,pixel,feature extraction
Computer vision,Pattern recognition,Computer science,Slicing,Feature extraction,Pixel,Artificial intelligence,Temporal consistency,Minimum bounding box,Bounding overwatch
Conference
ISSN
ISBN
Citations 
1522-4880 E-ISBN : 978-1-4244-5655-0
978-1-4244-5655-0
0
PageRank 
References 
Authors
0.34
8
3
Name
Order
Citations
PageRank
Dong-Qing Zhang147940.49
Sitaram Bhagavathy21149.82
Joan Llach39910.01