Title
Caption-aided speech detection in videos
Abstract
This paper presents a novel audio-visual fusion method for speech detection, an important front-end for content-based video processing. The approach aims to extract homogeneous speech segments from the accompanying audio stream of real-world movie/TV videos with the help of video captions. Note that captions are created mainly to help viewers follow the dialog, not to locate speech regions accurately. We propose a caption-aided speech detection approach that uses both caption information and audio information. The inaccurate caption positions are refined using audio features (pitch and MFCCs) and BIC-based acoustic change detection. Comparative experiments against several traditional speech detection approaches show that the proposed approach substantially improves speech detection performance.
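The BIC-based acoustic change detection mentioned in the abstract can be illustrated with the standard delta-BIC test on a window of feature frames. The following is a minimal sketch, not the authors' implementation: it assumes each segment is modeled by a single full-covariance Gaussian, that `frames` holds one feature vector (for example MFCCs) per row, and that a positive delta-BIC marks a change. The function names, the `margin` parameter, and the `penalty_weight` default are hypothetical choices made for illustration.

```python
import numpy as np


def _logdet_cov(frames):
    # Log-determinant of the maximum-likelihood covariance of a block of frames.
    cov = np.cov(frames, rowvar=False, bias=True)
    _, logdet = np.linalg.slogdet(np.atleast_2d(cov))
    return logdet


def delta_bic(frames, split, penalty_weight=1.0):
    # Compare "one Gaussian for the whole window" against
    # "one Gaussian before `split` and another after it".
    n, d = frames.shape
    gain = 0.5 * (
        n * _logdet_cov(frames)
        - split * _logdet_cov(frames[:split])
        - (n - split) * _logdet_cov(frames[split:])
    )
    # Penalty for the extra mean and covariance parameters of the second model.
    penalty = 0.5 * penalty_weight * (d + d * (d + 1) / 2) * np.log(n)
    return gain - penalty


def find_change_point(frames, margin=20, penalty_weight=1.0):
    # Return the frame index with the largest positive delta-BIC, or None.
    # `margin` (assumed value) keeps both sub-segments long enough
    # for a stable covariance estimate.
    n = frames.shape[0]
    candidates = range(margin, n - margin)
    scores = [delta_bic(frames, i, penalty_weight) for i in candidates]
    if not scores:
        return None
    best = int(np.argmax(scores))
    return candidates[best] if scores[best] > 0 else None
```

In a caption-aided setting, a detector of this kind could be run on a window around each caption's start and end time so that the boundary snaps to the nearest acoustic change. How the paper combines this with pitch and MFCC features is not specified in this record.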
Year
2008
DOI
10.1109/ICASSP.2008.4517566
Venue
ICASSP
Keywords
speech detection, caption detection, video signal processing, speech processing, Bayesian information criterion (BIC), Bayes methods, speech segment extraction, content-based video processing, audio feature, feature extraction, audio stream, pitch, caption-aided speech detection, audio signal processing, BIC-based acoustic change detection, audio-visual fusion method, signal detection, real-world movie/TV video, change detection, speech segmentation, video processing, front end
Field
Speech processing, Speech coding, Computer science, Artificial intelligence, Audio signal processing, Computer vision, Video processing, Speech analytics, Pattern recognition, Voice activity detection, Audio mining, Speech recognition, Acoustic model
DocType
Conference
ISSN
1520-6149
ISBN
978-1-4244-1484-0
E-ISBN
978-1-4244-1484-0
Citations
2
PageRank
0.40
References
7
Authors
5
Name, Order, Citations, PageRank
Zhijian Ou, 1, 57, 19.29
Zhijian Ou, 2, 57, 19.29
Wei Hu, 3, 182, 14.17
Tao Wang, 4, 238, 23.70
Yimin Zhang, 5, 359, 28.66