Title
Semantic Concept Annotation of Consumer Videos at Frame-Level Using Audio
Abstract
With the increasing use of audio sensors in user generated content UGC collection, semantic concept annotation using audio streams has become an important research problem. Huawei initiates a grand challenge in the International Conference on Multimedia & Expo ICME 2014: Huawei Accurate and Fast Mobile Video Annotation Challenge. In this paper, we present our semantic concept annotation system using audio stream only for the Huawei challenge. The system extracts audio stream from the video data and low-level acoustic features from the audio stream. Bag-of-feature representation is generated based on the low-level features and is used as input feature to train the support vector machine SVM concept classifier. The experimental results show that our audio-only concept annotation system can detect semantic concepts significantly better than random guess. It can also provide important complementary information to the visual-based concept annotation system for performance boost.
Year
DOI
Venue
2014
10.1007/978-3-319-13168-9_12
PCM
Field
DocType
Citations 
User-generated content,Annotation,Information retrieval,Computer science,Support vector machine,Video annotation,Video content analysis,Classifier (linguistics)
Conference
3
PageRank 
References 
Authors
0.43
16
6
Name
Order
Citations
PageRank
Junwei Liang1449.79
Qin Jin263966.86
Xixi He382.26
Gang Yang45315.64
JiePing Xu5459.72
Xirong Li6119168.62