Semantic Concept Annotation of Consumer Videos at Frame-Level Using Audio - Citegraph

Paper Info

Title
Semantic Concept Annotation of Consumer Videos at Frame-Level Using Audio

Abstract
With the increasing use of audio sensors in user generated content UGC collection, semantic concept annotation using audio streams has become an important research problem. Huawei initiates a grand challenge in the International Conference on Multimedia & Expo ICME 2014: Huawei Accurate and Fast Mobile Video Annotation Challenge. In this paper, we present our semantic concept annotation system using audio stream only for the Huawei challenge. The system extracts audio stream from the video data and low-level acoustic features from the audio stream. Bag-of-feature representation is generated based on the low-level features and is used as input feature to train the support vector machine SVM concept classifier. The experimental results show that our audio-only concept annotation system can detect semantic concepts significantly better than random guess. It can also provide important complementary information to the visual-based concept annotation system for performance boost.

Year	DOI	Venue
2014	10.1007/978-3-319-13168-9_12	PCM
Field	DocType	Citations
User-generated content,Annotation,Information retrieval,Computer science,Support vector machine,Video annotation,Video content analysis,Classifier (linguistics)	Conference	3
PageRank	References	Authors
0.43	16	6

Authors (6 rows)

Cited by (3 rows)

References (16 rows)

Name	Order	Citations	PageRank
Junwei Liang	1	44	9.79
Qin Jin	2	639	66.86
Xixi He	3	8	2.26
Gang Yang	4	53	15.64
JiePing Xu	5	45	9.72
Xirong Li	6	1191	68.62

1