Abstract | ||
---|---|---|
With the increasing use of audio sensors in user generated content UGC collection, semantic concept annotation using audio streams has become an important research problem. Huawei initiates a grand challenge in the International Conference on Multimedia & Expo ICME 2014: Huawei Accurate and Fast Mobile Video Annotation Challenge. In this paper, we present our semantic concept annotation system using audio stream only for the Huawei challenge. The system extracts audio stream from the video data and low-level acoustic features from the audio stream. Bag-of-feature representation is generated based on the low-level features and is used as input feature to train the support vector machine SVM concept classifier. The experimental results show that our audio-only concept annotation system can detect semantic concepts significantly better than random guess. It can also provide important complementary information to the visual-based concept annotation system for performance boost. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/978-3-319-13168-9_12 | PCM |
Field | DocType | Citations |
User-generated content,Annotation,Information retrieval,Computer science,Support vector machine,Video annotation,Video content analysis,Classifier (linguistics) | Conference | 3 |
PageRank | References | Authors |
0.43 | 16 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Junwei Liang | 1 | 44 | 9.79 |
Qin Jin | 2 | 639 | 66.86 |
Xixi He | 3 | 8 | 2.26 |
Gang Yang | 4 | 53 | 15.64 |
JiePing Xu | 5 | 45 | 9.72 |
Xirong Li | 6 | 1191 | 68.62 |