Abstract | ||
---|---|---|
Video genre identification methods are frequently based on image or motion analysis, which are relatively time-consuming processes. Since such approaches are tractable by batch processing, as-soon-as-possible identification requires faster methods. In this paper, we investigate the use of audio-only methods for on-the-fly video classification. We propose to use several acoustic feature streams and we evaluate various combination schemes at the frame or at the score level. Results are compared to those obtained by humans, according to the listening duration. Although the system based on model combination slightly outperforms the humans on very soon detection. The latter remain significantly more accurate on long sessions. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/ICASSP.2010.5496233 | 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING |
Keywords | Field | DocType |
Video genre identification, video classification, audio processing, speech processing | Speech processing,Mel-frequency cepstrum,Pattern recognition,Computer science,Search engine indexing,Speech recognition,Feature extraction,Batch processing,Artificial intelligence,Motion analysis,Audio signal processing,Artificial neural network | Conference |
ISSN | Citations | PageRank |
1520-6149 | 5 | 0.55 |
References | Authors | |
8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mickael Rouvier | 1 | 79 | 15.32 |
Georges Linares | 2 | 87 | 19.73 |
Driss Matrouf | 3 | 404 | 41.80 |