Title
RUCMM at MediaEval 2015 Affective Impact of Movies Task: Fusion of Audio and Visual Cues.
Abstract
This paper summarizes our efforts in our first participation in the Violent Scene Detection subtask of the MediaEval 2015 Affective Impact of Movies Task. We build violent scene detectors using both audio and visual cues. In particular, the audio cue is represented by bag-of-audio-words with Fisher vector encoding. The visual cue is exploited by extracting CNN features from video frames. The detectors are implemented using two-class linear SVM classifiers. Evaluation shows that the audio detectors and the visual detectors are comparable and complementary to each other. Among our submissions, multi-modal late fusion leads to the best performance.
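The late fusion step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state how scores are normalized or weighted, so the min-max normalization, the equal weighting, and the names `min_max_normalize` and `late_fusion` are assumptions made for illustration only. The input scores are hypothetical stand-ins for the decision values of the audio and visual SVM detectors.

```python
from typing import List, Sequence


def min_max_normalize(scores: Sequence[float]) -> List[float]:
    """Rescale scores to [0, 1]; a constant input maps to all 0.5."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.5] * len(scores)
    return [(s - lo) / (hi - lo) for s in scores]


def late_fusion(audio_scores: Sequence[float],
                visual_scores: Sequence[float],
                audio_weight: float = 0.5) -> List[float]:
    """Weighted average of per-modality scores (assumed fusion rule)."""
    if len(audio_scores) != len(visual_scores):
        raise ValueError("both modalities must score the same video segments")
    a = min_max_normalize(audio_scores)
    v = min_max_normalize(visual_scores)
    return [audio_weight * x + (1.0 - audio_weight) * y for x, y in zip(a, v)]


# Hypothetical SVM decision values for three video segments.
audio = [-1.0, 0.0, 1.0]
visual = [2.0, -2.0, 0.0]

fused = late_fusion(audio, visual)
# Segment indices ranked from most to least likely violent.
ranking = sorted(range(len(fused)), key=lambda i: fused[i], reverse=True)
```

Because fusion happens on scores rather than on features, each detector can be trained and tuned independently, and `audio_weight` could be chosen on held-out data.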
Year: 2015
Venue: MediaEval
Field: Sensory cue, Computer vision, Computer science, Artificial intelligence, Encoding (memory), Linear SVM
DocType: Conference
Citations: 0
PageRank: 0.34
References: 9
Authors: 7
Name, Order, Citations, PageRank
Qin Jin, 1, 639, 66.86
Xirong Li, 2, 1191, 68.62
Haibing Cao, 3, 0, 0.34
Yujia Huo, 4, 34, 4.41
Shuai Liao, 5, 44, 3.86
Gang Yang, 6, 32, 9.38
JiePing Xu, 7, 45, 9.72