RUCMM at MediaEval 2015 Affective Impact of Movies Task: Fusion of Audio and Visual Cues. - Citegraph

Paper Info

Title
RUCMM at MediaEval 2015 Affective Impact of Movies Task: Fusion of Audio and Visual Cues.

Abstract
This paper summarizes our eorts for the rst time participation in the Violent Scene Detection subtask of the MediaEval 2015 Aective Impact of Movies Task. We build violent scene detectors using both audio and visual cues. In particular, the audio cue is represented by bag-of-audio-words with sher vector encoding. The visual cue is exploited by extracting CNN features from video frames. The detectors are implemented using two-class linear SVM classiers. Evaluation shows that the audio detectors and the visual detectors are comparable and complementary to each other. Among our submissions, multi-modal late fusion leads to the best performance.

Year	Venue	Field
2015	MediaEval	Sensory cue,Computer vision,Computer science,Artificial intelligence,Encoding (memory),Linear svm
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
9	7

Authors (7 rows)

Cited by (0 rows)

References (9 rows)

Name	Order	Citations	PageRank
Qin Jin	1	639	66.86
Xirong Li	2	1191	68.62
Haibing Cao	3	0	0.34
Yujia Huo	4	34	4.41
Shuai Liao	5	44	3.86
Gang Yang	6	32	9.38
JiePing Xu	7	45	9.72

1