Title
IF-MCA: Importance Factor-Based Multiple Correspondence Analysis for Multimedia Data Analytics.
Abstract
Multimedia concept detection is a challenging topic due to the well-known class imbalance issue, where the data instances are distributed unevenly across different classes. This problem becomes even more prominent when the minority class that contains an extremely small proportion of the data represents the concept of interest as has occurred in many real-world applications such as frauds in banking transactions and goal events in soccer videos. Traditional data mining approaches often have difficulty handling largely skewed data distributions. To address this issue, in this paper, an importance-factor (IF)-based multiple correspondence analysis (MCA) framework is proposed to deal with the imbalanced datasets. Specifically, a hierarchical information gain analysis method, which is inspired by the decision tree algorithm, is presented for critical feature selection and IF assignment. Then, the derived IF is incorporated with the MCA algorithm for effective concept detection and retrieval. The comparison results in video concept detection using the disaster dataset and the soccer dataset demonstrate the effectiveness of the proposed framework.
Year
DOI
Venue
2018
10.1109/TMM.2017.2760623
IEEE Trans. Multimedia
Keywords
Field
DocType
Feature extraction,Decision trees,Algorithm design and analysis,Training,Multimedia communication,Data mining,Testing
Data mining,Decision tree,Multiple correspondence analysis,Algorithm design,Feature selection,Data analysis,Computer science,Feature extraction,Analytics,Multimedia,Decision tree learning
Journal
Volume
Issue
ISSN
20
4
1520-9210
Citations 
PageRank 
References 
2
0.36
0
Authors
6
Name
Order
Citations
PageRank
Yimin Yang116211.16
Samira Pouyanfar214113.06
Haiman Tian3878.99
Min Chen420711.80
Shu-Ching Chen51978182.74
Mei-Ling Shyu61863141.25