Title
Bridging low-level features and high-level semantics via fMRI brain imaging for video classification
Abstract
The multimedia content analysis community has made significant efforts to bridge the gap between low-level features and the high-level semantics perceived by human cognitive systems, such as real-world objects and concepts. Low-level features and high-level semantics are both extensively studied in the fields of multimedia analysis and brain imaging. For instance, in multimedia analysis, many algorithms are available for feature extraction, and benchmark datasets such as TRECVID are available; in brain imaging, the brain regions responsible for vision, audition, language, and working memory are well characterized via functional magnetic resonance imaging (fMRI). This paper presents our initial effort to marry these two fields in order to bridge the gap between low-level features and high-level semantics via fMRI brain imaging. In our experimental paradigm, fMRI brain imaging was performed while university student subjects watched video clips selected from the TRECVID datasets. At the current stage, we focus on the three concepts of sports, weather, and commercial/advertisement specified in TRECVID 2005. Meanwhile, the brain regions in the vision, auditory, language, and working memory networks are quantitatively localized and mapped via task-based paradigm fMRI, and the fMRI responses in these regions are used to extract features that represent the brain's comprehension of semantics. Our computational framework learns the low-level feature sets that best correlate with the fMRI-derived semantic features on the training videos with fMRI scans; the learned models are then applied to larger-scale test datasets without fMRI scans for category classification. Our results show that: 1) there are meaningful couplings between the brain's fMRI responses and the video stimuli, supporting the validity of linking semantics and low-level features via fMRI; and 2) the low-level feature sets learned from fMRI-derived semantic features significantly improve the classification of video categories compared with classification based on the original low-level features.
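The abstract does not specify the learning algorithm, so the following is only a minimal sketch of the general idea it describes: rank low-level video features by how strongly they correlate with fMRI-derived semantic features on the scanned training clips, then classify unscanned clips using only the selected features. Pearson correlation for the ranking, an SVM classifier, and all array names and synthetic data (X_train, B_train, y_train, X_test) are assumptions for illustration, not the authors' actual method.

import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X_train = rng.normal(size=(60, 200))    # low-level video features of fMRI-scanned training clips (hypothetical)
B_train = rng.normal(size=(60, 30))     # fMRI-derived semantic features from the localized brain networks (hypothetical)
y_train = rng.integers(0, 3, size=60)   # labels: 0 sports, 1 weather, 2 commercial/advertisement (hypothetical)
X_test = rng.normal(size=(120, 200))    # larger test set without fMRI scans (hypothetical)

def max_abs_corr(X, B):
    # For each low-level feature, the strongest absolute Pearson correlation
    # with any fMRI-derived semantic feature.
    Xz = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-8)
    Bz = (B - B.mean(axis=0)) / (B.std(axis=0) + 1e-8)
    corr = Xz.T @ Bz / X.shape[0]       # (n_low_level, n_fmri) correlation matrix
    return np.abs(corr).max(axis=1)

scores = max_abs_corr(X_train, B_train)
selected = np.argsort(scores)[-50:]     # keep the 50 features most coupled to the fMRI responses

clf = SVC(kernel="rbf").fit(X_train[:, selected], y_train)
predictions = clf.predict(X_test[:, selected])   # category predictions for the unscanned clips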
Year
2010
DOI
10.1145/1873951.1874016
Venue
ACM Multimedia 2010
Keywords
fmri scan, brain imaging, high-level semantics, video classification, fmri brain imaging, task-based paradigm fmri, brain imaging field, brain region, low-level feature, low-level feature set, fmri response, semantics, brain computer interface, feature extraction, working memory
Field
Computer vision, Functional magnetic resonance imaging, TRECVID, Computer science, Brain–computer interface, Working memory, Feature extraction, Artificial intelligence, Natural language processing, Neuroimaging, Perception, Semantics
DocType
Conference
Citations
16
PageRank
0.97
References
14
Authors
17
Name               Order  Citations  PageRank
Xintao Hu              1        118     13.53
Fan Deng               2         96      7.56
Kaiming Li             3        385     30.92
Tuo Zhang              4        233     32.92
Hanbo Chen             5        287     27.40
Xi Jiang               6        311     37.88
Jinglei Lv             7        205     26.70
Dajiang Zhu            8        320     36.72
Carlos Faraco          9        107      7.00
Degang Zhang          10        128     10.01
Arsham Mesbah         11         24      1.89
Junwei Han            12       3501    194.57
Xian-Sheng Hua        13       6566    328.17
Li Xie                14         43      4.86
L. Stephen Miller     15        130      9.26
Lei Guo               16       1661    142.63
Tianming Liu          17       1033    112.95