Title | ||
---|---|---|
Boosting cross-media retrieval via visual-auditory feature analysis and relevance feedback |
Abstract | ||
---|---|---|
Different types of multimedia data express high-level semantics from different aspects. How to learn comprehensive high-level semantics from different types of data and enable efficient cross-media retrieval becomes an emerging hot issue. There are abundant statistical and semantic correlations among heterogeneous low-level media content, which makes it challenging to query cross-media data effectively. In this paper, we propose a new cross-media retrieval method based on short-term and long-term relevance feedback. Our method mainly focuses on two typical types of media data, i.e. image and audio. First, we build multimodal representation via statistical canonical correlation between image and audio feature matrices, and define cross-media distance metric for similarity measure; then we propose optimization strategy based on relevance feedback, which fuses short-term learning results and long-term accumulated knowledge into the objective function. Experiments on image-audio dataset have demonstrated the superiority of our method over several existing algorithms. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1145/2647868.2654975 | ACM Multimedia 2001 |
Keywords | Field | DocType |
cross-media retrieval,feature analysis,relevance feedback,retrieval models | Relevance feedback,Information retrieval,Similarity measure,Computer science,Canonical correlation,Metric (mathematics),Data type,Artificial intelligence,Boosting (machine learning),Semantics,Machine learning,Pattern recognition (psychology) | Conference |
Citations | PageRank | References |
6 | 0.40 | 11 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hong Zhang | 1 | 14 | 4.54 |
Junsong Yuan | 2 | 3703 | 187.68 |
Xingyu Gao | 3 | 17 | 2.29 |
Zhenyu Chen | 4 | 6 | 0.40 |