In-Memory Distributed Indexing for Large-Scale Media Data Retrieval - Citegraph

Paper Info

Title
In-Memory Distributed Indexing for Large-Scale Media Data Retrieval

Abstract
Data retrieval serves a critical role in the development of multimedia applications. However, due to the exponential growth of multimedia data, high-speed and efficient indexing is becoming more and more difficult than ever. In this paper, we propose a novel approach to speed up the retrieval process by adopting a distributed computing paradigm through the Apache Spark framework. Utilizing search trees in a Big Data ecosystem leads to fast and cost-effective media database retrievals by caching indexing structures into memory and aggregating ranked results with flexibilities for users to specify the importance of search cues. We conducted computational experiments on large-scaled vector files for remote sensing image database and synthesized pollen image database to demonstrate the effectiveness and scalability of our system with reasonably high accuracy.

Year	DOI	Venue
2017	10.1109/ISM.2017.38	2017 IEEE International Symposium on Multimedia (ISM)
Keywords	Field	DocType
In-Memory Computing,Big Data,Distributed Indexing,Image Database Retrieval	Vector graphics,Spark (mathematics),Information retrieval,Ranking,Pattern recognition,Computer science,Data retrieval,Search engine indexing,Artificial intelligence,Big data,Scalability,Speedup	Conference
ISBN	Citations	PageRank
978-1-5386-2938-3	0	0.34
References	Authors
0	5

Authors (5 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yinmiao Ma	1	0	0.34
Danlu Liu	2	1	3.06
Grant J. Scott	3	214	22.19
Jeffrey K. Uhlmann	4	2435	263.94
Chi-Ren Shyu	5	656	67.58

1