Title
Content Sifting Storage: Achieving Fast Read for Large-scale Image Dataset Analysis
Abstract
Analyzing large-scale image dataset requires all images to be read from disks first, leading to high read latency. Therefore, we propose a Content Sifting Storage (CSS) system, which aims to reduce the read latency by only reading sifted relevant data. CSS generates embedded content metadata via deep learning and manages the metadata via Semantic Hamming Graph, which achieves fast read based on content similarity meeting the given analysis. Extensive experimental results on image datasets show that compared with conventional semantic storage systems, our CSS can greatly reduce the read latency by 82.21% to 94.8% with more than 98% recall rate.
Year
DOI
Venue
2020
10.1109/DAC18072.2020.9218738
2020 57th ACM/IEEE Design Automation Conference (DAC)
Keywords
DocType
ISSN
Content Sifting Storage,Semantic Hamming Graph,read latency,large-scale image dataset
Conference
0738-100X
ISBN
Citations 
PageRank 
978-1-7281-1085-1
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Yu Liu149230.80
Hong Jiang22137157.96
Yangtao Wang3275.85
Ke Zhou445251.98
Yifei Liu522.05
Li Liu611.38