Content Sifting Storage: Achieving Fast Read for Large-scale Image Dataset Analysis - Citegraph

Paper Info

Title
Content Sifting Storage: Achieving Fast Read for Large-scale Image Dataset Analysis

Abstract
Analyzing large-scale image dataset requires all images to be read from disks first, leading to high read latency. Therefore, we propose a Content Sifting Storage (CSS) system, which aims to reduce the read latency by only reading sifted relevant data. CSS generates embedded content metadata via deep learning and manages the metadata via Semantic Hamming Graph, which achieves fast read based on content similarity meeting the given analysis. Extensive experimental results on image datasets show that compared with conventional semantic storage systems, our CSS can greatly reduce the read latency by 82.21% to 94.8% with more than 98% recall rate.

Year	DOI	Venue
2020	10.1109/DAC18072.2020.9218738	2020 57th ACM/IEEE Design Automation Conference (DAC)
Keywords	DocType	ISSN
Content Sifting Storage,Semantic Hamming Graph,read latency,large-scale image dataset	Conference	0738-100X
ISBN	Citations	PageRank
978-1-7281-1085-1	0	0.34
References	Authors
0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yu Liu	1	492	30.80
Hong Jiang	2	2137	157.96
Yangtao Wang	3	27	5.85
Ke Zhou	4	452	51.98
Yifei Liu	5	2	2.05
Li Liu	6	1	1.38

1