Title
Big Datasets for Research: A Survey on Flagship Conferences
Abstract
It is obvious that big data can bring us new opportunities to discover valuable information. Apparently, corresponding big datasets are powerful tools for scholars, which connect theoretical studies to reality. They can help scholars to evaluate their achievements and find new problems. In recent years, there has been a significant growth in research data repositories and registries. However, these infrastructures are fragmented across institutions, countries and research domains. As such, finding research datasets is not a trivial task for many researchers. Thus we investigated 195 papers regarding big data on some notable international conferences in recent 3 years, and also gathered 285 datasets mentioned in them. In this paper, we present and analyze our survey results in terms of the status quo of big data research and datasets from different aspects. In particular, we propose two different taxonomies of big datasets and classify our surveyed datasets into them. In addition, we also give a brief introduction about 7 widely accepted data collections online. Finally, some basic principles for scholars in choosing and using big datasets are given.
Year
DOI
Venue
2016
10.1109/BigDataCongress.2016.62
2016 IEEE International Congress on Big Data (BigData Congress)
Keywords
Field
DocType
big data,datasets,survey
Data science,Data mining,World Wide Web,Status quo,Computer science,Big data
Conference
ISSN
ISBN
Citations 
2379-7703
978-1-5090-2623-4
0
PageRank 
References 
Authors
0.34
4
6
Name
Order
Citations
PageRank
Yi Wei122.05
Shijun Liu212033.80
Jiao Sun300.34
Li-zhen Cui428271.41
Li Pan53918.95
Lei Wu67317.47