Title | ||
---|---|---|
Why good data analysts need to be critical synthesists. Determining the role of semantics in data analysis. |
Abstract | ||
---|---|---|
In this article, we critically examine the role of semantic technology in data driven analysis. We explain why learning from data is more than just analyzing data, including also a number of essential synthetic parts that suggest a revision of George Boxs model of data analysis in statistics. We review arguments from statistical learning under uncertainty, workflow reproducibility, as well as from philosophy of science, and propose an alternative, synthetic learning model that takes into account semantic conflicts, observation, biased model and data selection, as well as interpretation into background knowledge. The model highlights and clarifies the different roles that semantic technology may have in fostering reproduction and reuse of data analysis across communities of practice under the conditions of informational uncertainty. We also investigate the role of semantic technology in current analysis and workflow tools, compare it with the requirements of our model, and conclude with a roadmap of 8 challenging research problems which currently seem largely unaddressed. We explain why learning from data is more than just analyzing data, including synthetic tasks.We provide arguments from statistical learning, workflow reproducibility, and philosophy.We propose a learning model that highlights the roles of semantic technology in data analysis.Based on this model, we review current analysis and workflow tools and Semantic Web research.We propose a roadmap of 8 challenging research problems which currently seem largely unaddressed. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1016/j.future.2017.02.046 | Future Generation Comp. Syst. |
Keywords | Field | DocType |
Data driven analysis,Learning,Semantic Web,e-Science,Data science | Data science,Semantic technology,Data-driven,Information retrieval,e-Science,Computer science,Semantic Web,Workflow,Semantics,Semantic computing,Semantic data model | Journal |
Volume | Issue | ISSN |
72 | C | 0167-739X |
Citations | PageRank | References |
5 | 0.43 | 48 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Simon Scheider | 1 | 246 | 23.90 |
Frank O. Ostermann | 2 | 112 | 11.15 |
Benjamin Adams | 3 | 73 | 7.78 |