Title
Linking Educational Resources on Data Science
Abstract
The availability of massive datasets in genetics, neuroimaging, mobile health, and other subfields of biology and medicine promises new insights but also poses significant challenges. To realize the potential of big data in biomedicine, the National Institutes of Health launched the Big Data to Knowledge (BD2K) initiative, funding several centers of excellence in biomedical data analysis and a Training Coordinating Center (TCC) tasked with facilitating online and in-person training of biomedical researchers in data science. A major initiative of the BD2K TCC is to automatically identify, describe, and organize data science training resources available on the Web and provide personalized training paths for users. In this paper, we describe the construction of ERuDIte, the Educational Resource Discovery Index for Data Science, and its release as linked data. ERuDIte contains over 11,000 training resources including courses, video tutorials, conference talks, and other materials. The metadata for these resources is described uniformly using Schema.org. We use machine learning techniques to tag each resource with concepts from the Data Science Education Ontology, which we developed to further describe resource content. Finally, we map references to people and organizations in learning resources to entities in DBpedia, DBLP, and ORCID, embedding our collection in the web of linked data. We hope that ERuDIte will provide a framework to foster open linked educational resources on the Web.
Year
DOI
Venue
2019
10.1609/aaai.v33i01.33019404
AAAI
Field
DocType
Volume
Data science,Ontology,Metadata,Educational resources,Computer science,Linked data,Biomedicine,Big data,Excellence
Conference
33
Citations 
PageRank 
References 
1
0.37
0
Authors
5
Name
Order
Citations
PageRank
José Luis Ambite1958110.89
Jonathan Gordon2659.29
Lily Fierro311.72
Gully Burns4233.53
Joel Mathew510.37