Title
A framework for creating knowledge graphs of scientific software metadata
Abstract
An increasing number of researchers rely on computational methods to generate or manipulate the results described in their scientific publications. Software created to this end (scientific software) is key to understanding, reproducing, and reusing existing work in many disciplines, ranging from Geosciences to Astronomy or Artificial Intelligence. However, scientific software is usually challenging to find, set up, and compare to similar software due to its disconnected documentation (dispersed in manuals, readme files, websites, and code comments) and the lack of structured metadata to describe it. As a result, researchers have to manually inspect existing tools to understand their differences and incorporate them into their work. This approach scales poorly with the number of publications and tools made available every year. In this paper, we address these issues by introducing a framework for automatically extracting scientific software metadata from its documentation (in particular, its readme files); a methodology for structuring the extracted metadata in a Knowledge Graph (KG) of scientific software; and an exploitation framework for browsing and comparing the contents of the generated KG. We demonstrate our approach by creating a KG with metadata from over 10,000 scientific software entries from public code repositories.
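To make the extraction-then-structuring idea in the abstract concrete, here is a minimal sketch. It is not the authors' framework: it assumes a Markdown readme, uses a simple heading lookup in place of whatever extraction techniques the paper applies, and the property names, the `software:mytool` identifier, and the helper functions are all hypothetical.

```python
import re

# Hypothetical mapping from readme section headings to metadata properties.
# These names are illustrative only, not taken from the paper.
HEADING_TO_PROPERTY = {
    "installation": "installationInstructions",
    "usage": "usageNotes",
    "citation": "citation",
    "license": "license",
}


def extract_metadata(readme_text):
    """Split a Markdown readme on its headings and keep recognised sections."""
    metadata = {}
    current = None  # property for the section being read, if recognised
    for line in readme_text.splitlines():
        heading = re.match(r"^#+\s*(.+?)\s*$", line)
        if heading:
            current = HEADING_TO_PROPERTY.get(heading.group(1).lower())
            continue
        if current and line.strip():
            previous = metadata.get(current, "")
            metadata[current] = (previous + " " + line.strip()).strip()
    return metadata


def to_triples(software_id, metadata):
    """Express the metadata as (subject, predicate, object) triples."""
    return [(software_id, prop, value) for prop, value in sorted(metadata.items())]


# Toy readme, invented for this example.
README = """# MyTool
A toy tool.

## Installation
pip install mytool

## License
MIT
"""

triples = to_triples("software:mytool", extract_metadata(README))
for triple in triples:
    print(triple)
```

Triples of this shape could then be loaded into a graph store and queried to compare software entries, which is the role the abstract assigns to the KG.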
Year
2021
DOI
10.1162/qss_a_00167
Venue
QUANTITATIVE SCIENCE STUDIES
Keywords
FAIR, knowledge graphs, metadata, metadata extraction, scientific software
DocType
Journal
Volume
2
Issue
4
Citations
1
PageRank
0.38
References
0
Authors
2
Name | Order | Citations | PageRank
Aidan Kelley | 1 | 1 | 0.38
Daniel Garijo | 2 | 1 | 0.38