Title
DaSEA - A Dataset for Software Ecosystem Analysis
Abstract
Software package managers facilitate reuse and rapid construction of software systems. Since ever more software is distributed via package managers, researchers and practitioners require explicit data on the software dependency networks that are opaquely formed by dependency relations between software packages. To reason about increasingly complex software products and ecosystems, researchers and practitioners either rely on publicly available datasets, such as the seemingly unattended libraries.io [14], or mine problem-specific data from software ecosystems repeatedly and non-transparently. Therefore, we present the DaSEA dataset, which contains metadata of software packages, their versions, and dependencies from multiple ecosystems (currently six programming languages and five operating system package managers). Alongside the dataset, we provide an extensible open-source tool of the same name that is used to create updated versions of the DaSEA dataset, allowing studies of the evolution of software ecosystems.
Year
2022
DOI
10.1145/3524842.3528004
Venue
2022 IEEE/ACM 19th International Conference on Mining Software Repositories (MSR)
Keywords
Software Engineering, Dataset, Package Managers, Dependency Networks
DocType
Conference
ISSN
2574-3848
ISBN
978-1-6654-5210-6
Citations
0
PageRank
0.34
References
21
Authors
4
Name                     Order  Citations  PageRank
Petya Buchkova           1      0          0.34
Joakim Hey Hinnerskov    2      0          0.34
Kasper Olsen             3      0          0.34
Rolf-Helge Pfeiffer      4      0          0.34