Title
Eclipse vs. Mozilla: A Comparison of Two Large-Scale Open Source Problem Report Repositories
Abstract
Bug tracking systems play an important role in the development and maintenance of large-scale software systems. Having access to open source bug tracking systems has allowed researchers to take advantage of rich datasets and propose solutions to manage duplicate report classification, developer assignment and quality assessment. In spite of research advances, our understanding of the content of these repositories remains limited, primarily because of their size. In many cases, researchers analyze small portions of datasets thus limiting the understanding of the dynamics of problem reporting. The objective of this study is to explore the properties of two large-scale open source problem report repositories. The Eclipse dataset, at the time of download, consisted of 363; 770 reports spanning 11+ years, whereas Mozilla contained 699; 085 reports spanning 14+ years.Our research examines the evolution of datasets over time by analyzing the changes in the repository and the profiles of users who submit problem reports. We provide quantitative evidence on how submitter's maturity reduces the propensity to submit poor quality, insignificant or duplicate reports. We show that a diverse user base, characteristic of Mozilla, creates challenges for the development team as they spend more time triaging, rather than fixing, issues. Finally, we provide the research community with a series of observations and suggestions on how to study large-scale problem repositories.
Year
DOI
Venue
2015
10.1109/HASE.2015.45
High Assurance Systems Engineering
Keywords
Field
DocType
program debugging,public domain software,software maintenance,eclipse dataset,mozilla characteristic,mozilla dataset,developer assignment,download time,duplicate report classification management,large-scale open source problem report repositories,large-scale software system development,large-scale software system maintenance,open source bug tracking systems,problem reporting dynamics,quality assessment,quantitative analysis,research community,user base,pragmatics,maintenance engineering,computer bugs,software systems
World Wide Web,Duplicate report,Computer science,Software bug,Tracking system,Download,Software system,Eclipse,Maintenance engineering,Limiting,Reliability engineering
Conference
ISSN
Citations 
PageRank 
1530-2059
6
0.46
References 
Authors
17
4
Name
Order
Citations
PageRank
Sean Banerjee19613.42
Jordan Helmick260.46
Zahid Syed3334.19
Bojan Cukic4181297.62