Title
An Empirical Study on Real Bugs for Machine Learning Programs
Abstract
Due to the availability of various open source Machine Learning (ML) tools and libraries, developers nowadays can easily implement their purposes by just invoking machine learning APIs without knowing the details of the algorithm. However, the owners of ML tools and libraries usually pay more attention to the correctness and functionality of their algorithm, while spending much less effort on maintaining their code and keeping their code at a high quality level. Considering the popularity of machine learning in today's world, low quality ML tools and libraries can have a huge impact on the software products that use ML algorithms. So in this paper, we conduct an empirical study on real machine learning bugs to examine their patterns and how they evolve over time. We collect three popular machine learning projects on Github, and manually analyzed 329 closed bugs from the perspectives of their bug category, fix pattern, fix scale, fix duration, and type of software maintenance. The results show that (1) there are seven categories of bugs in machine learning programs; (2) twelve different fix patterns are commonly used to fix the bugs; (3) 63.83% of the patches belong to micro-scale-fix and small-scale-fix, and 68.39% of the bugs are fixed within one month; (4) 47.77% of the bug fixes belong to corrective activity from the view of software maintenance.
Year
DOI
Venue
2017
10.1109/APSEC.2017.41
2017 24th Asia-Pacific Software Engineering Conference (APSEC)
Keywords
Field
DocType
empirical study,machine learning programs,bug,bug fix
Computer science,Software bug,Popularity,Correctness,Software,Artificial intelligence,Software maintenance,Documentation,Empirical research,Maintenance engineering,Machine learning
Conference
ISSN
ISBN
Citations 
1530-1362
978-1-5386-3682-4
6
PageRank 
References 
Authors
0.44
18
6
Name
Order
Citations
PageRank
Xiaobing Sun1447.22
Tianchi Zhou2110.82
Gengjie Li360.44
Jiajun Hu4403.06
Hui Yang5302.82
Bin Li631830.27