Abstract | ||
---|---|---|
Identifying repetitive code changes benefits developers, tool builders, and researchers. Tool builders can automate the popular code changes, thus improving the productivity of developers. Researchers can better understand the practice of code evolution, advancing existing code assistance tools and benefiting developers even further. Unfortunately, existing research either predominantly uses coarse-grained Version Control System (VCS) snapshots as the primary source of code evolution data or considers only a small subset of program transformations of a single kind - refactorings. We present the first approach that identifies previously unknown frequent code change patterns from a fine-grained sequence of code changes. Our novel algorithm effectively handles challenges that distinguish continuous code change pattern mining from the existing data mining techniques. We evaluated our algorithm on 1,520 hours of code development collected from 23 developers, and showed that it is effective, useful, and scales to large amounts of data. We analyzed some of the mined code change patterns and discovered ten popular kinds of high-level program transformations. More than half of our 420 survey participants acknowledged that eight out of ten transformations are relevant to their programming activities. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1145/2568225.2568317 | ICSE |
Keywords | Field | DocType |
code changes,data mining,distribution, maintenance, and enhancement,program transformation | Data science,Static program analysis,Program transformation,Software engineering,Source code,Computer science,Real-time computing,KPI-driven code analysis,Snapshot (computer storage),Code refactoring,Change patterns,Code review | Conference |
Citations | PageRank | References |
35 | 0.85 | 32 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stas Negara | 1 | 218 | 9.23 |
Mihai Codoban | 2 | 113 | 3.92 |
Danny Dig | 3 | 1570 | 79.66 |
Ralph E. Johnson | 4 | 1790 | 264.74 |