Title
TA-RE: an exchange language for mining software repositories
Abstract
Software repositories have been getting a lot of attention from researchers in recent years. In order to analyze software repositories, it is necessary to first extract raw data from the version control and problem tracking systems. This poses two challenges: (1) extraction requires a non-trivial effort, and (2) the results depend on the heuristics used during extraction. These challenges burden researchers who are new to the community and make it difficult to benchmark software repository mining, since it is almost impossible to reproduce experiments done by another team. In this paper we present the TA-RE corpus. TA-RE collects extracted data from software repositories in order to build a collection of projects that will simplify the extraction process. Additionally, the collection can be used for benchmarking. As a first step, we propose an exchange language capable of making sharing and reusing data as simple as possible.
Year
2006
DOI
10.1145/1137983.1137990
Venue
MSR
Keywords
software repository, reusing data, extraction process, extract raw data, challenges burden researcher, benchmark software repository mining, ta-re corpus, non-trivial effort, mining software repository, recent year, exchange language, prediction, version control, analysis, corpus, tracking system
Field
Data mining, Software design description, Software engineering, Software analytics, Package development process, Computer science, Software system, Software verification and validation, Software construction, Software development, Database, Software mining
DocType
Conference
ISBN
1-59593-397-2
Citations
16
PageRank
1.62
References
20
Authors
9
Name                 Order   Citations   PageRank
Sunghun Kim          1       3036        114.11
Thomas Zimmermann    2       5947        271.61
Miryung Kim          3       1856        82.00
Ahmed E. Hassan      4       5959        287.68
Audris Mockus        5       4031        308.78
Tudor Girba          6       729         40.01
Martin Pinzger       7       2147        120.49
E. James Whitehead   8       1794        131.24
Andreas Zeller       9       5697        303.71