Title
Traceability challenge 2011: using TraceLab to evaluate the impact of local versus global IDF on trace retrieval.
Abstract
Numerous trace retrieval algorithms incorporate the standard tf-idf (term frequency, inverse document frequency) technique to weight various terms. In this paper we address Grand Challenge C-GC1 by comparing the effectiveness of computing idf based only on the local terms in the query, versus computing it based on general term usage as documented in the American National Corpus. We also address Grand Challenges L-GC1 and L-GC2 by setting ourselves the additional task of designing and conducting the experiments using the alpha-release of TraceLab. TraceLab is an experimental workbench which allows researchers to graphically model and execute a traceability experiment as a workflow of components. Results of the experiment show that the local idf approach exceeds or matches the global approach in all of the cases studied.
Year
DOI
Venue
2011
10.1145/1987856.1987874
EFSE@ICSE
Keywords
Field
DocType
traceability challenge,experiment show,grand challenge c-gc1,term frequency,traceability experiment,local idf approach,general term usage,global approach,trace retrieval,global idf,local term,inverse document frequency,various term,traceability,graphical model
Workbench,Data mining,tf–idf,Computer science,American National Corpus,Grand Challenges,Workflow,Retrieval algorithm,Traceability
Conference
Citations 
PageRank 
References 
7
0.61
6
Authors
7
Name
Order
Citations
PageRank
Adam Czauderna11387.32
Marek Gibiec2622.35
Greg Leach3744.20
Yubin Li4271.97
Yonghee Shin541116.48
Ed Keenan6975.94
Jane Cleland-Huang72204139.78