Title
Testing noisy numerical data for monotonic association
Abstract
Rank correlation measures are intended to measure to which extent there is a monotonic association between two observables. While they are mainly designed for ordinal data, they are not ideally suited for noisy numerical data. In order to better account for noisy data, a family of rank correlation measures has previously been introduced that replaces classical ordering relations by fuzzy relations with smooth transitions-thereby ensuring that the correlation measure is continuous with respect to the data. The given paper briefly repeats the basic concepts behind this family of rank correlation measures and investigates it from the viewpoint of robust statistics. Then, on this basis, we introduce a framework of novel rank correlation tests. An extensive experimental evaluation using a large number of simulated data sets is presented which demonstrates that the new tests indeed outperform the classical variants in terms of type II error rates without sacrificing good performance in terms of type I error rates. This is mainly due to the fact that the new tests are more robust to noise for small samples. The Gaussian rank correlation estimator turned out to be the best choice in situations where no prior knowledge is available about the data, whereas the new family of robust gamma test provides an advantage in situations where information about the noise distribution is available. An implementation of all robust rank correlation tests used in this paper is available as an R package from the CRAN repository.
Year
DOI
Venue
2013
10.1016/j.ins.2012.11.026
Inf. Sci.
Keywords
Field
DocType
noisy numerical data,correlation measure,rank correlation measure,gaussian rank correlation estimator,noisy data,robust rank correlation test,new test,testing noisy numerical data,simulated data set,ordinal data,novel rank correlation test,monotonic association,robust statistics,rank correlation
Rank correlation,Monotonic function,Data set,Ordinal data,Robust statistics,Gaussian,Artificial intelligence,Type I and type II errors,Mathematics,Machine learning,Estimator
Journal
Volume
ISSN
Citations 
245,
0020-0255
5
PageRank 
References 
Authors
0.59
11
3
Name
Order
Citations
PageRank
Ulrich Bodenhofer170568.02
Martin Krone271.68
Frank Klawonn3705101.95