Title
Randomization Techniques for Data Mining Methods
Abstract
Data mining research has concentrated on inventing novel methods for finding interesting information from large masses of data. This has indeed led to many new computational tasks and some interesting algorithmic developments. However, there has been less emphasis on issues of significance testing of the discovered patterns or models. We discuss the issues in testing the results of data mining methods, and review some of the recent work in the development of scalable algorithmic techniques for randomization tests for data mining methods. We consider suitable null models and generation algorithms for randomization of 0-1 -matrices, arbitrary real valued matrices, and segmentations. We also discuss randomization for database queries.
Year
DOI
Venue
2008
10.1007/978-3-540-85713-6_1
ADBIS (Local Proceedings)
Keywords
Field
DocType
randomization techniques,data mining research,data mining method,scalable algorithmic technique,generation algorithm,database query,interesting algorithmic development,large masse,data mining methods,significance testing,randomization test,interesting information,data mining,null model,generic algorithm,random testing
Data mining,Significance testing,Computer science,Matrix (mathematics),Randomization techniques,Theoretical computer science,Database,Scalability
Conference
Volume
ISSN
Citations 
5207
0302-9743
3
PageRank 
References 
Authors
0.48
4
1
Name
Order
Citations
PageRank
Heikki Mannila165951495.69