Abstract | ||
---|---|---|
This paper presents recurring concept drifts (RCD), a framework that offers an alternative approach to handle data streams that suffer from recurring concept drifts (on-line learning). It creates a new classifier to each context found and stores a sample of data used to build it. When a new concept drift occurs, the algorithm compares the new context to previous ones using a non-parametric multivariate statistical test to verify if both contexts come from the same distribution. If so, the corresponding classifier is reused. The RCD framework is compared with several algorithms (among single and ensemble approaches), in both artificial and real data sets, chosen from frequently used algorithms and data sets in the concept drift research area. We claim the proposed framework had better average ranks in data sets with abrupt and gradual concept drifts compared to both the single classifiers and the ensemble approaches that use the same base learner. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1016/j.patrec.2013.02.005 | Pattern Recognition Letters |
Keywords | Field | DocType |
concept drift research area,concept drift framework,new concept drift,gradual concept,ensemble approach,concept drift,rcd framework,data stream,new classifier,data streams | Data mining,Data stream mining,Data set,Pattern recognition,Computer science,Multivariate statistics,Concept drift,Artificial intelligence,Classifier (linguistics),Statistical hypothesis testing,Machine learning | Journal |
Volume | Issue | ISSN |
34 | 9 | 0167-8655 |
Citations | PageRank | References |
18 | 0.69 | 30 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Paulo Mauricio Goncalves | 1 | 32 | 3.33 |
Roberto Souto Maior de Barros | 2 | 64 | 6.33 |