Complexity of Data Subsets Generated by the Random Subspace Method: An Experimental Investigation - Citegraph

Paper Info

Title
Complexity of Data Subsets Generated by the Random Subspace Method: An Experimental Investigation

Abstract
We report the results from an experimental investigation on the complexity of data subsets generated by the Random Subspace method. The main aim of this study is to analyse the variability of the complexity among the generated subsets. Four measures of complexity have been used, three from [4]: the minimal spanning tree (MST), the adherence subsets measure (ADH), the maximal feature efficiency (MFE); and a cluster label consistency measure (CLC) proposed in [7]. Our results with the UCI "wine" data set relate the variability in data complexity to the number of features used and the presence of redundant features.

Year	DOI	Venue
2001	10.1007/3-540-48219-9_35	Multiple Classifier Systems
Keywords	Field	DocType
minimal spanning tree	Computer science,Random subspace method,Algorithm,Information complexity,Data complexity,Minimum spanning tree,Statistical analysis	Conference
ISBN	Citations	PageRank
3-540-42284-6	5	0.53
References	Authors
5	4

Authors (4 rows)

Cited by (5 rows)

References (5 rows)

Name	Order	Citations	PageRank
Ludmila I. Kuncheva	1	4942	244.34
Fabio Roli	2	4846	311.69
Gian Luca Marcialis	3	774	60.54
Catherine A. Shipp	4	255	11.34

1