Title
A Criterion Based on the Mahalanobis Distance for Cluster Analysis with Subsampling
Abstract
A two-level data set consists of entities of a higher level (say populations), each one being composed of several units of the lower level (say individuals). Observations are made at the individual level, whereas population characteristics are aggregated from individual data. Cluster analysis with subsampling of populations is a cluster analysis based on individual data that aims at clustering populations rather than individuals. In this article, we extend existing optimality criteria for cluster analysis with subsampling of populations to deal with situations where population characteristics are not the mean of individual data. A new criterion that depends on the Mahalanobis distance is also defined. The criteria are compared using simulated examples and an ecological data set of tree species in a tropical rain forest.
Year
DOI
Venue
2012
10.1007/s00357-012-9100-9
J. Classification
Keywords
Field
DocType
Cluster analysis,Mahalanobis distance,Model-based clustering,Species classification,Subsampling,Variance criterion
Econometrics,Variance Criterion,Population,Tree species,Mahalanobis distance,Statistics,Cluster analysis,Mathematics
Journal
Volume
Issue
ISSN
29
1
0176-4268
Citations 
PageRank 
References 
1
0.40
3
Authors
2
Name
Order
Citations
PageRank
Nicolas Picard110.40
Avner Bar-Hen214812.81