Abstract | ||
---|---|---|
In many scientific fields, it is common to encounter raster data sets consisting of categorical data, such as soil type or land usage of a terrain. A problem that arises in the presence of such data is the following: given a raster G of n cells storing categorical data, compute for every cell c in G the number of different categories appearing within a window centered at c. The window can either be a (2r+1)x(2r+1) square or a disk of radius r for a positive integer parameter r. We call this the categorical richness problem, and we present two algorithms for it: one for square windows that runs in O(n) time and one for circular windows that runs in O((1+K/r)n) time, where K is the number of distinct categories in G. The algorithms are not only efficient in theory, but also in practice: our experiments show that our algorithms can handle raster data of hundreds of millions of cells. The categorical richness problem is related to colored range counting, where the goal is to preprocess a colored point set such that we can efficiently count the number of colors appearing inside a query range. We present a data structure for colored range counting in R2 for the case where query ranges are squares. Our structure uses O(n polylog n) storage and has O(polylog n) query time, which is significantly better than what is known for arbitrary rectangular ranges. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1145/2820783.2820825 | SIGSPATIAL/GIS |
Keywords | Field | DocType |
Algorithms for geographic data, raster data sets, patch richness, color counting, topographic heterogeneity | Integer,Raster data,Data structure,Data mining,Discrete mathematics,Colored,Raster graphics,Categorical variable,Terrain,Mathematics,Computation | Conference |
Citations | PageRank | References |
0 | 0.34 | 6 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mark De Berg | 1 | 1497 | 153.24 |
Constantinos Tsirogiannis | 2 | 12 | 5.93 |
Bryan T. Wilkinson | 3 | 43 | 4.18 |