Abstract | ||
---|---|---|
Time series subsequence matching (or signal searching) has importance in a variety of areas in health care informatics. These areas include case-based diagnosis and treatment as well as the discovery of trends and correlations between data. Much of the traditional research in signal searching has focused on high dimensional Ä-NN matching. However, the results of Ä-NN are often small and yield minimal information gain; especially with higher dimensional data. This paper proposes a randomized Monte Carlo sampling method to broaden search criteria such that the query results are an accurate sampling of the complete result set. The proposed method is shown both theoretically and empirically to improve information gain. The number of query results are increased by several orders of magnitude over approximate exact matching schemes and fall within a Gaussian distribution. The proposed method also shows excellent performance as the majority of overhead added by sampling can be mitigated through parallelization. Experiments are run on both simulated and real-world biomedicai datasets. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/BIBM.2012.6392646 | BIBM |
Keywords | DocType | Citations |
randomized Monte Carlo sampling,high dimensional,NN matching,query result,higher dimensional data,approximate exact matching scheme,time series search,time series subsequence matching,accurate sampling,Monte Carlo approach,information gain,proposed method | Conference | 0 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alex Bui | 1 | 318 | 48.20 |
Majid Sarrafzadeh | 2 | 3103 | 317.63 |
Bobak Mortazavi | 3 | 45 | 8.38 |
Jonathan Woodbridge | 4 | 69 | 8.50 |