Abstract | ||
---|---|---|
Outlier detection is crucial in building a highly predictive model. In this study, we proposed an enhanced Monte Carlo outlier detection method by establishing cross-prediction models based on determinate normal samples and analyzing the distribution of prediction errors individually for dubious samples. One simulated and three real datasets were used to illustrate and validate the performance of our method, and the results indicated that this method outperformed Monte Carlo outlier detection in outlier diagnosis. After these outliers were removed, the value of validation by Kovats retention indices and the root mean square error of prediction decreased from 3.195 to 1.655, and the average cross-validation prediction error decreased from 2.0341 to 1.2780. This method helps establish a good model by eliminating outliers. (c) 2015 Wiley Periodicals, Inc. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1002/jcc.24026 | JOURNAL OF COMPUTATIONAL CHEMISTRY |
Keywords | Field | DocType |
outlier detection,enhanced Monte Carlo outlier detection,validation | Anomaly detection,Data mining,Monte Carlo method,Mean squared prediction error,Computer science,Mean squared error,Outlier,Kovats retention index,Datasets as Topic | Journal |
Volume | Issue | ISSN |
36.0 | 25 | 0192-8651 |
Citations | PageRank | References |
0 | 0.34 | 4 |
Authors | ||
6 |