Abstract | ||
---|---|---|
Data mining aims to find patterns in organizational databases. However, most techniques in mining do not consider knowledge of the quality of the database. In this work, we show how to incorporate into classification mining recent advances in the data quality field that view a database as the product of an imprecise manufacturing process where the flaws/defects are captured in quality matrices. We develop a general purpose method of incorporating data quality matrices into the data mining classification task. Our work differs from existing data preparation techniques since while other approaches detect and fix errors to ensure consistency with the entire data set our work makes use of the apriori knowledge of how the data is produced/manufactured. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1016/j.ejor.2008.07.019 | European Journal of Operational Research |
Keywords | Field | DocType |
Data manufacturing,Data quality,Data preparation,Application of data mining | Data field,Data mining,Data processing,Data stream mining,Data quality,A priori and a posteriori,Data pre-processing,Information extraction,Data integrity,Mathematics | Journal |
Volume | Issue | ISSN |
197 | 2 | 0377-2217 |
Citations | PageRank | References |
3 | 0.49 | 10 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ian Davidson | 1 | 1274 | 77.79 |
Giri Kumar Tayi | 2 | 303 | 54.82 |