Abstract | ||
---|---|---|
Standard Support Vector Machines (SVM) often performs poorly on imbalanced datasets, because it could not get a high accuracy of prediction on the minority class of data as well as another class. We proposed a new example dependent costs SVM method, from which it can get more sensitive hyperplane by selecting penalty for every sample according to its corresponding distribution. Firstly, this paper discusses how to create an Example Dependent Costs SVM based on Data Distribution (DDEDC- SVM), and then we proposes a direct method to determine the parameters, i.e., "Average Density", in order to reduce the time for their selection via traditional cross validation. Experimental results show that this method can improve the performance of SVM on imbalanced datasets efficiently and effectively. |
Year | DOI | Venue |
---|---|---|
2013 | 10.4304/jcp.8.1.91-96 | JOURNAL OF COMPUTERS |
Keywords | Field | DocType |
support vector machines, imbalanced data, example dependent costs, average density | Data mining,Direct method,Pattern recognition,Computer science,Support vector machine,Artificial intelligence,Hyperplane,Cross-validation,Machine learning | Journal |
Volume | Issue | ISSN |
8 | 1 | 1796-203X |
Citations | PageRank | References |
2 | 0.39 | 11 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xin Jin | 1 | 2 | 0.39 |
Li Yujian | 2 | 207 | 31.77 |
Yihua Zhou | 3 | 7 | 5.33 |
Zhi Cai | 4 | 56 | 11.26 |