Title
Neighborhood Preprocessing SVM for Large-Scale Data Sets Classification
Abstract
Support vector machine (SVM) has been a promising method for data mining and machine learning in recent years. However, the training complexity of SVM is highly dependent on the size of a data set. A preprocessing support vector machines (P-SVM) method for large-scale data set classification is presented to speed up SVM training. By analyzing the neighbor classification feature for each sample in training data set, a decision criterion was built to keep or delete this sample from the original data set without losing the classification. The new method can provide an SVM with high quality samples. Experiments with random data and UCI databases show that SVM with our new preprocessing method retains the high quality of training data set and the classification accuracy very well.
Year
DOI
Venue
2008
10.1109/FSKD.2008.94
FSKD (2)
Keywords
Field
DocType
neighborhood preprocessing svm,large-scale data,learning (artificial intelligence),neighbor classification feature,random data,large-scale data sets classification,support vector machine,data mining,classification accuracy,classification,machine learning,svm training,support vector machines,new method,original data,training data,kernel,accuracy,learning artificial intelligence
Structured support vector machine,Kernel (linear algebra),Training set,Data set,Ranking SVM,Pattern recognition,Computer science,Support vector machine,Preprocessor,Artificial intelligence,Machine learning,Speedup
Conference
Volume
ISBN
Citations 
2
978-0-7695-3305-6
2
PageRank 
References 
Authors
0.42
3
3
Name
Order
Citations
PageRank
Guangxi Chen172.56
Jian Xu241.17
Xiao-Lin Xiang351.52