Title
Fast convex-hull vector machine for training on large-scale ncRNA data classification tasks.
Abstract
The support vector machine (SVM) has become a provably effective tool for non-coding RNA (ncRNA) data classification. However, as the species and sizes of ncRNA sequences increase quickly, SVM training time becomes intolerable, even impractical, for large-scale data. Although many fast SVM-based classification techniques have been developed, their applicability depends heavily on the formulations involved and, in particular, on reducing the computational cost of the corresponding kernel matrix. In this paper, building on the latest advance in fast two-dimensional convex hull approximation with asymptotically linear time complexity, a fast convex-hull vector machine (CHVM) is developed to overcome the applicability limitation of SVM-based classification techniques and to provide more choices for large-scale ncRNA data classification tasks. By projecting a dataset onto all the corresponding two-dimensional projection combinations, CHVM first quickly extracts the boundary vectors of the whole training dataset in the kernel space, and then attempts to form the convex hull vectors of the whole kernelized training set by integrating all the obtained boundary vectors. Finally, the convex hull vectors are presented as the inputs to an SVM classifier, regardless of the SVM formulation adopted. Experimental results on three large-scale ncRNA datasets indicate that CHVM outperforms five SVM-based classifiers, random forest (RF), and back-propagation neural networks (BP), especially in training time.
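The projection-and-merge step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several simplifying assumptions: it works in the input space rather than the kernel space, it uses an exact 2-D hull (Andrew's monotone chain, O(n log n)) in place of the paper's linear-time hull approximation, and the function names hull_indices_2d and boundary_indices are hypothetical. The returned indices would select the reduced training set handed to any SVM trainer.

```python
from itertools import combinations


def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive means a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def hull_indices_2d(points2d):
    """Indices of the 2-D convex hull vertices (exact monotone chain).

    Assumption: stands in for the fast approximate hull used in the paper.
    """
    order = sorted(range(len(points2d)), key=lambda i: points2d[i])
    if len(order) <= 2:
        return set(order)

    def half_chain(seq):
        chain = []
        for i in seq:
            # Drop points that would make a right turn or a straight line.
            while len(chain) >= 2 and cross(
                points2d[chain[-2]], points2d[chain[-1]], points2d[i]
            ) <= 0:
                chain.pop()
            chain.append(i)
        return chain

    lower = half_chain(order)
    upper = half_chain(reversed(order))
    return set(lower) | set(upper)


def boundary_indices(points):
    """Union of 2-D hull vertices over every pair of coordinates.

    Assumption: projections are taken in the input space; the paper
    projects the kernelized training set instead.
    """
    dims = len(points[0])
    keep = set()
    for a, b in combinations(range(dims), 2):
        projected = [(p[a], p[b]) for p in points]
        keep |= hull_indices_2d(projected)
    return sorted(keep)


# Four outer points and one interior point in three dimensions.
data = [(0, 0, 0), (4, 0, 1), (0, 4, 2), (1, 1, 5), (1, 1, 1)]
selected = boundary_indices(data)
```

Here the interior point (index 4) lies inside the hull of every pairwise projection, so it is dropped, while the four outer points are kept. Merging per-projection hulls may miss some true hull vertices in higher dimensions, which is consistent with the abstract's wording that CHVM "attempts to form" the convex hull vectors.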
Year: 2018
DOI: 10.1016/j.knosys.2018.03.029
Venue: Knowledge-Based Systems
Keywords: Large scale ncRNA data classification, Fast convex hull approximation, Support vector machines, Kernelization
Field: Training set, Kernel (linear algebra), Pattern recognition, Computer science, Support vector machine, Convex hull, Artificial intelligence, Data classification, SVM classifier, Time complexity, Random forest, Machine learning
DocType: Journal
Volume: 151
ISSN: 0950-7051
Citations: 2
PageRank: 0.37
References: 30
Authors: 3
Name, Order, Citations, PageRank
Xiaoqing Gu, 1, 44, 9.30
Korris Fu-lai Chung, 2, 131, 10.51
Shitong Wang, 3, 1485, 109.13