Title
A novel privacy-preserving probability transductive classifiers from group probabilities based on regression model
Abstract
Group probability classifier learning is an emerging and promising learning technique, especially in privacy-preserving data mining. It is used to train a classifier from a group probability dataset, where the class labels of each sample are unknown while the probabilities of each class in the given data groups of the whole dataset are available. The existing work is mainly based on the inverse calibration (IC) strategy to obtain the estimated labels for data in the group probability dataset and then make use of classical classification algorithms such as support vector machine (SVM) model to train the desired classifier. A critical challenge of the exiting IC-based methods lies in the difficulty of designing an ideal IC function for label estimation and the methods are sensitive to the adopted IC function. In order to overcome this shortcoming, a novel probability transductive classifier that does not involve IC in the learning procedure is proposed, where the probability values are directly used as the output of the training data for the model training. Particularly, on the training data with the output being continuous real values, the existing classical regression model can be easily adopted to model the group probability classification problem. For a future testing data, the model output of the obtained group probability classification model can present the probability that the testing data belong to the positive class. With a given threshold, the final class label of the testing data can be obtained for the classification task. The experimental results on synthetic datasets and real UCI datasets show that the proposed method is more effective than the existing methods.
Year
DOI
Venue
2015
10.3233/IFS-151621
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS
Keywords
Field
DocType
Privacy preserving,regression model,probability transductive,group probability,classification
Transduction (machine learning),Inverse,Pattern recognition,Regression analysis,Computer science,Support vector machine,Test data,Artificial intelligence,Classifier (linguistics),Statistical classification,Machine learning,Generative model
Journal
Volume
Issue
ISSN
29
2
1064-1246
Citations 
PageRank 
References 
0
0.34
15
Authors
6
Name
Order
Citations
PageRank
Yizhang Jiang138227.24
Zhaohong Deng264735.34
Kup-Sze Choi352647.41
Pengjiang Qian413311.25
Wenjun Hu52354124.48
Shitong Wang61485109.13