Active data labeling for improved classifier generalizability. - Citegraph

Paper Info

Title
Active data labeling for improved classifier generalizability.

Abstract
Existing statistical learning methods perform well when evaluated on training and test data drawn from the same distribution. In practice, however, these distributions are not always the same. In this paper we derive an estimable upper bound on the test error rate that depends on a new probability distance measure between training and test distributions. Furthermore, we identify a non-parametric estimator for this distance measure that can be estimated directly from data. We show how this new probability distance measure can be used to construct algorithmic tools that improve performance. In particular, motivated by our upper bound, we propose a new active learning algorithm for domain adaptation. Comparative results confirm the efficacy of the active learning algorithm on a set of 12 speech classification tasks.

Year	DOI	Venue
2015	10.1016/j.sigpro.2014.09.016	Signal Processing
Keywords	Field	DocType
classification,active learning	Generalizability theory,Active learning,Pattern recognition,Upper and lower bounds,Computer science,Domain adaptation,Word error rate,Test data,Artificial intelligence,Classifier (linguistics),Machine learning,Estimator	Journal
Volume	Issue	ISSN
108	C	0165-1684
Citations	PageRank	References
1	0.36	10
Authors
2

Authors (2 rows)

Cited by (1 rows)

References (10 rows)

Name	Order	Citations	PageRank
Visar Berisha	1	76	22.38
Douglas Cochran	2	1	0.36

1