Parallel Feature Selection Inspired by Group Testing. - Citegraph

Paper Info

Title
Parallel Feature Selection Inspired by Group Testing.

Abstract
This paper presents a parallel feature selection method for classification that scales up to very high dimensions and large data sizes. Our original method is inspired by group testing theory, under which the feature selection procedure consists of a collection of randomized tests to be performed in parallel. Each test corresponds to a subset of features, for which a scoring function may be applied to measure the relevance of the features in a classification task. We develop a general theory providing sufficient conditions under which true features are guaranteed to be correctly identified. Superior performance of our method is demonstrated on a challenging relation extraction task from a very large data set that have both redundant features and sample size in the order of millions. We present comprehensive comparisons with state-of-the-art feature selection methods on a range of data sets, for which our method exhibits competitive performance in terms of running time and accuracy. Moreover, it also yields substantial speedup when used as a pre-processing step for most other existing methods.

Year	Venue	Field
2014	ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014)	Data mining,Data set,Pattern recognition,Feature selection,Feature (computer vision),Computer science,Artificial intelligence,Group testing,Sample size determination,Machine learning,Speedup,Relationship extraction
DocType	Volume	ISSN
Conference	27	1049-5258
Citations	PageRank	References
6	0.47	19
Authors
7

Authors (7 rows)

Cited by (6 rows)

References (19 rows)

Name	Order	Citations	PageRank
Yingbo Zhou	1	263	19.43
Utkarsh Porwal	2	31	4.12
Ce Zhang	3	803	83.39
Hung Q. Ngo	4	670	56.62
Long Nguyen	5	6	0.47
Ré Christopher	6	3422	192.34
Venu Govindaraju	7	3521	422.00

1