Title
Discriminating Variable Star Candidates in Large Image Databases from the HiTS Survey Using NMF.
Abstract
New instruments and technologies are allowing the acquisition of large amounts of data from astronomical surveys. Nowadays there is a pressing need for autonomous methods to discriminate the interesting astronomical objects in the vast sky. The High Cadence Transient Survey (HiTS) project is an astronomical survey that is trying to find a rare transient event that occurs during the first instants of a supernova. In this paper we propose an autonomous method to discriminate stellar variability from the HiTS database, that uses a feature extraction scheme based on Non-negative matrix factorization (NMF). Using NMF, dictionaries of image prototypes that represent the data in a compact way are obtained. The projections of the dataset into these dictionaries are fed into a random forest classifier. NMF is compared with other feature extraction schemes, on a subset of 500,000 transient candidates from the HiTS survey. With NMF a better class separability at feature level is obtained which enhances the classification accuracy significantly. Using the NMF features less than 4% of the true stellar transients are lost, at a manageable false positive rate of 0.1%.
Year
DOI
Venue
2015
10.1016/j.procs.2015.07.276
Procedia Computer Science
Keywords
Field
DocType
Data mining,Astronomy,Supernovae,Machine learning,Non-negative matrix factorization
False positive rate,Data mining,Computer science,Artificial intelligence,Astronomical Objects,Random forest,Astronomical survey,Pattern recognition,Matrix decomposition,Sky,Feature extraction,Non-negative matrix factorization,Machine learning,Database
Conference
Volume
ISSN
Citations 
53
1877-0509
0
PageRank 
References 
Authors
0.34
2
4
Name
Order
Citations
PageRank
Pablo Huijse1103.54
Pablo A. Estévez237236.29
Francisco Förster341.21
Emanuel Berrocal400.34