Abstract | ||
---|---|---|
Training data are critical to face recognition systems, but labeling a large-scale dataset for a particular domain requires substantial manual effort. Without a dataset related to the target domain, existing public datasets alone cannot yield a strong face recognition model. In this paper, we propose a semi-supervised method that automatically constructs a strong dataset from massive weakly labeled data, so that models trained on it achieve better performance on the target domain. In the case of Asian face recognition, a well-trained VRCN model by CASIA, which achieves 98.63% on LFW and 91.76% on YTF, achieves only an 88.53% recognition rate on our test dataset of Asian faces. We collect 530,560 weakly labeled Asian face images of 7,962 identities and obtain a cleaned dataset of size 285,933. A model trained on the cleaned dataset with the VRCN network and the same strategy achieves a 95.33% recognition rate on the Asian face test dataset (an improvement of 6.8 percentage points). |
Year | DOI | Venue |
---|---|---|
2019 | 10.1007/s11063-018-9839-z | Neural Processing Letters |
Keywords | Field | DocType |
Face recognition,Dataset construction,Model enhancing | Training set,Facial recognition system,Pattern recognition,Artificial intelligence,Labeled data,Mathematics | Journal |
Volume | Issue | ISSN |
49 | 3 | 1573-773X |
Citations | PageRank | References |
2 | 0.36 | 17 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Wei Xu | 1 | 4 | 11.47 |
Junyu Wu | 2 | 2 | 0.36 |
Shengyong Ding | 3 | 255 | 7.66 |
Linggan Lian | 4 | 2 | 0.36 |
Hongyang Chao | 5 | 495 | 36.96 |