Title
Identifying label noise in time-series datasets
Abstract
Reliably labeled datasets are crucial to the performance of supervised learning methods. Time-series data pose additional challenges. Data points lying on borders between classes can be mislabeled due to perception limitations of human labelers. Sensor measurements may not be directly interpretable by humans. Thus label noise cannot be manually removed. As a result, time-series datasets often contain a significant amount of label noise that can degrade the performance of machine learning models. This work focuses on label noise identification and removal by extending previous methods developed for static instances to the domain of time-series data. We use a combination of deep learning and visualization algorithms to facilitate automatic noise removal. We show that our approach can identify mislabeled instances, which results in improved classification accuracy on four synthetic and two real publicly available human activity datasets.
Year
2020
DOI
10.1145/3410530.3414366
Venue
UbiComp/ISWC '20: 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2020 ACM International Symposium on Wearable Computers, Virtual Event, Mexico, September 2020
DocType
Conference
ISBN
978-1-4503-8076-8
Citations
0
PageRank
0.34
References
0
Authors
2
Name, Order, Citations, PageRank
Gentry Atkinson, 1, 0, 0.68
Vangelis Metsis, 2, 0, 0.34