Title
Active Learning Without Unlabeled Samples: Generating Questions And Labels Using Monte Carlo Tree Search
Abstract
Classification of short texts (e.g. reviews, sentences) is a well-defined task, usually attempted in regimes where data is abundant. This is, however, not always the case. Limited data availability is very common in industrial settings and seriously hinders the performance of any classification task - it is not always obvious how to perform data augmentation. In this work, we apply the Recurrent Neural Network and Monte Carlo Tree Search (MCTS) to generate unlabelled questions. We use Human In-the-Loop to help decide whether 1) the generated questions are meaningful or not 2) label them into correct categories. We show that generated data leads to improved classification performance in comparison to the vanilla dataset.
Year
DOI
Venue
2019
10.1109/BigData47090.2019.9006276
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA)
Keywords
Field
DocType
Sentence generation, MCTS, active learning, Human-In-the-Loop
Data mining,Monte Carlo tree search,Active learning,Data availability,Computer science,Recurrent neural network,Artificial intelligence,Human-in-the-loop,Sentence generation,Machine learning
Conference
ISSN
Citations 
PageRank 
2639-1589
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Sathish K. Sankarpandi100.34
Spyros Samothrakis200.34
luca citi316827.88
Peter Brady400.34