Title
Regularized Phrase-Based Topic Model for Automatic Question Classification With Domain-Agnostic Class Labels
Abstract
Classification of questions according to domain-agnostic class labels relies on a suitable feature extraction process. We propose the use of phrases that is more effective than using words to represent questions. The proposed phrase-based topic modeling technique employs asymmetric priors that are scaled with a new C-value for nested regular expressions. In addition, to suppress high-frequency words in phrases, we deploy term weightages computed using the modified distinguishing feature selector. The proposed approach also incorporates a new topic regularization mechanism to facilitate efficient mapping of questions to class labels. We validate the performance of our proposed model via four datasets across different domain-agnostic class labels comprising question types, reasoning capabilities, and cognitive complexities. Results obtained highlight that the proposed technique outperforms existing methods in terms of macro-average F1 score.
Year
DOI
Venue
2021
10.1109/TASLP.2021.3126937
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Keywords
DocType
Volume
Computational modeling, Speech processing, Feature extraction, Semantics, Syntactics, Linguistics, Complexity theory, Automatic question classification, topic modeling, nested phrase mining, regular expression, term weighting schemes
Journal
29
Issue
ISSN
Citations 
1
2329-9290
0
PageRank 
References 
Authors
0.34
21
3
Name
Order
Citations
PageRank
S. Supraja101.01
Andy W. H. Khong2235.29
Sivanagaraja Tatinati3175.36