Re-evaluating the need for Modelling Term-Dependence in Text Classification Problems. - Citegraph

Paper Info

Title
Re-evaluating the need for Modelling Term-Dependence in Text Classification Problems.

Abstract
A substantial amount of research has been carried out in developing machine learning algorithms that account for term dependence in text classification. These algorithms offer acceptable performance in most cases but they are associated with a substantial cost. They require significantly greater resources to operate. This paper argues against the justification of the higher costs of these algorithms, based on their performance in text classification problems. In order to prove the conjecture, the performance of one of the best dependence models is compared to several well established algorithms in text classification. A very specific collection of datasets have been designed, which would best reflect the disparity in the nature of text data, that are present in real world applications. The results show that even one of the best term dependence models, performs decent at best when compared to other independence models. Coupled with their substantially greater requirement for hardware resources for operation, this makes them an impractical choice for being used in real world scenarios.

Year	Venue	Field
2017	arXiv: Information Retrieval	Data mining,Computer science,Conjecture
DocType	Volume	Citations
Journal	abs/1710.09085	0
PageRank	References	Authors
0.34	0	3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Sounak Banerjee	1	0	0.34
Prasenjit Majumder	2	173	25.15
Mandar Mitra	3	3092	338.20

1