The Text Classification Based on Big Data Analysis for Keyword Definition Using Stemming - Citegraph

Paper Info

Title
The Text Classification Based on Big Data Analysis for Keyword Definition Using Stemming

Abstract
Software for stemming Ukrainian-language texts with the Porter algorithm has been developed and implemented, together with methods for classifying texts written in Ukrainian. The software is written in Python and uses the NLTK library. Existing text-analysis methods, including classification and clustering, were analysed. Methods for vectorising text data and approaches to maintaining the dictionary were considered. In addition, information about previously analysed data is saved.

Year	2021
DOI	10.1109/CSIT52700.2021.9648764
Venue	2021 IEEE 16th International Conference on Computer Sciences and Information Technologies (CSIT)
Keywords	stemming, lemmatisation, neural network, Bayesian classifier, python programming language, word model, natural language, Ukrainian texts, classification, clustering, Python, NLTK, text classification
DocType	Conference
Volume	1
ISSN	2766-3655
ISBN	978-1-6654-4258-9
Citations	0
PageRank	0.34
References	2
Authors	5

Authors (5 rows)

Name	Order	Citations	PageRank
Andrii Berko	1	0	1.01
Yurii Matseliukh	2	0	0.34
Yurii Ivaniv	3	0	0.34
Lyubomyr Chyrun	4	0	1.35
Vadim Schuchmann	5	0	1.35
