Title
Pattern Recognition Method For Classification Of Agricultural Scientific Papers In Polish
Abstract
Calculating text similarity is an essential task in text analysis and classification. It can be based, e.g., on Jaccard, cosine, or other similar measures. Such measures treat the text as a bag of words and therefore lose some syntactic and semantic features of its sentences. This article presents a different measure based on the so-called artificial sentence pattern (ASP) method. This method has been developed to analyze texts in the Polish language, which has very rich inflection. ASP therefore utilizes syntactic and semantic rules of the Polish language. Nevertheless, we argue that it admits extensions to other languages. As a result of the analysis, we have obtained several hypernodes which contain the most important words. Each hypernode corresponds to one of the examined documents, the latter being published papers from the agricultural domain written in Polish. Experimental results obtained from that set of papers are described and discussed. Those results are illustrated visually using graphs of hypernodes and compared with the Jaccard and cosine measures.
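For orientation, the two baseline measures named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the regular-expression tokenizer, and the English sample sentences are assumptions made here, and no stemming or lemmatization (which Polish inflection would normally call for) is attempted.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumed tokenizer: lowercase, split on runs of word characters.
    # \w is Unicode-aware in Python 3, so Polish diacritics are kept.
    return re.findall(r"\w+", text.lower())


def jaccard_similarity(text_a, text_b):
    # |A ∩ B| / |A ∪ B| over the sets of distinct tokens.
    set_a, set_b = set(tokenize(text_a)), set(tokenize(text_b))
    union = set_a | set_b
    if not union:
        return 0.0
    return len(set_a & set_b) / len(union)


def cosine_similarity(text_a, text_b):
    # Cosine of the angle between term-frequency vectors.
    counts_a, counts_b = Counter(tokenize(text_a)), Counter(tokenize(text_b))
    dot = sum(counts_a[term] * counts_b[term] for term in counts_a)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    a = "the cat sat on the mat"
    b = "the cat lay on the rug"
    print(jaccard_similarity(a, b))
    print(cosine_similarity(a, b))
    # Word order is ignored by both measures:
    print(jaccard_similarity("dog bites man", "man bites dog"))
```

The last call shows the limitation the abstract points to: two sentences with opposite meanings but the same words are scored as identical, because a bag-of-words representation discards sentence structure.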
Year: 2018
DOI: 10.1007/978-3-030-00692-1_43
Venue: COMPUTER VISION AND GRAPHICS (ICCVG 2018)
Field: Graph, Pattern recognition, Computer science, Inflection, Polish, Jaccard index, Artificial intelligence, Syntax, Sentence
DocType: Conference
Volume: 11114
ISSN: 0302-9743
Citations: 0
PageRank: 0.34
References: 5
Authors: 2
Name
Order
Citations
PageRank
Piotr Wrzeciono121.47
Waldemar Karwowski212031.49