Abstract | ||
---|---|---|
In this paper, we experimentally study the degree to which the length of a short text affects its comprehensiveness and readability, within quantitative linguistics. The quantitative linguistics focus mainly in analysis of large text collections and one of the major scientific theories in use is the Menzerath-Altmann law. In this paper we attempt to define the quantitative analysis framework for short texts consisting approximately of one or two sentences, due to the fact that they are considered very important in many scientific fields. To achieve the aim of this paper, a coherence statistical testing process of three variables was created for short texts. The implementation of that was possible through experimental and statistical evaluation. Upon completion of the above-mentioned evaluation, the statistical results showed that short text coherence, comprehensiveness and readability are fully achieved in short texts consisting of 14 words, when three predetermined variables are associated and vice versa. To prove the above hypothesis the theory of Vector Space Model and Kendall's Coefficient of Concordance were used. The assessment of statistical results concluded that the above hypothesis can be fully met for a number of cases with a probability p > 99%. Moreover, in the experiment were used short texts in English language but it was proven that language can be considered irrelevant. To corroborate this, a smaller scale experiment with short texts in the German language was conducted and hypothesis was confirmed that the proposed model of this paper can be applied in all short texts regardless of their linguistic origin. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1080/09296174.2016.1142328 | JOURNAL OF QUANTITATIVE LINGUISTICS |
Field | DocType | Volume |
Predetermined variables,Information retrieval,Computer science,Scientific theory,Coherence (physics),Readability,Artificial intelligence,Natural language processing,Linguistics,Quantitative linguistics,Statistical hypothesis testing | Journal | 23.0 |
Issue | ISSN | Citations |
2 | 0929-6174 | 1 |
PageRank | References | Authors |
0.41 | 12 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sylvia Poulimenou | 1 | 1 | 1.09 |
Sofia Stamou | 2 | 136 | 17.45 |
Sozon Papavlasopoulos | 3 | 21 | 4.79 |
Marios Poulos | 4 | 109 | 15.71 |