Title
Short Text Coherence Hypothesis.
Abstract
In this paper, we experimentally study the degree to which the length of a short text affects its comprehensiveness and readability, within quantitative linguistics. The quantitative linguistics focus mainly in analysis of large text collections and one of the major scientific theories in use is the Menzerath-Altmann law. In this paper we attempt to define the quantitative analysis framework for short texts consisting approximately of one or two sentences, due to the fact that they are considered very important in many scientific fields. To achieve the aim of this paper, a coherence statistical testing process of three variables was created for short texts. The implementation of that was possible through experimental and statistical evaluation. Upon completion of the above-mentioned evaluation, the statistical results showed that short text coherence, comprehensiveness and readability are fully achieved in short texts consisting of 14 words, when three predetermined variables are associated and vice versa. To prove the above hypothesis the theory of Vector Space Model and Kendall's Coefficient of Concordance were used. The assessment of statistical results concluded that the above hypothesis can be fully met for a number of cases with a probability p > 99%. Moreover, in the experiment were used short texts in English language but it was proven that language can be considered irrelevant. To corroborate this, a smaller scale experiment with short texts in the German language was conducted and hypothesis was confirmed that the proposed model of this paper can be applied in all short texts regardless of their linguistic origin.
Year
DOI
Venue
2016
10.1080/09296174.2016.1142328
JOURNAL OF QUANTITATIVE LINGUISTICS
Field
DocType
Volume
Predetermined variables,Information retrieval,Computer science,Scientific theory,Coherence (physics),Readability,Artificial intelligence,Natural language processing,Linguistics,Quantitative linguistics,Statistical hypothesis testing
Journal
23.0
Issue
ISSN
Citations 
2
0929-6174
1
PageRank 
References 
Authors
0.41
12
4
Name
Order
Citations
PageRank
Sylvia Poulimenou111.09
Sofia Stamou213617.45
Sozon Papavlasopoulos3214.79
Marios Poulos410915.71