Abstract | ||
---|---|---|
Collocational prepositional phrases like ten koste van (at the expense of), met het oog op (with an eye on), and onder het mom van (under the pretext of) are patterns of the form P-NP-P, which have a non-compositional semantics and which are syntactically rigid or idiosyncratic. We present a number of linguistic tests which set such items apart from regularly built prepositional phrases. To find candidate strings which should be included in a computational lexicon as collocational prepositional phrases, we extract all instances of the relevant pattern from a corpus annotated with POS tags, Next, we introduce a number of statistical tests (mutual information, log-likelihood, and chi(2)) to find those instances which behave like strong collocations. The strongest collocations according to the statistical tests are compared with lists of such items presented elsewhere, and were evaluated by human judges. |
Year | Venue | Keywords |
---|---|---|
2001 | LANGUAGE AND COMPUTERS : STUDIES IN PRACTICAL LINGUISTICS | statistical test,mutual information |
Field | DocType | Issue |
Of the form,Computer science,Pretext,Lexicon,Mutual information,Artificial intelligence,Natural language processing,Linguistics,Semantics,Statistical hypothesis testing | Conference | 45.0 |
ISSN | Citations | PageRank |
0921-5034 | 1 | 1.03 |
References | Authors | |
7 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Gosse Bouma | 1 | 483 | 70.88 |
begona villada moiron | 2 | 4 | 1.84 |