Title
Corpus-based Acquisition of Collocational Prepositional Phrases.
Abstract
Collocational prepositional phrases like ten koste van (at the expense of), met het oog op (with an eye on), and onder het mom van (under the pretext of) are patterns of the form P-NP-P, which have a non-compositional semantics and which are syntactically rigid or idiosyncratic. We present a number of linguistic tests which set such items apart from regularly built prepositional phrases. To find candidate strings which should be included in a computational lexicon as collocational prepositional phrases, we extract all instances of the relevant pattern from a corpus annotated with POS tags, Next, we introduce a number of statistical tests (mutual information, log-likelihood, and chi(2)) to find those instances which behave like strong collocations. The strongest collocations according to the statistical tests are compared with lists of such items presented elsewhere, and were evaluated by human judges.
Year
Venue
Keywords
2001
LANGUAGE AND COMPUTERS : STUDIES IN PRACTICAL LINGUISTICS
statistical test,mutual information
Field
DocType
Issue
Of the form,Computer science,Pretext,Lexicon,Mutual information,Artificial intelligence,Natural language processing,Linguistics,Semantics,Statistical hypothesis testing
Conference
45.0
ISSN
Citations 
PageRank 
0921-5034
1
1.03
References 
Authors
7
2
Name
Order
Citations
PageRank
Gosse Bouma148370.88
begona villada moiron241.84