Title
Tools for Collocation Extraction: Preferences for Active vs. Passive
Abstract
We present and partially evaluate procedures for the extraction of noun+verb collocation candidates from German text corpora, along with their morphosyntactic preferences, especially for the active vs. passive voice. We start from tokenized, tagged, lemmatized and chunked text, and we use extraction patterns formulated in the CQP corpus query language. We discuss the results of a precision evaluation, on administrative texts from the European Union: we find a considerable amount of specialized collocations, as well as general ones and complex predicates; overall the precision is considerably higher than that of a statistical extractor used as a baseline.
Year
Venue
Field
2008
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008
Computer science,Speech recognition,Natural language processing,Collocation extraction,Artificial intelligence
DocType
Citations 
PageRank 
Conference
2
0.44
References 
Authors
5
2
Name
Order
Citations
PageRank
Ulrich Heid119040.48
Marion Weller250.90