Title
Combining compound recognition and PCFG-LA parsing with word lattices and conditional random fields
Abstract
The integration of compounds in a parsing procedure has been shown to improve accuracy in an artificial context where such expressions have been perfectly preidentified. This article evaluates two empirical strategies to incorporate such multiword units in a real PCFG-LA parsing context: (1) the use of a grammar including compound recognition, thanks to specialized annotation schemes for compounds; (2) the use of a state-of-the-art discriminative compound prerecognizer integrating endogenous and exogenous features. We show how these two strategies can be combined with word lattices representing possible lexical analyses generated by the recognizer. The proposed systems display significant gains in terms of multiword recognition and often in terms of standard parsing accuracy. Moreover, we show through an Oracle analysis that this combined strategy opens promising new research directions.
Year
DOI
Venue
2013
10.1145/2483969.2483970
TSLP
Keywords
Field
DocType
word lattice,artificial context,compound recognition,oracle analysis,multiword recognition,combined strategy,conditional random field,state-of-the-art discriminative compound prerecognizer,parsing procedure,combining compound recognition,real pcfg-la parsing context,multiword unit,standard parsing accuracy,parsing,conditional random fields
Conditional random field,Top-down parsing,S-attributed grammar,Expression (mathematics),Computer science,Oracle,Speech recognition,Bottom-up parsing,Artificial intelligence,Natural language processing,Parsing,Discriminative model
Journal
Volume
Issue
ISSN
10
3
1550-4875
Citations 
PageRank 
References 
4
0.41
31
Authors
3
Name
Order
Citations
PageRank
Matthieu Constant17712.61
Joseph Le Roux217516.34
Anthony Sigogne3404.44