Title | ||
---|---|---|
Combining compound recognition and PCFG-LA parsing with word lattices and conditional random fields |
Abstract | ||
---|---|---|
The integration of compounds in a parsing procedure has been shown to improve accuracy in an artificial context where such expressions have been perfectly preidentified. This article evaluates two empirical strategies to incorporate such multiword units in a real PCFG-LA parsing context: (1) the use of a grammar including compound recognition, thanks to specialized annotation schemes for compounds; (2) the use of a state-of-the-art discriminative compound prerecognizer integrating endogenous and exogenous features. We show how these two strategies can be combined with word lattices representing possible lexical analyses generated by the recognizer. The proposed systems display significant gains in terms of multiword recognition and often in terms of standard parsing accuracy. Moreover, we show through an Oracle analysis that this combined strategy opens promising new research directions. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1145/2483969.2483970 | TSLP |
Keywords | Field | DocType |
word lattice,artificial context,compound recognition,oracle analysis,multiword recognition,combined strategy,conditional random field,state-of-the-art discriminative compound prerecognizer,parsing procedure,combining compound recognition,real pcfg-la parsing context,multiword unit,standard parsing accuracy,parsing,conditional random fields | Conditional random field,Top-down parsing,S-attributed grammar,Expression (mathematics),Computer science,Oracle,Speech recognition,Bottom-up parsing,Artificial intelligence,Natural language processing,Parsing,Discriminative model | Journal |
Volume | Issue | ISSN |
10 | 3 | 1550-4875 |
Citations | PageRank | References |
4 | 0.41 | 31 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Matthieu Constant | 1 | 77 | 12.61 |
Joseph Le Roux | 2 | 175 | 16.34 |
Anthony Sigogne | 3 | 40 | 4.44 |