Title
Posterior Sparsity in Unsupervised Dependency Parsing
Abstract
A strong inductive bias is essential in unsupervised grammar induction. In this paper, we explore a particular sparsity bias in dependency grammars that encourages a small number of unique dependency types. We use part-of-speech (POS) tags to group dependencies by parent-child types and investigate sparsity-inducing penalties on the posterior distributions of parent-child POS tag pairs in the posterior regularization (PR) framework of Graça et al. (2007). In experiments with 12 different languages, we achieve significant gains in directed attachment accuracy over the standard expectation maximization (EM) baseline, with an average accuracy improvement of 6.5%, outperforming EM by at least 1% for 9 out of 12 languages. Furthermore, the new method outperforms models based on standard Bayesian sparsity-inducing parameter priors with an average improvement of 5% and positive gains of at least 1% for 9 out of 12 languages. On English text in particular, we show that our approach improves performance over other state-of-the-art techniques.
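The penalty the abstract describes can be pictured as an l1/l-infinity norm over posterior edge expectations grouped by parent-child POS tag pair: for each tag pair, take the largest posterior mass any single candidate edge assigns to it, then sum these maxima over pairs, so the penalty is small exactly when few unique dependency types are used. Below is a minimal illustrative sketch, not the paper's implementation: it assumes edge posteriors are already available as (parent tag, child tag, probability) triples pooled over the corpus, the function name l1linf_penalty is invented for illustration, and the KL-divergence term that the full posterior regularization objective trades off against this penalty is omitted.

    from collections import defaultdict

    def l1linf_penalty(edge_posteriors):
        """Sum, over parent-child POS tag pairs, the maximum posterior
        probability that any single candidate edge assigns to that pair
        (an l1 norm over pairs of an l-infinity norm within each pair).

        edge_posteriors: iterable of (parent_tag, child_tag, prob)
        triples, pooled over all sentences in the corpus.
        """
        max_by_pair = defaultdict(float)
        for parent_tag, child_tag, prob in edge_posteriors:
            key = (parent_tag, child_tag)
            if prob > max_by_pair[key]:
                max_by_pair[key] = prob
        # Small when posterior mass concentrates on few unique tag pairs.
        return sum(max_by_pair.values())

    # Hypothetical example: two tag pairs are used, so the penalty is
    # max(0.9, 0.7) + max(0.4) = 1.3.
    posteriors = [
        ("VERB", "NOUN", 0.9),
        ("VERB", "NOUN", 0.7),
        ("NOUN", "ADJ", 0.4),
    ]
    print(l1linf_penalty(posteriors))

Note that reusing an already-selected tag pair (the second VERB-NOUN edge) adds nothing to the penalty, which is what encourages a small set of unique dependency types.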
Year
2011
DOI
10.5555/1953048.1953062
Venue
Journal of Machine Learning Research
Keywords
average improvement, parent-child type, unsupervised dependency parsing, average accuracy improvement, attachment accuracy, posterior regularization, posterior distribution, posterior sparsity, particular sparsity bias, parent-child POS tag pair, group dependency, dependency grammar, dependency parsing
Field
Rule-based machine translation, Inductive bias, Grammar induction, Computer science, Dependency grammar, Regularization (mathematics), Artificial intelligence, Pattern recognition, Expectation–maximization algorithm, Speech recognition, Prior probability, Machine learning, Bayesian probability
DocType
Journal
Volume
12
ISSN
1532-4435
Citations
17
PageRank
0.77
References
24
Authors
5
Name                  Order  Citations  PageRank
Jennifer Gillenwater  1      400        15.19
Kuzman Ganchev        2      737        35.21
João Graça            3      295        11.19
Fernando Pereira      4      17717      2124.79
Ben Taskar            5      3175       209.33