Abstract | ||
---|---|---|
We introduce a substantial update of the Prague Czech-English Dependency Treebank, a parallel corpus manually annotated at the deep syntactic layer of linguistic representation. The English part consists of the Wall Street Journal (WSJ) section of the Penn Treebank. The Czech part was translated from the English source sentence by sentence. This paper gives a high level overview of the underlying linguistic theory (the so-called tectogrammatical annotation) with some details of the most important features like valency annotation, ellipsis reconstruction or coreference. |
Year | Venue | Keywords |
---|---|---|
2012 | LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | parallel corpus,parallel treebank,deep syntactic treebank |
DocType | Citations | PageRank |
Conference | 16 | 1.08 |
References | Authors | |
3 | 16 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jan Hajic | 1 | 1184 | 109.62 |
Eva Hajicová | 2 | 96 | 33.19 |
Jarmila Panevová | 3 | 45 | 12.07 |
Petr Sgall | 4 | 127 | 35.81 |
Ondřej Bojar | 5 | 1701 | 122.71 |
Silvie Cinková | 6 | 53 | 10.55 |
Eva Fucíková | 7 | 21 | 5.95 |
Marie Mikulová | 8 | 20 | 3.97 |
Petr Pajas | 9 | 148 | 15.42 |
Jan Popelka | 10 | 26 | 2.63 |
Jirí Semecký | 11 | 21 | 3.29 |
Jana Sindlerová | 12 | 33 | 5.82 |
Jan Stepánek | 13 | 86 | 7.37 |
Josef Toman | 14 | 19 | 1.56 |
Zdeňka Urešová | 15 | 51 | 11.81 |
Zdenek Zabokrtský | 16 | 193 | 22.23 |