Abstract | ||
---|---|---|
We present coreference annotation on parallel Czech-English texts of the Prague Czech-English Dependency Treebank (PCEDT). The paper describes innovations made to PCEDT 2.0 concerning coreference, as well as the coreference information already present there. We characterize the coreference annotation scheme, give the statistics and compare our annotation with the coreference annotation in Ontonotes and Prague Dependency Treebank for Czech. We also present the experiments made using this corpus to improve the alignment of coreferential expressions, which helps us to collect better statistics of correspondences between types of coreferential relations in Czech and English. The corpus released as PCEDT 2.0 Coref is publicly available. |
Year | Venue | Keywords |
---|---|---|
2016 | LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | parallel corpus,bilingual coreference,alignment,Czech,English |
Field | DocType | Citations |
Coreference,Czech,Computer science,Natural language processing,Treebank,Artificial intelligence,Linguistics | Conference | 1 |
PageRank | References | Authors |
0.37 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Anna Nedoluzhko | 1 | 30 | 6.71 |
Michal Novák | 2 | 4 | 3.79 |
Silvie Cinková | 3 | 53 | 10.55 |
Marie Mikulová | 4 | 20 | 3.97 |
Jirí Mírovský | 5 | 65 | 16.79 |