Abstract | ||
---|---|---|
This paper introduces a manually annotated dataset for named entity recognition (NER) in micro-blogging text for Romanian language. It contains gold annotations for 9 entity classes and expressions: persons, locations, organizations, time expressions, legal references, disorders, chemicals, medical devices and anatomical parts. Furthermore, word embeddings models computed on a larger micro-blogging corpus are made available. Finally, several NER models are trained and their performance is evaluated against the newly introduced corpus. |
Year | Venue | DocType |
---|---|---|
2022 | International Conference on Computational Linguistics | Conference |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Vasile Pais | 1 | 0 | 1.69 |
Verginica Barbu Mititelu | 2 | 25 | 11.35 |
Elena Irimia | 3 | 24 | 6.76 |
Maria Mitrofan | 4 | 1 | 2.72 |
Carol Luca Gasan | 5 | 0 | 0.34 |
Roxana Micu | 6 | 0 | 0.34 |