Title | ||
---|---|---|
The Talk of Norway: a richly annotated corpus of the Norwegian parliament, 1998-2016. |
Abstract | ||
---|---|---|
In this work we present the Talk of Norway (ToN) data set, a collection of Norwegian Parliament speeches from 1998 to 2016.
Every speech is richly annotated with metadata harvested from different sources, and augmented with language type, sentence, token, lemma, part-of-speech, and morphological feature annotations. We also present a pilot study on party classification in the Norwegian Parliament, carried out in the context of a cross-faculty collaboration involving researchers from both Political Science and Computer Science. Our initial experiments demonstrate how the linguistic and institutional annotations in ToN can be used to gather insights on how different aspects of the political process affect classification.
|
Year | DOI | Venue |
---|---|---|
2018 | 10.1007/s10579-018-9411-5 | Language Resources and Evaluation |
Keywords | Field | DocType |
Computational political sciences, Computational social science, Language technology, Natural language processing, Parliamentary proceedings | Norwegian,Metadata,Political science,Computational sociology,Parliament,Natural language processing,Artificial intelligence,Politics,Sentence,Lemma (mathematics),Language technology | Journal |
Volume | Issue | ISSN |
52 | 3 | 1574-020X |
Citations | PageRank | References |
0 | 0.34 | 6 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Emanuele Lapponi | 1 | 48 | 4.61 |
Martin G. Søyland | 2 | 0 | 0.34 |
Erik Velldal | 3 | 103 | 19.11 |
Stephan Oepen | 4 | 533 | 61.08 |