Title
The Talk of Norway: a richly annotated corpus of the Norwegian parliament, 1998-2016.
Abstract
In this work we present the Talk of Norway (ToN) data set, a collection of Norwegian Parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata harvested from different sources, and augmented with language type, sentence, token, lemma, part-of-speech, and morphological feature annotations. We also present a pilot study on party classification in the Norwegian Parliament, carried out in the context of a cross-faculty collaboration involving researchers from both Political Science and Computer Science. Our initial experiments demonstrate how the linguistic and institutional annotations in ToN can be used to gather insights on how different aspects of the political process affect classification.
Year
DOI
Venue
2018
10.1007/s10579-018-9411-5
Language Resources and Evaluation
Keywords
Field
DocType
Computational political sciences, Computational social science, Language technology, Natural language processing, Parliamentary proceedings
Norwegian,Metadata,Political science,Computational sociology,Parliament,Natural language processing,Artificial intelligence,Politics,Sentence,Lemma (mathematics),Language technology
Journal
Volume
Issue
ISSN
52
3
1574-020X
Citations 
PageRank 
References 
0
0.34
6
Authors
4
Name
Order
Citations
PageRank
Emanuele Lapponi1484.61
Martin G. Søyland200.34
Erik Velldal310319.11
Stephan Oepen453361.08