Title
User Review Sites as a Resource for Large-Scale Sociolinguistic Studies
Abstract
Sociolinguistic studies investigate the relation between language and extra-linguistic variables. This requires both representative text data and the associated socio-economic meta-data of the subjects. Traditionally, sociolinguistic studies use small samples of hand-curated data and meta-data. This can lead to exaggerated or false conclusions. Using social media data offers a large-scale source of language data, but usually lacks reliable socio-economic meta-data. Our research aims to remedy both problems by exploring a large new data source, international review websites with user profiles. They provide more text data than manually collected studies, and more meta-data than most available social media text. We describe the data and present various pilot studies, illustrating the usefulness of this resource for sociolinguistic studies. Our approach can help generate new research hypotheses based on data-driven findings across several countries and languages.
Year
DOI
Venue
2015
10.1145/2736277.2741141
WWW
Keywords
DocType
Citations 
Language-analysis techniques, Multi-lingual and cross-lingual analysis and mining, Social science research based on social media, Insights from natural-language analysis of social media, Novel applications
Conference
14
PageRank 
References 
Authors
0.61
13
3
Name
Order
Citations
PageRank
Dirk Hovy149040.44
Anders Johannsen215012.12
Anders Søgaard368481.68