A Semantics-based Approach to Disclosure Classification in User-Generated Online Content. - Citegraph

Paper Info

Title
A Semantics-based Approach to Disclosure Classification in User-Generated Online Content.

Abstract
As users engage in public discourse, the rate of voluntarily disclosed personal information has seen a steep increase. So-called self-disclosure can result in a number of privacy concerns. Users are often unaware of the sheer amount of personal information they share across online forums, commentaries, and social networks, as well as the power of modern AI to synthesize and gain insights from this data. This paper presents an approach to detect emotional and informational self-disclosure in natural language. We hypothesize that identifying frame semantics can meaningfully support this task. Specifically, we use Semantic Role Labeling to identify the lexical units and their semantic roles that signal self-disclosure. Experimental results on Reddit data show the performance gain of our method when compared to standard text classification methods based on BiLSTM, and BERT. In addition to improved performance, our approach provides insights into the drivers of disclosure behaviors.

Year	DOI	Venue
2020	10.18653/V1/2020.FINDINGS-EMNLP.312	EMNLP
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Chandan Akiti	1	0	1.01
Anna Cinzia Squicciarini	2	1301	106.30
Sarah Michele Rajtmajer	3	31	10.06

1