Title
Semi-Supervised Psychometric Scoring Of Document Collections
Abstract
We describe a generic computational approach that can be used in developing methods for psychometric profiling. Our approach is based on semi-supervised analysis of document collections using topic modeling. The method depends on a supervisor providing a set of seed documents, grouped by abstract themes, such as Schwartz values or personality traits; and possibly a separate background document corpus. Instead of casting the problem into a standard classification framework, we interpret the group labels as a guide for finding distinguishing features. During training, we train each group of documents associated with a theme separately by using nonnegative matrix factorization to obtain theme specific topic distributions. In the analysis, we decompose a new document using the model learned during training to arrive at the theme scores. We demonstrate our approach on two psychometric profiling theories (Schwartz and Big Five) and evaluate our Schwartz scores with leaveone-out cross-validation method and compare Big Five scores to independent surveys, which are much more costly to carry out.
Year
DOI
Venue
2018
10.1109/ICDMW.2018.00194
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW)
Keywords
Field
DocType
non-negative matrix factorization, semi-supervised learning, Schwartz theory of basic human values, big five personality traits, psychometric profiling, personality recognition
Big Five personality traits,Task analysis,Profiling (computer programming),Computer science,Feature extraction,Artificial intelligence,Natural language processing,Encyclopedia,Non-negative matrix factorization,Topic model,Semantics,Machine learning
Conference
ISSN
Citations 
PageRank 
2375-9232
0
0.34
References 
Authors
0
8
Name
Order
Citations
PageRank
Burak Suyunu101.01
Gonul Ayci200.68
Mine Ögretir300.34
A. Taylan Cemgil472.54
Suzan Uskudarli5105.19
Hamza Zeytinoglu620.83
Bülent Özel700.34
Arman Boyaci864.22