Title
Text Mining For Information Systems Researchers: An Annotated Topic Modeling Tutorial.
Abstract
Analysts have estimated that more than 80 percent of today's data is stored in unstructured form (e.g., text, audio, image, video)-much of it expressed in rich and ambiguous natural language. Traditionally, to analyze natural language, one has used qualitative data-analysis approaches, such as manual coding. Yet, the size of text data sets obtained from the Internet makes manual analysis virtually impossible. In this tutorial, we discuss the challenges encountered when applying automated text-mining techniques in information systems research. In particular, we showcase how to use probabilistic topic modeling via Latent Dirichlet allocation, an unsupervised text-mining technique, with a LASSO multinomial logistic regression to explain user satisfaction with an IT artifact by automatically analyzing more than 12,000 online customer reviews. For fellow information systems researchers, this tutorial provides guidance for conducting text-mining studies on their own and for evaluating the quality of others.
Year
Venue
Keywords
2016
COMMUNICATIONS OF THE ASSOCIATION FOR INFORMATION SYSTEMS
Text Mining,Topic Modeling,Latent Dirichlet Allocation,Online Customer Reviews,User Satisfaction
Field
DocType
Volume
Data science,Information system,Text mining,Computer science,Topic model
Journal
39
ISSN
Citations 
PageRank 
1529-3181
8
0.43
References 
Authors
0
4
Name
Order
Citations
PageRank
Stefan Debortoli1344.04
Oliver Müller2392.69
Iris A. Junglas357730.32
Jan vom Brocke41206101.21