Title
On developing indicators with text analytics: exploring concept vectors applied to English and Chinese texts
Abstract
This paper investigates how high-quality, vocabulary-based classifiers, useful for competitive intelligence, can be found for relatively small corpora of publicly available documents. Two corpora of recent annual reports are examined and compared, one in English and one in Chinese. The paper tests whether vocabularies can predict whether firms are relatively innovative or not, examining vocabularies of both content words and function words. We find that indeed the tested vocabularies do produce effective indicators or classifiers and, surprisingly, that function words are especially effective. The paper also provides extensive conceptual and theoretical background to frame the investigation in the context of an EMCUT problematic, that of mapping entities to classification schemes using information derived from text.
Year
DOI
Venue
2014
10.1007/s10257-013-0228-x
Information Systems and e-Business Management
Keywords
Field
DocType
annual reports,content analysis,vector semantics,competitive intelligence,machine learning,text mining
Competitive intelligence,Content analysis,Text mining,Computer science,Classification scheme,Artificial intelligence,Natural language processing,Vocabulary
Journal
Volume
Issue
ISSN
12
3
1617-9854
Citations 
PageRank 
References 
0
0.34
16
Authors
4
Name
Order
Citations
PageRank
Steven O. Kimbrough1600103.93
Christine Chou261.12
Yi-Ting Chen3324.33
Hilary Lin400.34