Title | ||
---|---|---|
On developing indicators with text analytics: exploring concept vectors applied to English and Chinese texts |
Abstract | ||
---|---|---|
This paper investigates how high-quality, vocabulary-based classifiers, useful for competitive intelligence, can be found for relatively small corpora of publicly available documents. Two corpora of recent annual reports are examined and compared, one in English and one in Chinese. The paper tests whether vocabularies can predict whether firms are relatively innovative or not, examining vocabularies of both content words and function words. We find that indeed the tested vocabularies do produce effective indicators or classifiers and, surprisingly, that function words are especially effective. The paper also provides extensive conceptual and theoretical background to frame the investigation in the context of an EMCUT problematic, that of mapping entities to classification schemes using information derived from text. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/s10257-013-0228-x | Information Systems and e-Business Management |
Keywords | Field | DocType |
annual reports,content analysis,vector semantics,competitive intelligence,machine learning,text mining | Competitive intelligence,Content analysis,Text mining,Computer science,Classification scheme,Artificial intelligence,Natural language processing,Vocabulary | Journal |
Volume | Issue | ISSN |
12 | 3 | 1617-9854 |
Citations | PageRank | References |
0 | 0.34 | 16 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Steven O. Kimbrough | 1 | 600 | 103.93 |
Christine Chou | 2 | 6 | 1.12 |
Yi-Ting Chen | 3 | 32 | 4.33 |
Hilary Lin | 4 | 0 | 0.34 |