Title
Pre-processing online financial text for sentiment classification: A natural language processing approach
Abstract
Online financial textual information contains a large amount of investor sentiment, i.e. subjective assessment and discussion with respect to financial instruments. An effective solution to automate the sentiment analysis of such large amounts of online financial texts would be extremely beneficial. This paper presents a natural language processing (NLP) based pre-processing approach both for noise removal from raw online financial texts and for organizing such texts into an enhanced format that is more usable for feature extraction. The proposed approach integrates six NLP processing steps, including a developed syntactic and semantic combined negation handling algorithm, to reduce noise in the online informal text. Three-class sentiment classification is also introduced in each system implementation. Experimental results show that the proposed pre-processing approach outperforms other pre-processing methods. The combined negation handling algorithm is also evaluated against three standard negation handling approaches.
Year
DOI
Venue
2014
10.1109/CIFEr.2014.6924063
CIFEr
Keywords
Field
DocType
raw online financial texts,nlp based preprocessing approach,noise removal,online informal text,pattern classification,financial data processing,negation handling algorithm,natural language processing approach,financial instruments,emotion recognition,feature extraction,internet,investor sentiment analysis,online financial text preprocessing,natural language processing,text analysis,three-class sentiment classification,online financial textual information,sentiment analysis,niobium,semantics
USable,Negation,Computer science,Implementation,Financial instrument,Artificial intelligence,Natural language processing,Syntax,Information retrieval,Sentiment analysis,Feature extraction,Finance,Semantics
Conference
ISSN
Citations 
PageRank 
2380-8454
5
0.45
References 
Authors
13
5
Name
Order
Citations
PageRank
Fan Sun150.79
Ammar Belatreche225623.11
Sonya Coleman321636.84
T. Martin Mcginnity451866.30
Yuhua Li5111353.63