Title
Harnessing English Sentiment Lexicons for Polarity Detection in Urdu Tweets: A Baseline Approach
Abstract
It is human instinct to express emotions, and with the increasing use of social media, they are expressed through text messages more often than ever before. The emotions and sentiments encoded in these short text messages are of keen interest to marketing and advertising agencies. Consequently, various lexicons and algorithms have been devised to extract these hidden sentiments for English and French. In contrast, Urdu (or Hindi), the third most widely spoken language in the world [1], lacks any such sentiment lexicons or algorithms. Instead of starting from scratch, we use existing English sentiment lexicons to develop the first sentiment lexicon for Urdu. This lexicon will serve as a baseline for future lexicons developed through more intimate knowledge of the Urdu language. Furthermore, we compare its performance with various machine learning (ML) approaches. We also make public the labeled dataset that we developed for Urdu sentiment analysis. We hope that this lexicon and dataset will serve as a benchmark for evaluating future lexicons and ML approaches for the Urdu language.
Year
2017
DOI
10.1109/ICSC.2017.68
Venue
2017 IEEE 11th International Conference on Semantic Computing (ICSC)
Keywords
sentiments, lexicon, urdu, polarity
Field
Social media, Hindi, Sentiment analysis, Computer science, Lexicon, Urdu, Artificial intelligence, Natural language processing, Linguistics
DocType
Conference
ISSN
2325-6516
ISBN
978-1-5090-4285-2
Citations
0
PageRank
0.34
References
0
Authors
3
Name                      Order  Citations  PageRank
Muhammad Yaseen Khan      1      0          0.34
Shah Muhammad Emaduddin   2      0          0.34
Khurum Nazir Junejo       3      57         6.08