Title | ||
---|---|---|
Part-of-Speech Tagging for Code-Mixed English-Hindi Twitter and Facebook Chat Messages. |
Abstract | ||
---|---|---|
The paper reports work on collecting and annotating code-mixed English-Hindi social media text (Twitter and Facebook messages), and experiments on automatic tagging of these corpora, using both a coarse-grained and a fine-grained part-of-speech tag set. We compare the performance of a combination of language-specific taggers to that of applying four machine learning algorithms to the task (Conditional Random Fields, Sequential Minimal Optimization, Naive Bayes and Random Forests), using a range of different features based on word context and word-internal information. |
Year | Venue | Field |
---|---|---|
2015 | RANLP | Conditional random field, Social media, Naive Bayes classifier, Hindi, Computer science, Part-of-speech tagging, Cyberpsychology, Natural language processing, Artificial intelligence, Random forest, Sequential minimal optimization |
DocType | Citations | PageRank |
---|---|---|
Conference | 11 | 0.83 |
References | Authors | |
---|---|---|
24 | 3 | |
Name | Order | Citations | PageRank |
---|---|---|---|
Anupam Jamatia | 1 | 12 | 3.28 |
Björn Gambäck | 2 | 155 | 36.86 |
Amitava Das | 3 | 198 | 42.49 |