Classifying Homographs in Japanese Social Media Texts Using a User Interest Model. - Citegraph

Paper Info

Title
Classifying Homographs in Japanese Social Media Texts Using a User Interest Model.

Abstract
The analysis of text data from social media is hampered by irrelevant noisy data, such as homographs. Noisy data is not usable and makes analysis, such as counting estimates, of the target data diffcult, which adversely affects the quality of the analysis results. We focus on this issue and propose a method to classify homographs that are contained in social media texts (i.e. Twitter) using topic models. We also report the results of an evaluation experiment. In the evaluation experiment, the proposed method showed an accuracy improvement of 8.5% and a reduction of 16.5% in the misidentification rate compared with conventional methods.

Year	DOI	Venue
2014	10.1016/j.procs.2014.08.168	Procedia Computer Science
Keywords	DocType	Volume
Social media,Twitter,Homographs,Semantic analysis,Topic modelling,Latent Dirichlet allocation	Conference	35
ISSN	Citations	PageRank
1877-0509	0	0.34
References	Authors
6	2

Authors (2 rows)

Cited by (0 rows)

References (6 rows)

Name	Order	Citations	PageRank
Tomohiko Harada	1	1	1.45
Kazuhiko Tsuda	2	108	47.18

1