Title
Mobile phone name extraction from internet forums: a semi-supervised approach
Abstract
Collecting users' feedback on products from Internet forums is challenging because users often mention a product with informal abbreviations or nicknames. In this paper, we propose a method named Gren to recognize and normalize mobile phone names from domain-specific Internet forums. Instead of directly recognizing phone names from sentences as in most named entity recognition tasks, we propose an approach to generating candidate names as the first step. The candidate names capture short forms, spelling variations, and nicknames of products, but are not noise free. To predict whether a candidate name mention in a sentence indeed refers to a specific phone model, a Conditional Random Field (CRF)-based name recognizer is developed. The CRF model is trained by using a large set of sentences obtained in a semi-automatic manner with minimal manual labeling effort. Lastly, a rule-based name normalization component maps a recognized name to its formal form. Evaluated on more than 4000 manually labeled sentences with about 1000 phone name mentions, Gren outperforms all baseline methods. Specifically, it achieves precision and recall of 0.918 and 0.875 respectively, with the best feature setting. We also provide detailed analysis of the intermediate results obtained by each of the three components in Gren.
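The abstract says only that the final component is rule-based and maps a recognized name to its formal form; it does not give the rules. As a rough illustration of what such a step could look like, the sketch below uses two made-up rules (a nickname lookup and an unambiguous suffix match). The catalogue `FORMAL_NAMES`, the table `NICKNAMES`, and the functions `squash` and `normalize` are all hypothetical and are not taken from Gren.

```python
import re

# Hypothetical catalogue of formal names; not taken from the paper.
FORMAL_NAMES = ["Samsung Galaxy S3", "Apple iPhone 5", "Nokia Lumia 920"]

# Hypothetical nickname table, for illustration only.
NICKNAMES = {"sgs3": "Samsung Galaxy S3"}


def squash(text):
    """Lowercase the text and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def normalize(mention):
    """Map a recognized mention to a formal name, or None if no rule fires."""
    key = squash(mention)
    if not key:
        return None
    # Rule 1 (assumed): explicit nickname lookup.
    if key in NICKNAMES:
        return NICKNAMES[key]
    # Rule 2 (assumed): the mention is a suffix of exactly one formal name,
    # which covers short forms such as "galaxy s3" or "iPhone 5".
    matches = [name for name in FORMAL_NAMES if squash(name).endswith(key)]
    return matches[0] if len(matches) == 1 else None


if __name__ == "__main__":
    for mention in ["galaxy s3", "SGS3", "iPhone 5", "lumia"]:
        print(mention, "->", normalize(mention))
```

Returning `None` when no rule fires, or when more than one formal name matches, keeps ambiguous mentions unresolved instead of guessing. A real normalizer would presumably need a much larger catalogue and richer rules.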
Year: 2016
DOI: 10.1007/s11280-015-0361-1
Venue: World Wide Web
Keywords: Mobile phone, Name recognition and normalization, Internet forum
Field: Conditional random field, Data mining, Computer science, Precision and recall, Phone, Spelling, Mobile phone, Named-entity recognition, Sentence, The Internet
DocType: Journal
Volume: 19
Issue: 5
ISSN: 1386-145X
Citations: 1
PageRank: 0.35
References: 18
Authors: 2
Name          Order  Citations  PageRank
Yangjie Yao   1      2          1.04
Aixin Sun     2      3071       156.89