Title
Extracting named entities using support vector machines
Abstract
Identifying proper names, like gene names, DNAs, or proteins is useful to help researchers to mining the text information. Learning to extract proper names in natural language text is a named entity recognition (NER) task. Previous studies focus on combining abundant human made rules, trigger words, to enhance the system performance. However these methods require domain experts to build up these rules and word set which relies on lots of human efforts. In this paper, we present a robust named entity recognition system based on support vector machines (SVM). By integrating with rich feature set and the proposed mask method, the system performance is satisfactory on the MUC-7 and biology named entity recognition tasks which outperforms famous machine learning-based method, such as hidden markov model (HMM), and maximum entropy model (MEM). We compare our method to previous systems that were performed on the same data set. The experiments show that when training with the MUC-7 data set, our system achieves 86.4 in F(β=1) rate and 81.57 for the biology corpus. Besides, our named entity system is able to handle real time processing applications, the turn around time on a 63 K words document set is less than 30 seconds.
Year
DOI
Venue
2006
10.1007/11683568_8
KDLL
Keywords
Field
DocType
support vector machine,system performance,entity recognition system,entity recognition task,entity recognition,word set,rich feature set,proper name,entity system,previous system,k words document set,machine learning,ir,natural language,real time processing,hidden markov model,maximum entropy model,proper names
Computer science,Support vector machine,Natural language,Artificial intelligence,Turnaround time,Natural language processing,Principle of maximum entropy,Hidden Markov model,Named-entity recognition,Gene nomenclature,Proper noun,Machine learning
Conference
Volume
ISSN
ISBN
3886
0302-9743
3-540-32809-2
Citations 
PageRank 
References 
16
0.74
17
Authors
4
Name
Order
Citations
PageRank
Yu-Chieh Wu124723.16
Teng-Kai Fan2302.81
Yue-Shi Lee354341.14
Show-Jane Yen4537130.05