Title
Robust and efficient multiclass SVM models for phrase pattern recognition
Abstract
Phrase pattern recognition (phrase chunking) refers to automatic approaches for identifying predefined phrase structures in a stream of text. Support vector machines (SVMs)-based methods had shown excellent performance in many sequential text pattern recognition tasks such as protein name finding, and noun phrase (NP)-chunking. Even though they yield very accurate results, they are not efficient for online applications, which need to handle hundreds of thousand words in a limited time. In this paper, we firstly re-examine five typical multiclass SVM methods and the adaptation to phrase chunking. However, most of them were inefficient when the number of phrase types scales. We thus introduce the proposed two new multiclass SVM models that make the system substantially faster in terms of training and testing while keeps the SVM accurate. The two methods can also be applied to similar tasks such as named entity recognition and Chinese word segmentation. Experiments on CoNLL-2000 chunking and Chinese base-chunking tasks showed that our method can achieve very competitive accuracy and at least 100 times faster than the state-of-the-art SVM-based phrase chunking method. Besides, the computational time complexity and the time cost analysis of our methods were also given in this paper.
Year
DOI
Venue
2008
10.1016/j.patcog.2008.02.010
Pattern Recognition
Keywords
Field
DocType
state-of-the-art svm-based phrase,natural language processing,efficient multiclass svm model,limited time,entity recognition,predefined phrase structure,computational time complexity,phrase types scale,phrase chunk,new multiclass svm model,support vector machines,multiclass classification,machine learning,phrase pattern recognition,noun phrase,support vector machine,time complexity,pattern recognition,cost analysis
Noun phrase,Pattern recognition,Computer science,Phrase chunking,Support vector machine,Phrase,Speech recognition,Text segmentation,Artificial intelligence,Chunking (psychology),Named-entity recognition,Multiclass classification
Journal
Volume
Issue
ISSN
41
9
Pattern Recognition
Citations 
PageRank 
References 
38
1.38
40
Authors
3
Name
Order
Citations
PageRank
Yu-Chieh Wu124723.16
Yue-Shi Lee254341.14
Jie-Chi Yang335043.91