Title
An effective two-stage model for exploiting non-local dependencies in named entity recognition
Abstract
This paper shows that a simple two-stage approach to handle non-local dependencies in Named Entity Recognition (NER) can outperform existing approaches that handle non-local dependencies, while being much more computationally efficient. NER systems typically use sequence models for tractable inference, but this makes them unable to capture the long distance structure present in text. We use a Conditional Random Field (CRF) based NER system using local features to make predictions and then train another CRF which uses both local information and features extracted from the output of the first CRF. Using features capturing non-local dependencies from the same document, our approach yields a 12.6% relative error reduction on the F1 score, over state-of-the-art NER systems using local-information alone, when compared to the 9.3% relative error reduction offered by the best systems that exploit non-local information. Our approach also makes it easy to incorporate non-local information from other documents in the test corpus, and this gives us a 13.3% error reduction over NER systems using local-information alone. Additionally, our running time for inference is just the inference time of two sequential CRFs, which is much less than that directly model the dependencies and do approximate inference.
Year
DOI
Venue
2006
10.3115/1220175.1220316
ACL
Keywords
Field
DocType
tractable inference,ner system,error reduction,non-local dependency,approach yield,approximate inference,relative error reduction,inference time,effective two-stage model,entity recognition,state-of-the-art ner system,non-local information
Computer science,Approximate inference,Artificial intelligence,Natural language processing,CRFS,Conditional random field,F1 score,Pattern recognition,Inference,Exploit,Named-entity recognition,Approximation error,Machine learning
Conference
Volume
Citations 
PageRank 
P06-1
67
4.75
References 
Authors
12
2
Name
Order
Citations
PageRank
Vijay Krishnan119311.34
Christopher D. Manning2225791126.22