Abstract | ||
---|---|---|
With the rapid growth of the available information on the Internet, it is more difficult for us to find the relevant information quickly on the Web. Named Entity Recognition (NER), one of the key techniques in some web information processing tools such as information retrieval and information extraction, has been paid more and more attention. In this paper we address the problem of Chinese NER using a hybrid-statistical model. This study is concentrated on entity names (personal names, location names and organization names), temporal expressions (dates and times) and number expressions. The method is characterized as follows: firstly, NER and Part-of-Speech tagging have been integrated into a unified framework; secondly, it combines Hidden Markov Model (HMM) with Maximum Entropy Model (MEM) by taking MEM as a sub-model invoked in Viterbi algorithm; thirdly, the Part-of-Speech information of the context has been used in MEM. The experiment shows that the hybrid-statistical model could achieve preferable results of Chinese NER, in which the F1 value ranges from 74% to 92% for all kinds of named entities on an open-test data. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1007/978-3-540-31849-1_86 | APWeb |
Keywords | Field | DocType |
part-of-speech information,information retrieval,available information,entity recognition,maximum entropy model,relevant information,hidden markov model,web information processing tool,chinese ner,hybrid-statistical model,information extraction,statistical model,viterbi algorithm,part of speech,information processing | Data mining,Expression (mathematics),Computer science,Natural language processing,Artificial intelligence,Viterbi algorithm,Entity linking,Information extraction,Statistical model,Principle of maximum entropy,Hidden Markov model,Named-entity recognition,Database | Conference |
Volume | ISSN | ISBN |
3399 | 0302-9743 | 3-540-25207-X |
Citations | PageRank | References |
1 | 0.40 | 9 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiaoyan Zhang | 1 | 1 | 0.40 |
Ting Wang | 2 | 36 | 9.43 |
Jintao Tang | 3 | 89 | 14.00 |
Huiping Zhou | 4 | 17 | 7.96 |
Huo-wang Chen | 5 | 235 | 33.47 |