Title
France Telecom R&D Beijing Word Segmenter for Sighan Bakeoff 2006
Abstract
This paper presents two word segmenta- tion (WS) systems and a named entity recognition (NER) system in France Telecom R&D Beijing. The one system of WS is for open tracks based on n- gram language model and another one is for closed tracks based on maximum en- tropy approach. The NER system uses a hybrid algorithm based on Class-based language model and rule-based knowl- edge. These systems are all augmented with a set of post-processors.
Year
Venue
Keywords
2006
SIGHAN@COLING/ACL
language model,rule based,hybrid algorithm,word segmentation
Field
DocType
Citations 
Telecommunications,Hybrid algorithm,Computer science,Text segmentation,Speech recognition,Artificial intelligence,Natural language processing,Principle of maximum entropy,Named-entity recognition,Language model,Beijing
Conference
4
PageRank 
References 
Authors
0.70
4
6
Name
Order
Citations
PageRank
wu liu140.70
h li241.37
Yuan DONG313925.66
nan he440.70
Haitao Luo5255.00
Haila WANG626523.54