Abstract | ||
---|---|---|
Automatic text chunking aims to recognize grammatical phrase structures in natural language text. Text chunking provides downstream syntactic information for further analysis, which is also an important technology in the area of text mining (TM) and natural language processing (NLP). Existing chunking systems make use of external knowledge, e.g. grammar parsers, or integrate multiple learners to achieve higher performance. However, the external knowledge is almost unavailable in many domains and languages. Besides, employing multiple learners does not only complicate the system architecture, but also increase training and testing time costs. In this paper, we present a novel phrase chunking model based on the proposed mask method without employing external knowledge and multiple learners. The mask method could automatically derive more training examples from the original training data, which significantly improves system performance. We had evaluated our method in different chunking tasks and languages in comparison to previous studies. The experimental results show that our method achieves state of the art performance in chunking tasks. In two English chunking tasks, i.e., shallow parsing and base-chunking, our method achieves 94.22 and 93.23 in F"("@b"="1") rates. When porting to Chinese, the F"("@b"="1") rate is 92.30. Also, our chunker is quite efficient. The complete chunking time of a 50K-words is less than 10s. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1016/j.eswa.2006.06.022 | Expert Syst. Appl. |
Keywords | Field | DocType |
text mining,knowledge discover in text,chunking system,proposed mask method,english chunking task,complete chunking time,chunking task,robust multilingual portable phrase,shallow parsing,natural language processing,phrase chunking,different chunking task,mask method,external knowledge,multiple learner,automatic text chunk,system architecture,natural language,system performance | Chunking (computing),Shallow parsing,Phrase chunking,Computer science,Phrase,Speech recognition,Natural language,Natural language processing,Chunking (psychology),Artificial intelligence,Parsing,Syntax | Journal |
Volume | Issue | ISSN |
33 | 3 | Expert Systems With Applications |
Citations | PageRank | References |
14 | 0.69 | 40 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yue-Shi Lee | 1 | 543 | 41.14 |
Yu-Chieh Wu | 2 | 247 | 23.16 |