Title |
---|
Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web |
Abstract |
---|
Dictionaries only contain some of the information we need to know about a language. The growth of the Web, the maturation of linguistic processing tools, and the decline in price of memory storage allow us to envision descriptions of languages that are much larger than before. We can conceive of building a complete language model for a language using all the text that is found on the Web for this language. This article describes our current project to do just that. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1007/978-3-540-70939-8_4 | CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing |
Keywords | Field | DocType |
complete language model,massive scale,conquering language,memory storage,current project,linguistic processing tool,build high dimensional language,language model | Specification language,World Wide Web,Data control language,Computer science,Object language,Language identification,Artificial intelligence,Natural language processing,Universal Networking Language,First-generation programming language,Language technology,Language primitive | Conference |
Volume | ISSN | Citations |
4394 | 0302-9743 | 7 |
PageRank | References | Authors |
0.66 | 16 | 1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Gregory Grefenstette | 1 | 1129 | 147.00 |