Title
Conquering Language: Using NLP on a Massive Scale to Build High Dimensional Language Models from the Web
Abstract
Dictionaries only contain some of the information we need to know about a language. The growth of the Web, the maturation of linguistic processing tools, and the decline in price of memory storage allow us to envision descriptions of languages that are much larger than before. We can conceive of building a complete language model for a language using all the text that is found on the Web for this language. This article describes our current project to do just that.
Year
DOI
Venue
2007
10.1007/978-3-540-70939-8_4
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Keywords
Field
DocType
complete language model,massive scale,conquering language,memory storage,current project,linguistic processing tool,build high dimensional language,language model
Specification language,World Wide Web,Data control language,Computer science,Object language,Language identification,Artificial intelligence,Natural language processing,Universal Networking Language,First-generation programming language,Language technology,Language primitive
Conference
Volume
ISSN
Citations 
4394
0302-9743
7
PageRank 
References 
Authors
0.66
16
1
Name
Order
Citations
PageRank
Gregory Grefenstette11129147.00