Title
Mining Unstructured Text at Gigabyte per Second Speeds
Abstract
Humans communicate with text in thousands of languages, in dozens of scripts, in a variety of binary codes, on millions of topics. There is a need, for both government and commercial applications, to identify these text characteristics to enable follow-on processing such as transcoding, translation, transliteration, routing and prioritization. This paper deals with the implementation of real-time mining of unstructured text on high-speed hardware capable of processing network data streams at gigabyte per second speeds.
Year
2008
DOI
10.1109/ICDMW.2008.9
Venue
ICDM Workshops
Keywords
second speeds,mining unstructured text,high-speed hardware,text characteristic,binary code,paper deal,real-time mining,commercial application,processing network data stream,unstructured text,follow-on processing,language,real time,hardware,encoding,data mining,computer aided manufacturing,natural,text analysis,pattern matching,processing
Field
Computer-aided manufacturing,Data mining,Transcoding,Text mining,Computer science,Gigabyte,Binary code,Artificial intelligence,Pattern matching,Machine learning,Transliteration,Scripting language
DocType
Conference
ISBN
978-0-7695-3503-6
Citations
0
PageRank
0.34
References
2
Authors
1
Name
Alan Ratner
Order
1
Citations
1
PageRank
1.40