Abstract | ||
---|---|---|
Humans communicate with text in thousands of languages, in dozens of scripts, in a variety of binary codes, on millions of topics. There is a need, for both government and commercial applications, to identify these text characteristics to enable follow-on processing such as transcoding, translation, transliteration, routing and prioritization. This paper deals with the implementation of real-time mining of unstructured text on high-speed hardware capable of processing network data streams at gigabyte per second speeds. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1109/ICDMW.2008.9 | ICDM Workshops |
Keywords | Field | DocType |
second speeds,mining unstructured text,high-speed hardware,text characteristic,binary code,paper deal,real-time mining,commercial application,processing network data stream,unstructured text,follow-on processing,language,real time,hardware,encoding,data mining,computer aided manufacturing,natural,text analysis,pattern matching,processing | Computer-aided manufacturing,Data mining,Transcoding,Text mining,Computer science,Gigabyte,Binary code,Artificial intelligence,Pattern matching,Machine learning,Transliteration,Scripting language | Conference |
ISBN | Citations | PageRank |
978-0-7695-3503-6 | 0 | 0.34 |
References | Authors | |
2 | 1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alan Ratner | 1 | 1 | 1.40 |