Abstract | ||
---|---|---|
Previous work addressing the issue of word distribution in documents has shown the importance of word repetitiveness as an indicator of the word content- bearing characteristics. In this paper we propose a sim- ple method using a measure of the tendency of words to repeat within a document to separate the words with similar document frequencies, but different topic discrim- inating characteristics. |
Year | DOI | Venue |
---|---|---|
2000 | 10.1145/345508.345641 | SIGIR |
Field | DocType | Citations |
tf–idf,Information retrieval,Computer science | Conference | 1 |
PageRank | References | Authors |
0.36 | 6 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Martin Franz | 1 | 483 | 53.56 |
J. Scott Mccarley | 2 | 214 | 21.36 |