Abstract | ||
---|---|---|
ABSTRACT This paper describes the tests made on chunking methods,used for plagiarism detection. The result of the ,tests makes ,it possible to decide,on the ,best fitting chunking ,method ,for ,a given application. For example, overlapping word chunking is good for agrammar analyzer or for small databases, sentence chunking suits best for finding quoted texts, hashed breakpoint chunking is the fastest method ,therefore advisable for search in big ,set of documents, or if more reliability is needed overlapping hashed breakpoint chunking,can be used as well. Categories and Subject Descriptors H.3.1 [Content Analysis and Indexing] General Terms |
Year | Venue | Keywords |
---|---|---|
2003 | WWW Posters | content analysis,indexation |
Field | DocType | Citations |
Chunking (computing),Plagiarism detection,Computer science,Grammar,Artificial intelligence,Chunking (psychology),Natural language processing,Sentence | Conference | 2 |
PageRank | References | Authors |
0.37 | 0 | 1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Máté Pataki | 1 | 24 | 4.15 |