Abstract |
---|
We propose a text summarization system, MySum, that estimates the significance of sentences from asymmetric word similarity and topic similarity in order to produce a summary. We use mass assignment theory to compute the similarity between words on the basis of their contexts. The algorithm is incremental, so words or documents can be added or removed without massive re-computation. Words are considered similar if they appear in similar contexts; they do not have to be synonyms. We also compute the similarity of a sentence to the topic using the frequency of overlapping words. We compare the generated summaries with human-written summaries and with those of another system, TF.ISF (term frequency-inverse sentence frequency). Our method generates summaries that are up to 60% similar to the manually created summaries taken from the DUC 2002 test collection. |
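
The two simpler scoring ideas named in the abstract, sentence-to-topic word overlap and the TF.ISF baseline, can be illustrated with a minimal sketch. This is not the authors' implementation: the tokenizer, the normalization by sentence length, the function names `topic_overlap` and `tf_isf_scores`, and the formulation isf(w) = log(N / sf(w)) are all assumptions chosen for illustration, and the mass-assignment word similarity is not reproduced here.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and keep runs of ASCII letters (a deliberately naive tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def topic_overlap(sentence, topic):
    """Fraction of sentence tokens that also occur in the topic text.

    Assumption: the abstract only says "frequency of overlapping words";
    dividing by sentence length is one plausible normalization.
    """
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    topic_words = set(tokenize(topic))
    return sum(1 for t in tokens if t in topic_words) / len(tokens)


def tf_isf_scores(sentences):
    """Score each sentence by the sum of tf * isf over its distinct words,
    divided by the sentence length, with isf(w) = log(N / sf(w)).

    N is the number of sentences and sf(w) the number of sentences
    containing w. This is a common TF-ISF formulation, assumed here.
    """
    tokenized = [tokenize(s) for s in sentences]
    n = len(tokenized)
    # Count each word once per sentence to get sentence frequencies.
    sentence_freq = Counter(w for toks in tokenized for w in set(toks))
    scores = []
    for toks in tokenized:
        if not toks:
            scores.append(0.0)
            continue
        tf = Counter(toks)
        total = sum(count * math.log(n / sentence_freq[w]) for w, count in tf.items())
        scores.append(total / len(toks))
    return scores
```

A summarizer built on these scores would rank sentences by score and extract the top few. Words that occur in every sentence (such as "the") receive an isf of zero and so do not contribute to the TF-ISF score.
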
Year | DOI | Venue |
---|---|---|
2004 | 10.1007/3-540-31662-0_39 | APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY |
Keywords | Field | DocType |
sentence extraction,asymmetric word similarity,topic similarity,mass assignment,fuzzy | Information retrieval,Computer science,Document summarization,Natural language processing,Artificial intelligence,Sentence extraction | Conference |
Volume | ISSN | Citations |
34 | 1615-3871 | 1 |
PageRank | References | Authors |
0.39 | 7 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Masrah Azmi-Murad | 1 | 10 | 1.32 |
Trevor P. Martin | 2 | 134 | 26.98 |