Title
Text Segmentation Using Roget-Based Weighted Lexical Chains.
Abstract
In this article we present a new method for text segmentation. The method relies on the number of lexical chains (LCs) which end in a sentence, which begin in the following sentence and which traverse the two successive sentences. The lexical chains are based on Roget's thesaurus (the 1987 and the 1911 version). We evaluate the method on ten texts from the DUC 2002 conference and on twenty texts from the CAST project corpus, using a manual segmentation as gold standard.
Year
Venue
Keywords
2013
COMPUTING AND INFORMATICS
Lexical chains,text segmentation,topic boundaries,Roget's thesaurus,segmentation evaluation
Field
DocType
Volume
Segmentation,Computer science,Speech recognition,Text segmentation,Natural language processing,Artificial intelligence,Sentence,Traverse
Journal
32
Issue
ISSN
Citations 
2
1335-9150
0
PageRank 
References 
Authors
0.34
12
3
Name
Order
Citations
PageRank
Doina Tatar1318.48
Diana Inkpen2105987.92
Gabriela Czibula38019.53