Detecting Subject Boundaries Within Text: A Language Independent Statistical Approach - Citegraph

Paper Info

Title
Detecting Subject Boundaries Within Text: A Language Independent Statistical Approach

Abstract
We describe here an algorithm for detect- ing subject boundaries within text based on a statistical lexical similarity measure. Hearst has already tackled this problem with good results (Hearst, 1994). One of her main assumptions is that a change in subject is accompanied by a change in vo- cabulary. Using this assumption, but by introducing a new measure of word signif- icance, we have been able to build a ro- bust and reliable algorithm which exhibits improved accuracy without sacrificing lan- guage independency.

Year	Venue	Field
1997	EMNLP	Computer science,Speech recognition,Artificial intelligence,Natural language processing
DocType	Volume	Citations
Conference	W97-03	17
PageRank	References	Authors
2.93	5	3

Authors (3 rows)

Cited by (17 rows)

References (5 rows)

Name	Order	Citations	PageRank
Korin Richmond	1	531	46.14
Andrew Smith	2	27	6.55
Einat Amitay	3	597	54.66

1