Title | ||
---|---|---|
Detecting Subject Boundaries Within Text: A Language Independent Statistical Approach |
Abstract | ||
---|---|---|
We describe here an algorithm for detect- ing subject boundaries within text based on a statistical lexical similarity measure. Hearst has already tackled this problem with good results (Hearst, 1994). One of her main assumptions is that a change in subject is accompanied by a change in vo- cabulary. Using this assumption, but by introducing a new measure of word signif- icance, we have been able to build a ro- bust and reliable algorithm which exhibits improved accuracy without sacrificing lan- guage independency. |
Year | Venue | Field |
---|---|---|
1997 | EMNLP | Computer science,Speech recognition,Artificial intelligence,Natural language processing |
DocType | Volume | Citations |
Conference | W97-03 | 17 |
PageRank | References | Authors |
2.93 | 5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Korin Richmond | 1 | 531 | 46.14 |
Andrew Smith | 2 | 27 | 6.55 |
Einat Amitay | 3 | 597 | 54.66 |