Automatic Paragraph Identification: A Study across Languages and Domains - Citegraph

Paper Info

Title
Automatic Paragraph Identification: A Study across Languages and Domains

Abstract
In this paper we investigate whether paragraphs can be identified automatically in different languages and domains. We propose a machine learning ap- proach which exploits textual and discourse cues and we assess how well humans perform on this task. Our best models achieve an accuracy that is significantly higher than the best baseline and, for most data sets, comes to within 6% of human per- formance.

Year	Venue	Keywords
2004	EMNLP	machine learning,human performance
Field	DocType	Volume
Computer science,Paragraph,Artificial intelligence,Natural language processing,Linguistics	Conference	W04-32
Citations	PageRank	References
5	0.55	9
Authors
2

Authors (2 rows)

Cited by (5 rows)

References (9 rows)

Name	Order	Citations	PageRank
Caroline Sporleder	1	453	31.84
Mirella Lapata	2	5973	369.52

1