Abstract |
---|
In this paper we introduce SAPIENT, a web application for sentence-based annotation of full papers with semantic information. SAPIENT enables experts to annotate scientific papers sentence by sentence and to link related sentences together, forming spans of interesting regions that can facilitate text mining applications. As part of the system, we developed an XML-aware sentence splitter (SSSplit), which preserves XML markup and identifies sentences by adding in-line markup. SAPIENT has been used in a systematic study in which scientific papers were annotated with concepts representing the Core Information about Scientific Papers (CISP), creating a corpus of 225 annotated papers. |
Year | Venue | Keywords |
---|---|---|
2009 | BioNLP@HLT-NAACL | annotated paper, interesting region, core information, xml-aware sentence splitter, semantic annotation, scientific paper, scientific papers, in-line markup, enrichment tool, scientific papers sentence, full paper, xml markup, text mining |
DocType | Citations | PageRank |
---|---|---|
Conference | 6 | 0.63 |
References | Authors |
---|---|
8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Maria Liakata | 1 | 375 | 30.40 |
Claire Q | 2 | 6 | 0.63 |
Larisa N. Soldatova | 3 | 180 | 20.75 |