Abstract | ||
---|---|---|
The analysis of discourse phenomena is essential in many natural language processing (NLP) applications. The growing diversity of available corpora and NLP tools brings a multitude of representation formats. In order to alleviate the problem of incompatible formats when constructing complex text mining pipelines, the Unstructured Information Management Architecture (UIMA) provides a standard means of communication between tools and resources. U-Compare, a text mining workflow construction platform based on UIMA, further enhances interoperability through a shared system of data types, allowing free combination of compliant components into workflows. Although U-Compare and its type system already support syntactic and semantic analyses, support for the analysis of discourse phenomena was previously lacking. In response, we have extended the U-Compare type system with new discourse-level types. We illustrate processing and visualisation of discourse information in U-Compare by providing several new deserialisation components for corpora containing discourse annotations. The new U-Compare is downloadable from http://nactem.ac.uk/ucompare. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1007/978-3-642-37247-6_45 | CICLing |
Keywords | Field | DocType |
discourse information,u-compare type system,new deserialisation component,new u-compare,nlp tool,new discourse-level type,discourse annotation,shared system,discourse phenomenon,interoperable nlp platform,type system,coreference,causality | Data science,Information management,Coreference,Computer science,Interoperability,Visualization,Data type,Metaknowledge,Artificial intelligence,Natural language processing,Workflow,Syntax | Conference |
Citations | PageRank | References |
5 | 0.42 | 28 |
Authors | ||
8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Riza Theresa Batista-Navarro | 1 | 98 | 10.87 |
Georgios Kontonatsios | 2 | 45 | 8.03 |
Claudiu Mihăilă | 3 | 32 | 4.35 |
Paul Thompson | 4 | 98 | 6.25 |
Rafal Rak | 5 | 382 | 18.30 |
Raheel Nawaz | 6 | 126 | 16.98 |
Ioannis Korkontzelos | 7 | 244 | 24.60 |
Sophia Ananiadou | 8 | 2658 | 183.08 |