Title
A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture
Abstract
Information extraction from large data repositories is critical to Information Management solutions. In addition to prerequisite corpus analysis, to determine domain-specific characteristics of text resources, developing, refining and evaluating analytics entails a complex and lengthy process, typically requiring more than just domain expertise. Modern architectures for text processing, while facilitating reuse and (re-) composition of analytical pipelines, place additional constraints upon the analytics development, as domain experts need not only configure individual annotator components, but situate these within a fully functional annotator pipeline. We present the design, and current status, of a tool for configuring model-driven annotators, which abstracts away from annotator implementation details, pipeline composition constraints, and data management. Instead, the tool embodies support for all stages of ontology-centric model development cycle - from corpus analysis and concept definition, to model development and testing, to large scale evaluation, to easy and rapid composition of text applications deploying these concept models. With our design, we aim to meet the needs of domain experts, who are not necessarily expert NLP practitioners.
Year
Venue
Keywords
2008
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008
information extraction,data management,development environment,information management
Field
DocType
Citations 
Information management,Architecture,Computer science,Reuse,Subject-matter expert,Information extraction,Artificial intelligence,Natural language processing,Analytics,Data management,Text processing
Conference
0
PageRank 
References 
Authors
0.34
6
5
Name
Order
Citations
PageRank
Youssef Drissi126111.74
Branimir Boguraev2549108.99
David Ferrucci3101.18
Paul Keyser410.73
Anthony Levas515914.49