Title | ||
---|---|---|
Measurement Context Extraction from Text: Discovering Opportunities and Gaps in Earth Science. |
Abstract | ||
---|---|---|
We propose Marve, a system for extracting measurement values, units, and related words from natural language text. Marve uses conditional random fields (CRF) to identify measurement values and units, followed by a rule-based system to find related entities, descriptors and modifiers within a sentence. Sentence tokens are represented by an undirected graphical model, and rules are based on part-of-speech and word dependency patterns connecting values and units to contextual words. Marve is unique in its focus on measurement context and early experimentation demonstrates Marveu0027s ability to generate high-precision extractions with strong recall. We also discuss Marveu0027s role in refining measurement requirements for NASAu0027s proposed HyspIRI mission, a hyperspectral infrared imaging satellite that will study the worldu0027s ecosystems. In general, our work with HyspIRI demonstrates the value of semantic measurement extractions in characterizing quantitative discussion contained in large corpuses of natural language text. These extractions accelerate broad, cross-cutting research and expose scientists new algorithmic approaches and experimental nuances. They also facilitate identification of scientific opportunities enabled by HyspIRI leading to more efficient scientific investment and research. |
Year | Venue | Field |
---|---|---|
2017 | arXiv: Information Retrieval | Conditional random field,Data mining,Information retrieval,Computer science,Hyperspectral imaging,Natural language,Graphical model,Sentence |
DocType | Volume | ISSN |
Journal | abs/1710.04312 | 23rd ACM SIGKDD International Conference on Knowledge Discovery
and Data Mining, Data-Driven Discovery Workshop, Halifax, Canada, August 2017 |
Citations | PageRank | References |
0 | 0.34 | 11 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kyle Hundman | 1 | 1 | 0.73 |
Chris A. Mattmann | 2 | 200 | 25.39 |