Abstract | ||
---|---|---|
The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resource for the biology domain, providing information about predicate-argument structure that has been bootstrapped from a biomedical corpus on the subject of E. Coli. The lexicon is currently focussed on verbs, and includes both automatically-extracted syntactic subcategorization frames, as well as semantic event frames that are based on annotation by domain experts. In addition, the lexicon contains manually-added explicit links between semantic and syntactic slots in corresponding frames. To our knowledge, this lexicon currently represents a unique resource within in the biomedical domain. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1007/978-3-642-00382-0_11 | CICLing |
Keywords | Field | DocType |
biomedical domain,verb lexicon,general domain,biomedical information extraction,semantic property,semantic event frame,specialized domain,automatically-extracted syntactic subcategorization frame,biology domain,syntactic slot,domain expert,biomedical corpus,information extraction | Verb,Annotation,Subcategorization,Computer science,Lexical item,Semantic property,Information extraction,Lexicon,Artificial intelligence,Natural language processing,Syntax | Conference |
Volume | ISSN | Citations |
5449 | 0302-9743 | 9 |
PageRank | References | Authors |
0.46 | 8 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Giulia Venturi | 1 | 132 | 21.20 |
Simonetta Montemagni | 2 | 255 | 33.40 |
Simone Marchi | 3 | 71 | 5.52 |
Yutaka Sasaki | 4 | 97 | 6.15 |
Paul Thompson | 5 | 199 | 8.80 |
John McNaught | 6 | 422 | 22.96 |
Sophia Ananiadou | 7 | 2658 | 183.08 |