Title | ||
---|---|---|
Inductive improvement of part-of-speech tagging and its effect on a terminology of molecular biology |
Abstract | ||
---|---|---|
In the context of Part-of-Speech (PoS) tagging of specialized corpora, we proposed an inductive approach focusing on the most ‘important’ PoS tags, because mistaking them can lead to a total misunderstanding of the text. After a standard tagging of a biological corpus by Brill's tagger, we noted persistent errors that are very hard to deal with. As an application, we studied two cases of different nature: first, confusion between past participle, adjective and preterit for verbs that end with ‘ed’; second, confusion between plural nouns and verbs in the 3rd person singular present. With a friendly user interface, the expert corrected the examples. Then, from these well-annotated examples, we induced rules using a propositional rule induction algorithm. Experimental validation showed improvement in tagging precision. The relevance of the terminology of the considered field, here molecular biology, is greatly improved when the number of these tagging errors decreases. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1007/11424918_38 | Canadian Conference on AI |
Keywords | Field | DocType |
tagging precision,considered field,friendly user interface,tagging errors decrease,standard tagging,part-of-speech tagging,inductive improvement,biological corpus,inductive approach,different nature,past participle,molecular biology,user interface,part of speech,noun | Inductive logic programming,Verb,Participle,Plural,Terminology,Computer science,Noun,Natural language processing,Artificial intelligence,Rule induction,Molecular biology,Adjective | Conference |
Volume | ISSN | ISBN |
3501 | 0302-9743 | 3-540-25864-7 |
Citations | PageRank | References |
1 | 0.36 | 20 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ahmed Amrani | 1 | 10 | 2.28 |
Mathieu Roche | 2 | 3 | 1.46 |
Yves Kodratoff | 3 | 581 | 172.25 |
Oriane Matte-Tailliez | 4 | 18 | 3.23 |