Title
Inductive improvement of part-of-speech tagging and its effect on a terminology of molecular biology
Abstract
In the context of Part-of-Speech (PoS)-tagging of specialized corpora, we proposed an inductive approach focusing on the most ‘important' PoStags because mistaking them can lead to a total misunderstanding of the text After a standard tagging of a biological corpus by Brill's tagger, we noted persistent errors that are very hard to deal with As an application, we studied two cases of different nature: first, confusion between past participle, adjective and preterit for verbs that end with ‘ed'; second, confusion between plural nouns and verbs, 3rd person singular present With a friendly user interface, the expert corrected the examples Then, from these well-annotated examples, we induced rules using a propositional rule induction algorithm Experimental validation showed improvement in tagging precision The relevance of the terminology of the considered field, here molecular biology, is greatly improved when the number of these tagging errors decreases.
Year
DOI
Venue
2005
10.1007/11424918_38
Canadian Conference on AI
Keywords
Field
DocType
tagging precision,considered field,friendly user interface,tagging errors decrease,standard tagging,part-of-speech tagging,inductive improvement,biological corpus,inductive approach,different nature,past participle,molecular biology,user interface,part of speech,noun
Inductive logic programming,Verb,Participle,Plural,Terminology,Computer science,Noun,Natural language processing,Artificial intelligence,Rule induction,Molecular biology,Adjective
Conference
Volume
ISSN
ISBN
3501
0302-9743
3-540-25864-7
Citations 
PageRank 
References 
1
0.36
20
Authors
4
Name
Order
Citations
PageRank
Ahmed Amrani1102.28
Mathieu Roche231.46
Yves Kodratoff3581172.25
Oriane Matte-tailliez4183.23