Title | ||
---|---|---|
Automatic selection of class labels from a thesaurus for an effective semantic tagging of corpora |
Abstract | ||
---|---|---|
It is widely accepted that tagging text with semantic information would improve the quality of lexical learning in corpus-based NLP methods. However available on-line taxonomies are rather entangled and introduce an unnecessary level of ambiguity. The noise produced by the redundant number of tags often overrides the advantage of semantic tagging. In this paper we propose an automatic method to select from WordNet a subset of domain-appropriate categories that effectively reduce the overambiguity of WordNet, and help at identifying and categorise relevant language patterns in a more compact way. The method is evaluated against a manually tagged corpus, SEMCOR. |
Year | DOI | Venue |
---|---|---|
1997 | 10.3115/974557.974612 | ANLP |
Keywords | Field | DocType |
tagging text,effective semantic,categorise relevant language pattern,domain-appropriate category,automatic selection,lexical learning,semantic tagging,class label,available on-line taxonomy,automatic method,corpus-based nlp method,semantic information,redundant number | Information retrieval,Computer science,Semantic information,Pattern language,Artificial intelligence,Natural language processing,WordNet,Ambiguity | Conference |
Citations | PageRank | References |
7 | 0.69 | 13 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alessandro Cucchiarelli | 1 | 226 | 36.38 |
paola velardi | 2 | 1553 | 163.66 |