Abstract | ||
---|---|---|
This paper presents a general method for alias assignment task in information extraction. We compared two approaches to face the problem and learn a classifier. The first one quantifies a global similarity between the alias and all the possible entities weighting some features about each pair alias-entity. The second is a classical classifier where each instance is a pair alias-entity and its attributes are their features. Both approaches use the same feature functions about the pair alias-entity where every level of abstraction, from raw characters up to semantic level, is treated in an homogeneous way. In addition, we propose an extended feature functions that break down the information and let the machine learning algorithm to determine the final contribution of each value. The use of extended features improve the results of the simple ones. |
Year | Venue | Keywords |
---|---|---|
2007 | PROCESAMIENTO DEL LENGUAJE NATURAL | Alias Assignment, Information Extraction, Entity Matching |
DocType | Volume | Issue |
Journal | 39 | 39 |
ISSN | Citations | PageRank |
1135-5948 | 1 | 0.35 |
References | Authors | |
13 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Emili Sapena | 1 | 91 | 6.50 |
Llu ´ õs Padro | 2 | 1 | 0.35 |
Jordi Turmo | 3 | 306 | 30.52 |