Title | ||
---|---|---|
Configurable assembly of classification rules for enhancing entity resolution results |
Abstract | ||
---|---|---|
•We propose four heuristics for turning the results produced by logical rules into confidence scores (i.e., a continuous quantification associated with the confidence produced by the rule regarding the duplicity of an entity pair).•We propose a novel auto-tuning algorithm for classifying duplicate entities based on confidence scores.•We propose an efficient algorithm for tuning the parameters of the Rule Assembler (considering scenarios in which training data is available).•We propose a systematic approach to map user preferences regarding precision and recall into parameters of the Rule Assembler.•We present an experimental evaluation of the proposed approach using both real-world and synthetic datasets. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1016/j.ipm.2020.102224 | Information Processing & Management |
DocType | Volume | Issue |
Journal | 57 | 3 |
ISSN | Citations | PageRank |
0306-4573 | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dimas Cassimiro Nascimento | 1 | 0 | 0.34 |
Carlos Eduardo Santos Pires | 2 | 57 | 10.68 |
Thiago Pereira Nóbrega | 3 | 0 | 0.34 |