Title
Configurable assembly of classification rules for enhancing entity resolution results
Abstract
•We propose four heuristics for turning the results produced by logical rules into confidence scores (i.e., a continuous quantification associated with the confidence produced by the rule regarding the duplicity of an entity pair).•We propose a novel auto-tuning algorithm for classifying duplicate entities based on confidence scores.•We propose an efficient algorithm for tuning the parameters of the Rule Assembler (considering scenarios in which training data is available).•We propose a systematic approach to map user preferences regarding precision and recall into parameters of the Rule Assembler.•We present an experimental evaluation of the proposed approach using both real-world and synthetic datasets.
Year
DOI
Venue
2020
10.1016/j.ipm.2020.102224
Information Processing & Management
DocType
Volume
Issue
Journal
57
3
ISSN
Citations 
PageRank 
0306-4573
0
0.34
References 
Authors
0
3