Title
Mining Approximate Primary Functional Dependency On Web Tables
Abstract
We propose to discover approximate primary functional dependency (aPFD) for web tables, which focus on the determination relationship between primary attributes and non-primary attributes and are more helpful for entity column detection and topic discovery on web tables. Based on association rules and information theory, we propose metrics Conf and InfoGain to evaluate PFDs. By quantifying PFDs' strength and designing pruning strategies to eliminate false positives, our method could select minimal non-trivial approximate PFD effectively and are scalable to large tables. The comprehensive experimental results on real web datasets show that our method significantly outperforms previous work in both effectiveness and efficiency.
Year
DOI
Venue
2019
10.1587/transinf.2018EDL8130
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Keywords
Field
DocType
functional dependencies, web table, metrics, pruning strategies
Information retrieval,Pattern recognition,Computer science,Functional dependency,Artificial intelligence,Web tables
Journal
Volume
Issue
ISSN
E102D
3
1745-1361
Citations 
PageRank 
References 
0
0.34
0
Authors
3
Name
Order
Citations
PageRank
Siyu Chen1295.96
Ning Wang238.48
Mengmeng Zhang311524.91