Title
Bridging Quantities in Tables and Text
Abstract
There is a wealth of schema-free tables on the Web, holding valuable information about quantities on sales and costs, environmental footprint of cars, health data and more. Table content can only be properly interpreted in conjunction with the textual context that surrounds the tables. This paper introduces the quantity alignment problem: bidirectional linking between textual mentions of quantities and the corresponding table cells, in order to support advanced content summarization and faster navigation between explanations in text and details in tables. We present the BriQ system for computing such alignments. BriQ is designed to cope with the specific challenges of approximate quantities, aggregated quantities, and calculated quantities in text that are common but cannot be directly matched in table cells. We judiciously combine feature-based classification with joint inference by random walks over candidate alignment graphs. Experiments with a large collection of tables from the Common Crawl project demonstrate the viability of our methods.
Year
DOI
Venue
2019
10.1109/ICDE.2019.00094
international conference on data engineering
Keywords
Field
DocType
Aggregates,Web pages,Knowledge based systems,Joining processes,Informatics,Automobiles,Ice
Automatic summarization,Graph,Data mining,Web page,Random walk,Computer science,Inference,Bridging (networking),Knowledge-based systems,Ecological footprint
Conference
ISSN
ISBN
Citations 
1084-4627
978-1-5386-7474-1
4
PageRank 
References 
Authors
0.40
0
4
Name
Order
Citations
PageRank
Yusra Ibrahim1223.10
Mirek Riedewald2113684.31
Gerhard Weikum3127102146.01
Demetrios Zeinalipour-Yazti469357.60