Title
Senbazuru: a prototype spreadsheet database management system
Abstract
Spreadsheets have become a critical data management tool, but they lack explicit relational metadata, making it difficult to join or integrate data across multiple spreadsheets. Because spreadsheet data are widely available on a huge range of topics, a tool that allows easy spreadsheet integration would be hugely beneficial for a variety of users. We demonstrate that Senbazuru, a prototype spreadsheet database management system (SSDBMS), is able to extract relational information from spreadsheets. By doing so, it opens up opportunities for integration among spreadsheets and with other relational sources. Senbazuru allows users to search for relevant spreadsheets in a large corpus, probabilistically constructs a relational version of the data, and offers several relational operations over the resulting extracted data (including joins to other spreadsheet data). Our demonstration is available on two clients: a JavaScript-rich Web site and a touch interface on the iPad. During the demo, Senbazuru will allow VLDB participants to search spreadsheets, extract relational data from them, and apply relational operators such as select and join.
Year
DOI
Venue
2013
10.14778/2536274.2536276
very large data bases
Keywords
Field
DocType
relational version,relational data,multiple spreadsheets,relational information,spreadsheet data,relational operation,relational source,relational operator,explicit relational metadata,prototype spreadsheet database management,critical data management tool
Metadata,Data mining,Joins,World Wide Web,Data administration,Relational database,Computer science,Very large database,Relational operator,Data management,Web site,Database
Journal
Volume
Issue
ISSN
6
12
2150-8097
Citations 
PageRank 
References 
16
0.63
10
Authors
5
Name
Order
Citations
PageRank
Zhe Chen1833.28
Michael J. Cafarella22246144.15
Jun Chen3160.63
Daniel Prevo4160.63
Junfeng Zhuang5160.63