TSB-UAD: An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection. | 0 | 0.34 | 2022 |
Data Station: Delegated, Trustworthy, and Auditable Computation to Enable Data-Sharing Consortia with a Data Escrow. | 0 | 0.34 | 2022 |
Understanding and optimizing packed neural network training for hyper-parameter tuning | 0 | 0.34 | 2021 |
SAND: streaming subsequence anomaly detection | 2 | 0.36 | 2021 |
DLHub: Simplifying publication, discovery, and use of machine learning models in science | 0 | 0.34 | 2021 |
Fast and Reliable Missing Data Contingency Analysis with Predicate-Constraints | 0 | 0.34 | 2020 |
CrocodileDB - Efficient Database Execution through Intelligent Deferment. | 0 | 0.34 | 2020 |
Band-limited Training and Inference for Convolutional Neural Networks | 3 | 0.39 | 2019 |
GRAIL: Efficient Time-Series Representation Learning. | 2 | 0.36 | 2019 |
Artificial Intelligence in Resource-Constrained and Shared Environments | 0 | 0.34 | 2019 |
Intermittent Query Processing. | 0 | 0.34 | 2019 |
Prototyping a Web-Scale Multimedia Retrieval Service Using Spark. | 2 | 0.42 | 2018 |
Drizzle: Fast and Adaptable Stream Processing at Scale. | 24 | 0.78 | 2017 |
Diagnosing Machine Learning Pipelines with Fine-grained Lineage. | 4 | 0.39 | 2017 |
Cioppino: Multi-Tenant Crowd Management. | 0 | 0.34 | 2017 |
BoostClean: Automated Error Detection and Repair for Machine Learning. | 5 | 0.40 | 2017 |
Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies. | 0 | 0.34 | 2016 |
Apache Spark: a unified engine for big data processing. | 260 | 9.42 | 2016 |
Towards reliable interactive data cleaning: a user survey and recommendations. | 12 | 0.63 | 2016 |
ActiveClean: Interactive Data Cleaning For Statistical Modeling. | 0 | 0.34 | 2016 |
ActiveClean: Interactive Data Cleaning While Learning Convex Loss Models. | 6 | 0.47 | 2016 |
Spark SQL: Relational Data Processing in Spark | 307 | 9.13 | 2015 |
SampleClean: Fast and Reliable Analytics on Dirty Data. | 10 | 0.58 | 2015 |
Crowdsourcing Enumeration Queries: Estimators and Interfaces | 1 | 0.34 | 2015 |
Automating model search for large scale machine learning | 29 | 1.76 | 2015 |
Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity | 23 | 0.74 | 2015 |
Quantifying eventual consistency with PBS | 9 | 0.49 | 2014 |
A Partitioning Framework for Aggressive Data Skipping. | 0 | 0.34 | 2014 |
GraphX: Unifying Data-Parallel and Graph-Parallel Analytics. | 20 | 0.80 | 2014 |
GraphX: graph processing in a distributed dataflow framework | 291 | 7.56 | 2014 |
A methodology for learning, analyzing, and mitigating social influence bias in recommender systems | 17 | 0.78 | 2014 |
The Expected Optimal Labeling Order Problem for Crowdsourced Joins and Entity Resolution. | 2 | 0.45 | 2014 |
Coordination Avoidance in Database Systems. | 26 | 0.92 | 2014 |
Data Science Challenges in Real Estate Asset and Capital Markets | 3 | 0.50 | 2014 |
Coordination-Avoiding Database Systems. | 9 | 0.52 | 2014 |
Making sense of big data with the Berkeley data analytics stack | 2 | 0.40 | 2013 |
Crowdsourced enumeration queries | 8 | 0.49 | 2013 |
PBS at work: advancing data management with consistency metrics | 3 | 0.39 | 2013 |
CrowdQ: Crowdsourced Query Understanding. | 18 | 0.78 | 2013 |
RTP: robust tenant placement for elastic in-memory database clusters | 22 | 0.90 | 2013 |
GraphX: a resilient distributed graph system on Spark | 237 | 6.01 | 2013 |
Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing | 1255 | 44.75 | 2012 |
Active Learning for Crowd-Sourced Databases | 7 | 0.63 | 2012 |
Special section on large-scale analytics | 0 | 0.34 | 2012 |
Shark: fast data analysis using coarse-grained distributed memory | 59 | 2.64 | 2012 |
Scaling the mobile millennium system in the cloud | 12 | 2.12 | 2011 |
Hybrid in-database inference for declarative information extraction | 21 | 1.08 | 2011 |
The SCADS director: scaling a distributed storage system under stringent performance requirements | 71 | 2.04 | 2011 |
Crowdsourcing applications and platforms: a data management perspective | 15 | 0.62 | 2011 |