Name: KUNLE OLUKOTUN
Affiliation: Computer Systems Laboratory, Stanford University
Papers: 139
Collaborators: 217
Citations: 4532
PageRank: 373.50
Referers: 8783
Referees: 2988
References: 1670
Title | Citations | PageRank | Year
High performance lattice regression on FPGAs via a high level hardware description language | 0 | 0.34 | 2021
Aurochs: An Architecture for Dataflow Threads | 1 | 0.34 | 2021
Chopping off the Tail: Bounded Non-Determinism for Real-Time Accelerators | 0 | 0.34 | 2021
SARA: Scaling a Reconfigurable Dataflow Accelerator | 4 | 0.42 | 2021
Capstan: A Vector RDA for Sparsity | 3 | 0.36 | 2021
Gorgon: Accelerating Machine Learning from Relational Data | 5 | 0.40 | 2020
DeepFreak: Learning Crystallography Diffraction Patterns with Automated Machine Learning. | 0 | 0.34 | 2019
Scalable interconnects for reconfigurable spatial architectures | 0 | 0.34 | 2019
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark. | 6 | 0.55 | 2018
LevelHeaded: A Unified Engine for Business Intelligence and Linear Algebra Querying | 1 | 0.35 | 2018
High-Accuracy Low-Precision Training. | 0 | 0.34 | 2018
Exploring the Utility of Developer Exhaust | 0 | 0.34 | 2018
Practical Design Space Exploration | 1 | 0.36 | 2018
Mind the gap: bridging multi-domain query workloads with EmptyHeaded | 1 | 0.35 | 2017
Flare: Native Compilation for Heterogeneous Workloads in Apache Spark. | 3 | 0.37 | 2017
Infrastructure for Usable Machine Learning: The Stanford DAWN Project. | 3 | 0.37 | 2017
LevelHeaded: Making Worst-Case Optimal Joins Work in the Common Case. | 0 | 0.34 | 2017
Understanding and Optimizing Asynchronous Low-Precision Stochastic Gradient Descent. | 14 | 0.64 | 2017
Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. | 6 | 0.55 | 2016
EmptyHeaded: A Relational Engine for Graph Processing | 29 | 0.79 | 2016
Automatic Generation of Efficient Accelerators for Reconfigurable Hardware. | 19 | 0.70 | 2016
Old techniques for new join algorithms: A case study in RDF processing | 7 | 0.47 | 2016
GraphOps: A Dataflow Library for Graph Analytics Acceleration. | 19 | 0.66 | 2016
Scaling Data Analytics with Moore's Law. | 0 | 0.34 | 2016
Automatic support for multi-module parallelism from computational patterns | 4 | 0.45 | 2015
Rapidly Mixing Gibbs Sampling for a Class of Factor Graphs Using Hierarchy Width | 6 | 0.50 | 2015
Energy-Efficient Abundant-Data Computing: The N3XT 1,000x | 24 | 1.82 | 2015
Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms | 0 | 0.34 | 2015
Simplifying Scalable Graph Processing with a Domain-Specific Language | 16 | 0.65 | 2014
Delite: A Compiler Architecture for Performance-Oriented Embedded Domain-Specific Languages | 48 | 1.40 | 2014
Beyond parallel programming with domain specific languages | 1 | 0.35 | 2014
Hardware acceleration of database operations | 56 | 1.84 | 2014
Global Convergence of Stochastic Gradient Descent for Some Nonconvex Matrix Problems. | 2 | 0.50 | 2014
Hardware system synthesis from Domain-Specific Languages | 20 | 0.88 | 2014
Locality-Aware Mapping of Nested Parallel Patterns on GPUs | 20 | 0.88 | 2014
Surgical precision JIT compilers | 16 | 0.77 | 2014
Composition and reuse with compiled domain-specific languages | 26 | 0.94 | 2013
On fast parallel detection of strongly connected components (SCC) in small-world graphs | 33 | 1.00 | 2013
Optimizing data structures in high-level programs: new directions for extensible compilers based on staging | 50 | 1.42 | 2013
Green-Marl: a DSL for easy and efficient graph analysis | 119 | 4.03 | 2012
A case of system-level hardware/software co-design and co-verification of a commodity multi-processor system with custom hardware | 1 | 0.35 | 2012
High performance embedded domain specific languages | 1 | 0.36 | 2012
Implementing Domain-Specific Languages for Heterogeneous Parallel Computing | 37 | 1.42 | 2011
OptiML: An Implicitly Parallel Domain-Specific Language for Machine Learning. | 65 | 2.53 | 2011
Runtime automatic speculative parallelization | 13 | 0.60 | 2011
Efficient Parallel Graph Exploration on Multi-Core CPU and GPU | 130 | 4.60 | 2011
A domain-specific approach to heterogeneous parallelism | 73 | 3.47 | 2011
Accelerating CUDA graph algorithms at maximum warp | 171 | 5.87 | 2011
Hardware/software co-design for high performance computing: challenges and opportunities | 4 | 0.68 | 2010
Eigenbench: A simple exploration tool for orthogonal TM characteristics | 31 | 0.99 | 2010