Designing data center networks using bottleneck structures | 0 | 0.34 | 2021 |
Computing Bottleneck Structures at Scale for High-Precision Network Performance Analysis | 0 | 0.34 | 2020 |
Approximate Inverse Chain Preconditioner: Iteration Count Case Study for Spectral Support Solvers | 0 | 0.34 | 2020 |
Multiscale Data Analysis Using Binning, Tensor Decompositions, And Backtracking | 0 | 0.34 | 2020 |
G2: A Network Optimization Framework for High-Precision Analysis of Bottleneck and Flow Performance | 1 | 0.37 | 2019 |
Fast Large-Scale Algorithm for Electromagnetic Wave Propagation in 3D Media | 0 | 0.34 | 2019 |
Enhancing Network Visibility and Security through Tensor Analysis | 2 | 0.45 | 2019 |
Combinatorial Multigrid: Advanced Preconditioners For Ill-Conditioned Linear Systems | 1 | 0.36 | 2019 |
Combining Tensor Decompositions and Graph Analytics to Provide Cyber Situational Awareness at HPC Scale | 2 | 0.36 | 2019 |
Computationally Efficient CP Tensor Decomposition Update Framework for Emerging Component Discovery in Streaming Data | 0 | 0.34 | 2018 |
Accelerating Dijkstra's Algorithm Using Multiresolution Priority Queues | 0 | 0.34 | 2018 |
Algorithms and Data Structures to Accelerate Network Analysis | 1 | 0.37 | 2018 |
All-at-once Decomposition of Coupled Billion-scale Tensors in Apache Spark | 2 | 0.39 | 2018 |
Polyhedral Optimization Of TensorFlow Computation Graphs | 1 | 0.36 | 2017 |
Memory-efficient parallel tensor decompositions | 8 | 0.60 | 2017 |
Multiresolution Priority Queues | 0 | 0.34 | 2017 |
A unified Coq framework for verifying C programs with floating-point computations | 4 | 0.39 | 2016 |
A sparse multidimensional FFT for real positive vectors | 0 | 0.34 | 2016 |
Efficient Compilation to Event-Driven Task Programs | 0 | 0.34 | 2016 |
An Interactive Visual Tool for Code Optimization and Parallelization Based on the Polyhedral Model | 1 | 0.35 | 2016 |
Highly Scalable Near Memory Processing with Migrating Threads on the Emu System Architecture | 1 | 0.37 | 2016 |
Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime | 0 | 0.34 | 2016 |
Scalable Hierarchical Polyhedral Compilation | 4 | 0.42 | 2016 |
Optimization of symmetric tensor computations | 3 | 0.45 | 2015 |
Automatic cluster parallelization and minimizing communication via selective data replication | 1 | 0.35 | 2015 |
Embedded second-order cone programming with radar applications | 0 | 0.34 | 2015 |
Parallelizing and optimizing sparse tensor computations | 5 | 0.51 | 2014 |
A Tale of Three Runtimes | 3 | 0.41 | 2014 |
Runnemede: An architecture for Ubiquitous High-Performance Computing | 38 | 1.17 | 2013 |
Memory reuse optimizations in the R-Stream compiler | 3 | 0.40 | 2013 |
Scalable Cyber-Security for Terabit Cloud Computing | 0 | 0.34 | 2012 |
Automatic communication optimizations through memory reuse strategies | 0 | 0.34 | 2012 |
A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction | 33 | 2.35 | 2010 |
Evaluation of Stream Virtual Machine on Raw Processor | 0 | 0.34 | 2007 |
Retrospective: the J-machine | 4 | 1.66 | 1998 |