Name: FABRICE RASTELLO
Affiliation: Inria
Papers: 60
Collaborators: 99
Citations: 482
PageRank: 38.30
Referers: 794
Referees: 1349
References: 875
| Title | Citations | PageRank | Year |
|---|---|---|---|
| IOOpt: automatic derivation of I/O complexity bounds for affine programs | 0 | 0.34 | 2021 |
| Efficient tiled sparse matrix multiplication through matrix signatures | 0 | 0.34 | 2020 |
| Automated Derivation of Parametric Data Movement Lower Bounds for Affine Programs | 1 | 0.35 | 2020 |
| Building a Polyhedral Representation from an Instrumented Execution: Making Dynamic Analyses of Nonaffine Programs Scalable | 0 | 0.34 | 2020 |
| Analytical cache modeling and tilesize optimization for tensor contractions | 0 | 0.34 | 2019 |
| Data-flow/dependence profiling for structured transformations | 1 | 0.38 | 2019 |
| Register optimizations for stencils on GPUs | 10 | 0.49 | 2018 |
| Performance modeling for GPUs using abstract kernel emulation | 0 | 0.34 | 2018 |
| GPU code optimization using abstract kernel emulation and sensitivity analysis | 1 | 0.35 | 2018 |
| Associative instruction reordering to alleviate register pressure | 1 | 0.35 | 2018 |
| Simplification and runtime resolution of data dependence constraints for loop transformations | 1 | 0.35 | 2017 |
| POSTER: Statement Reordering to Alleviate Register Pressure for Stencils on GPUs | 1 | 0.35 | 2017 |
| Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis | 0 | 0.34 | 2017 |
| Static and Dynamic Frequency Scaling on Multicore CPUs | 4 | 0.45 | 2016 |
| Brief Announcement: Approximating the I/O Complexity of One-Shot Red-Blue Pebbling | 2 | 0.37 | 2016 |
| A bounded memory allocator for software-defined global address spaces | 0 | 0.34 | 2016 |
| Description, Implementation and Evaluation of an Affinity Clause for Task Directives | 2 | 0.38 | 2016 |
| PolyCheck: dynamic verification of iteration space transformations on affine programs | 2 | 0.36 | 2016 |
| Using Data Dependencies to Improve Task-Based Scheduling Strategies on NUMA Architectures | 6 | 0.47 | 2016 |
| Effective padding of multidimensional arrays to avoid cache conflict misses | 6 | 0.43 | 2016 |
| Generalized cache tiling for dataflow programs | 0 | 0.34 | 2016 |
| A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment | 4 | 0.39 | 2016 |
| An interval constrained memory allocator for the Givy GAS runtime | 0 | 0.34 | 2016 |
| Register allocation and promotion through combined instruction scheduling and loop unrolling | 3 | 0.37 | 2016 |
| On fusing recursive traversals of K-d trees | 6 | 0.43 | 2016 |
| POSTER: Hybrid Data Dependence Analysis for Loop Transformations | 0 | 0.34 | 2016 |
| Runtime pointer disambiguation | 7 | 0.47 | 2015 |
| On characterizing the data movement complexity of computational DAGs for parallel execution | 1 | 0.35 | 2014 |
| On Using the Roofline Model with Lower Bounds on Data Movement | 1 | 0.35 | 2014 |
| Parameterized Construction of Program Representations for Sparse Dataflow Analyses | 0 | 0.34 | 2014 |
| Beyond reuse distance analysis: Dynamic analysis for characterization of data locality potential | 4 | 0.42 | 2014 |
| A framework for enhancing data reuse via associative reordering | 21 | 0.69 | 2014 |
| A polynomial spilling heuristic: Layered allocation | 0 | 0.34 | 2013 |
| SSI Properties Revisited | 2 | 0.35 | 2012 |
| Decoupled graph-coloring register allocation with hierarchical aliasing | 1 | 0.36 | 2011 |
| Graph-coloring and treescan register allocation using repairing | 4 | 0.40 | 2011 |
| A non-iterative data-flow algorithm for computing liveness sets in strict SSA programs | 2 | 0.37 | 2011 |
| Split register allocation: linear complexity without the performance penalty | 2 | 0.38 | 2010 |
| Parallel copy motion | 4 | 0.44 | 2010 |
| Revisiting Out-of-SSA Translation for Correctness, Code Quality and Efficiency | 21 | 0.78 | 2009 |
| Advanced conservative and optimistic register coalescing | 8 | 0.49 | 2008 |
| On the Complexity of Register Coalescing | 26 | 0.92 | 2007 |
| On the complexity of spill everywhere under SSA form | 15 | 0.65 | 2007 |
| Register allocation: what does the NP-completeness proof of Chaitin et al. really prove? or revisiting register allocation: why and how | 20 | 0.71 | 2006 |
| Procedure placement using temporal-ordering information: dealing with code size expansion | 17 | 1.17 | 2005 |
| Optimal task scheduling at run time to exploit intra-tile parallelism | 2 | 0.38 | 2003 |
| Efficient Tiling for an ODE Discrete Integration Program: Redundant Tasks Instead of Trapezoidal Shaped-Tiles | 2 | 0.38 | 2002 |
| Partitioning a Square into Rectangles: NP-Completeness and Approximation Algorithms | 20 | 1.15 | 2002 |
| Dense linear algebra kernels on heterogeneous platforms: redistribution issues | 9 | 0.73 | 2002 |
| Automatic Partitioning of Parallel Loops with Parallelepiped-Shaped Tiles | 8 | 0.61 | 2002 |