Name
Affiliation
Papers
JUAN J. NAVARRO
Department of Computer Architecture, Universitat Politècnica de Catalunya, Gran Capità s/n, Modul D6, E-08034 Barcelona, Spain
41
Collaborators
Citations 
PageRank 
40
323
42.90
Referers 
Referees 
References 
605
516
385
Search Limit
100605
Title
Citations
PageRank
Year
Reusing cached schedules in an out-of-order processor with in-order issue logic30.382009
Hypermatrix oriented supernode amalgamation10.392008
Analysis of a sparse hypermatrix Cholesky with fixed-sized blocking10.362007
Exploiting computer resources for fast nearest neighbor classification80.502007
Using non-canonical array layouts in dense matrix operations10.422006
Compiler-optimized kernels: an efficient alternative to hand-coded inner kernels40.432006
A study on load imbalance in parallel hypermatrix multiplication using OpenMP00.342005
Adapting linear algebra codes to the memory hierarchy using a hypermatrix scheme10.362005
Efficient Implementation of Nearest Neighbor Classification00.342005
Optimization of a statically partitioned hypermatrix sparse cholesky factorization10.362004
Automatic Benchmarking and Optimization of Codes: An Experience with Numerical Kernels70.602003
Building Software Via Shared Knowledge00.342003
CC-Radix: a Cache Conscious Sorting Based on Radix sort111.062003
Improving Performance of Hypermatrix Cholesky Factorization90.582003
Case study: Memory conscious parallel sorting10.372002
The effect of local sort on parallel sorting algorithms10.372002
Fast parallel in-memory 64-bit sorting70.652001
Sorting on the SGI Origin 2000: comparing MPI and shared memory implementations00.341999
Communication conscious radix sort80.801999
Dynamic history-length fitting: a third level of adaptivity for branch prediction504.231998
Data caches for superscalar processors271.471997
Block algorithms for sparse matrix computations on high performance workstations121.721996
The difference-bit cache171.281996
Data prefetching and multilevel blocking for linear algebra operations80.751996
Vector and Parallel Interpolation by Natural Cubic Splines and B-Splines10.411996
Review of general and Toeplitz vector bidiagonal solvers80.671996
Performance on Distributed Memory Multicomputers of Domain Decomposition Solvers00.341995
A Generalized Criterion for the Early Termination of R-Cyclic Reduction and Divide and Conquer for Recurrences00.341995
A generalized vision of some parallel bidiagonal systems solvers20.411994
MOB forms: a class of multilevel block algorithms for dense linear algebra operations244.071994
A Parallel Tridiagonal Solver For Vector Uniprocessors20.411993
Spike algorithm with savings for strictly diagonal dominant tridiagonal systems20.451993
A method for implementation of one-dimensional systolic algorithms with data contraflow using pipelined functional units50.771992
Increasing the number of strides for conflict-free vector access405.991992
Performance evaluation of transputer systems with linear algebra problems00.341991
Mapping QR decomposition of banded matrix on a 1D systolic array with data contraflow and pipelined functional units10.401991
Conflict-Free Strides for Vectors in Matched Memories20.381991
Transformation of systolic algorithms for interleaving partitions00.341991
Systematic hardware adaptation of systolic algorithms40.721989
Partitioning: An Essential Step in Mapping Algorithms Into Systolic Array Processors517.141987
Solving Matrix Problems with No Size Restriction on a Systolic Array Processor31.281986