Preface: Special issue on the Sixth International Symposium on Networking and Computing. | 0 | 0.34 | 2019 |
FIFO-Based Hardware Sorters for High Bandwidth Memory | 1 | 0.37 | 2019 |
Folded Bloom Filter for High Bandwidth Memory, with GPU Implementations | 0 | 0.34 | 2019 |
Introduction to APDCM 2019 | 0 | 0.34 | 2019 |
Bulk execution of Euclidean algorithms on the CUDA-enabled GPU. | 0 | 0.34 | 2016 |
Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars | 2 | 0.39 | 2016 |
Deterministic Construction of Regular Geometric Graphs with Short Average Distance and Limited Edge Length. | 0 | 0.34 | 2016 |
An Efficient Implementation of LZW Compression in the FPGA. | 0 | 0.34 | 2016 |
Efficient Implementation of FDFM Approach for Euclidean Algorithms on the FPGA. | 0 | 0.34 | 2016 |
Randomly Optimized Grid Graph for Low-Latency Interconnection Networks | 5 | 0.47 | 2016 |
An Efficient Implementation of LZW Decompression in the FPGA | 3 | 0.45 | 2016 |
Preface: Special issue on the Third International Symposium on Computing and Networking. | 0 | 0.34 | 2016 |
Optimal Parallel Hardware K-Sorter and Top K-Sorter, with FPGA Implementations | 2 | 0.41 | 2015 |
Asterisk PBX Capacity Evaluation | 1 | 0.37 | 2015 |
A Warp-Synchronous Implementation for Multiple-Length Multiplication on the GPU. | 0 | 0.34 | 2015 |
Optimality of Fundamental Parallel Algorithms on the Hierarchical Memory Machine, with GPU Implementation | 0 | 0.34 | 2015 |
Efficient GPU Implementations for the Conway's Game of Life. | 1 | 0.36 | 2015 |
A Flexible-Length-Arithmetic Processor Based on FDFM Approach in FPGAs. | 0 | 0.34 | 2015 |
A Fast Approximate String Matching Algorithm on GPU. | 2 | 0.40 | 2015 |
GPU-Accelerated Digital Halftoning by the Local Exhaustive Search | 0 | 0.34 | 2015 |
Using Pulse/Tone Signals as an Alternative to Boost Channel Reservation on Directional Communications | 1 | 0.38 | 2015 |
A GPU Implementation of Clipping-Free Halftoning Using the Direct Binary Search. | 4 | 0.46 | 2014 |
APDCM Introduction and Committees | 0 | 0.34 | 2014 |
Offline Permutation on the CUDA-Enabled GPU | 0 | 0.34 | 2014 |
An Efficient Implementation of the Gradient-Based Hough Transform Using DSP Slices and Block RAMs on the FPGA | 3 | 0.51 | 2014 |
Thorough Evaluation of GPU Shared Memory Load and Store Instructions | 2 | 0.40 | 2014 |
Random Address Permute-Shift Technique for the Shared Memory on GPUs | 0 | 0.34 | 2014 |
Asynchronous Memory Machine Models with Barrier Synchronization | 1 | 0.36 | 2014 |
GPU-Accelerated Verification of the Collatz Conjecture. | 0 | 0.34 | 2014 |
Template Matching Using DSP Slices on the FPGA | 5 | 0.51 | 2013 |
The Hierarchical Memory Machine Model for GPUs | 3 | 0.44 | 2013 |
Offline Permutation Algorithms on the Discrete Memory Machine with Performance Evaluation on the GPU | 9 | 0.60 | 2013 |
Accelerating computation of Euclidean distance map using the GPU with efficient memory access | 6 | 0.51 | 2013 |
ASCII Art Generation Using the Local Exhaustive Search on the GPU | 10 | 0.63 | 2013 |
Sequential Memory Access on the Unified Memory Machine with Application to the Dynamic Programming | 8 | 0.55 | 2013 |
The super warp architecture with random address shift | 3 | 0.50 | 2013 |
An optimal parallel prefix-sums algorithm on the memory machine models for GPUs | 15 | 1.01 | 2012 |
The Parallel FDFM Processor Core Approach for CRT-based RSA Decryption. | 9 | 0.59 | 2012 |
A Rewriting Approach to Replace Asynchronous ROMs with Synchronous Ones for the Circuits with Cycles. | 0 | 0.34 | 2012 |
An Implementation of Conflict-Free Offline Permutation on the GPU | 2 | 0.40 | 2012 |
Efficient Implementations of the Approximate String Matching on the Memory Machine Models | 9 | 0.68 | 2012 |
An Algorithm to Remove Asynchronous ROMs in Circuits with Cycles | 0 | 0.34 | 2011 |
Efficient Exhaustive Verification of the Collatz Conjecture using DSP blocks of Xilinx FPGAs. | 10 | 0.62 | 2011 |
The Parallel FDFM Processor Core Approach for Neural Networks | 2 | 0.41 | 2011 |
Accelerating the Dynamic Programming for the Matrix Chain Product on the GPU | 24 | 1.55 | 2011 |
Fast Ellipse Detection Algorithm Using Hough Transform on the GPU | 18 | 1.17 | 2011 |
A Graph Rewriting Approach for Converting Asynchronous ROMs into Synchronous Ones | 1 | 0.37 | 2011 |
Low-Latency Connected Component Labeling Using an FPGA | 10 | 0.53 | 2010 |
Deafness Resilient MAC Protocol for Directional Communications | 3 | 0.46 | 2010 |
A Perspective on the Experiential Learning of Computer Architecture. | 0 | 0.34 | 2010 |