Name
Affiliation
Papers
YAOHUA WANG
Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
39
Collaborators
Citations 
PageRank 
113
44
14.23
Referers 
Referees 
References 
162
1037
384
Search Limit
1001000
Title
Citations
PageRank
Year
Heterogeneous Systolic Array Architecture for Compact CNNs Hardware Accelerators00.342022
MT-3000: a heterogeneous multi-zone processor for HPC00.342022
HeSA - Heterogeneous Systolic Array Architecture for Compact CNNs Hardware Accelerators.00.342021
CODIC: A Low-Cost Substrate for Enabling Custom In-DRAM Functionalities and Optimizations20.352021
Configurable Multi-directional Systolic Array Architecture for Convolutional Neural Networks00.342021
Universal Pre-Calculating Structure: Reducing Complexity of Ising Chips with Arbitrary Connectivity00.342021
Advancing DSP into HPC, AI, and beyond: challenges, mechanisms, and future directions10.362021
FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching20.352020
Deviation based clustering for unsupervised person re-identification.10.362020
Novel Design Strategy Toward A2 Trojan Detection Based on Built-In Acceleration Structure00.342020
CMSA: Configurable Multi-directional Systolic Array for Convolutional Neural Networks00.342020
A Lifelong Health Monitoring Framework in Processors: Work-in-Progress00.342020
Hierarchical Clustering With Hard-Batch Triplet Loss for Person Re-Identification70.482020
A Specification-Based Semi-Formal Functional Verification Method By A Stage Transition Graph Model00.342019
ITAP: Idle-Time-Aware Power Management for GPU Execution Units.10.352019
The Evaluation of DCNN on Vector-SIMD DSP.00.342019
Dataplant: In-DRAM Security Mechanisms for Low-Cost Devices.00.342019
FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives.70.462018
Enabling Efficient RDMA-based Synchronous Mirroring of Persistent Memory Transactions.10.352018
Advancing CMOS-Type Ising Arithmetic Unit into the Domain of Real-World Applications.10.372018
Reducing DRAM Latency via Charge-Level-Aware Look-Ahead Partial Restoration.90.392018
Iteration Interleaving-Based SIMD Lane Partition.10.362016
Mod (2P-1) Shuffle Memory-Access Instructions for FFTs on Vector SIMD DSPs10.412016
FT-Matrix: A Coordination-Aware Architecture for Signal Processing30.432014
Redefining the relationship between scalar and parallel units in SIMD architectures00.342013
A multiple SIMD, multiple data (MSMD) architecture: Parallel execution of dynamic and static SIMD fragments10.352013
A Novel Qpp Interleaver For Parallel Turbo Decoder00.342013
Dual-Core Framework: Eliminating The Bottleneck Effect Of Scalar Kernels On Simd Architectures10.482013
Breaking The Performance Bottleneck Of Sparse Matrix-Vector Multiplication On Simd Processors10.482013
Architecture Design Trade-offs among VLIW SIMD and Multi-core Schemes00.342012
Architectural Implications for SIMD Processors in the Wireless Communication Domain00.342012
Cmrf: A Configurable Matrix Register File For Accelerating Matrix Operations On Simd Processors10.372012
Instruction Shuffle: Achieving MIMD-like Performance on SIMD Architectures10.362012
Suca: A Scalable Unicore Architecture With Novel Instruction Encoding And Distributed Execution Control00.342011
Matrix Odd-Even Partition: A High Power-Efficient Solution to the Small Grain Data Shuffle00.342011
Lp2d: A Novel Low-Power 2d Memory For Sliding-Window Applications In Vector Dsps00.342011
AIFSP: An Adaptive Instruction Flow Stream Processor00.342011
Accelerating the data shuffle operations for FFT algorithms on SIMD DSPs10.392011
Mapping of H.264/AVC Encoder on a Hierarchical Chip Multicore DSP Platform10.352010