Title
HW/SW approaches to accelerate GRAPES in an FU array
Abstract
In this research, a high performance computing weather forecasting application GRAPES has been tuned onto a functional unit (FU) array based architecture. Software and hardware approaches are specifically employed to increase the data locality and data reuse to accelerate the stencil computation in GRAPES. The simulation results indicate that we can achieve a per-core average IPC of 12.3 within a 20-stage FU array processor, which has a 5.8x power-efficiency boost than the many-core processor (MCP) of a same process technology. This can accordingly slow down the increase of communication by one order in the cluster system, resulting in a 12x power-efficiency boost in all PEs. © 2013 IEEE.
Year
DOI
Venue
2013
10.1109/CoolChips.2013.6547920
COOL Chips
Field
DocType
Volume
Architecture,Locality,Supercomputer,Computer science,Stencil code,Software,Vector processor,Weather forecasting,Data reuse,Embedded system
Conference
null
Issue
ISSN
ISBN
null
null
978-1-4673-5781-4
Citations 
PageRank 
References 
0
0.34
5
Authors
6
Name
Order
Citations
PageRank
Wei Wang100.34
Jun Yao239547.98
Youhui Zhang320228.36
Wei Xue440052.95
Yasuhiko Nakashima512832.60
Weimin Zheng61889182.48