Abstract | ||
---|---|---|
In this research, a high performance computing weather forecasting application GRAPES has been tuned onto a functional unit (FU) array based architecture. Software and hardware approaches are specifically employed to increase the data locality and data reuse to accelerate the stencil computation in GRAPES. The simulation results indicate that we can achieve a per-core average IPC of 12.3 within a 20-stage FU array processor, which has a 5.8x power-efficiency boost than the many-core processor (MCP) of a same process technology. This can accordingly slow down the increase of communication by one order in the cluster system, resulting in a 12x power-efficiency boost in all PEs. © 2013 IEEE. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/CoolChips.2013.6547920 | COOL Chips |
Field | DocType | Volume |
Architecture,Locality,Supercomputer,Computer science,Stencil code,Software,Vector processor,Weather forecasting,Data reuse,Embedded system | Conference | null |
Issue | ISSN | ISBN |
null | null | 978-1-4673-5781-4 |
Citations | PageRank | References |
0 | 0.34 | 5 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Wei Wang | 1 | 0 | 0.34 |
Jun Yao | 2 | 395 | 47.98 |
Youhui Zhang | 3 | 202 | 28.36 |
Wei Xue | 4 | 400 | 52.95 |
Yasuhiko Nakashima | 5 | 128 | 32.60 |
Weimin Zheng | 6 | 1889 | 182.48 |