Abstract | ||
---|---|---|
This study evaluates the performance of the 64-core-based TILEPro64, and compares it with Core i7 and Atom by executing three benchmark programs: a synthetic bench, SPEC CINT2006 and SPLASH-2. TILEPro64 is not advertised for regular applications such as SPLASH-2. However, its internal many-core structure makes it worth investigating the performance characteristic with conventional benchmarks. The synthetic benchmark shows that the stall time because of on-chip network takes up to 85% of total execution time in TILEPro64. The single-core performance with CINT2006 reports that Core i7 and Atom deliver 15.4 × and 3.8 × superior performance to TILEPro64, respectively. The parallel performance with SPLASH-2 reports a similar trend. Comparing the fastest execution times, Core i7 boasts of a 19.2 × faster performance than TILEPro64 and even Atom outperforms TILEPro64 by 2.6 × on average. It came as a surprise that even Atom outperforms TILEPro64 in most of the benchmark programs. The highest number of last-level cache misses is a major culprit for low performance. The forerunner many-core products such as TILEPro64 offer excellent test-beds for polishing, adjusting and reshaping many-core architecture in the right direction. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1049/iet-cdt.2012.0101 | IET Computers & Digital Techniques |
Keywords | DocType | Volume |
Atom,single-core performance,parallel performance,64-core-based TILEPro64,multiprocessing systems,internal many-core structure,Core i7,SPLASH-2,performance evaluation,synthetic benchmark,last-level cache misses,stall time,on-chip network,SPEC CINT2006 | Journal | 7 |
Issue | ISSN | Citations |
4 | 1751-8601 | 0 |
PageRank | References | Authors |
0.34 | 5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Han-Yee Kim | 1 | 1 | 0.69 |
Young Hwan Kim | 2 | 181 | 37.11 |
Heonchang Yu | 3 | 227 | 29.58 |
Taeweon Suh | 4 | 171 | 22.47 |