Abstract | ||
---|---|---|
Ever-increasing demand for computing capability is driving the construction of ever-larger computer clusters, soon to be reaching tens of thousands of processors. Many functionalities of system software have failed to scale accordingly - systems are becoming more complex, less reliable, and less efficient. Our premise is that these deficiencies arise from a lack of global control and coordination of the processing nodes. In practice, current parallel machines are loosely-coupled systems that are used for solving inherently tightly-coupled problems. This paper demonstrates that existing and future systems can be made more scalable by using BSP-like parallel programming principles in the design and implementation of the system software, and by taking full advantage of the latest interconnection network hardware. Moreover, we show that this approach can also yield great improvements in efficiency, reliability, and simplicity. |
Year | DOI | Venue |
---|---|---|
2004 | 10.1007/978-3-540-27866-5_90 | Lecture Notes in Computer Science |
Keywords | Field | DocType |
operating system | System software,Computer science,Parallel computing,Networking hardware,Fault tolerance,Software architecture,Operating system,Computer cluster,Scalability,Context switch,Distributed computing,Embedded system | Conference |
Volume | ISSN | Citations |
3149 | 0302-9743 | 1 |
PageRank | References | Authors |
0.48 | 18 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Eitan Frachtenberg | 1 | 1060 | 85.08 |
Kei Davis | 2 | 1 | 0.48 |
Fabrizio Petrini | 3 | 2050 | 165.82 |
Juan Fernandez | 4 | 269 | 23.17 |
José Carlos Sancho | 5 | 382 | 29.97 |