Abstract | ||
---|---|---|
We propose a method for identifying the sources of problems in complex production systems where, due to the prohibitive costs of instrumentation, the data available for analysis may be noisy or incomplete. In particular, we may not have complete knowledge of all components and their interactions. We define influences as a class of component interactions that includes direct communication and resource contention. Our method infers the influences among components in a system by looking for pairs of components with time-correlated anomalous behavior. We summarize the strength and directionality of shared influences using a Structure-of-Influence Craph (SIC). This paper explains how to construct a SIC and use it to isolate system misbehavior, and presents both simulations and in-depth case studies with two autonomous vehicles and a 9024-node production supercomputer. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/DSN.2010.5544921 | 2010 IEEE-IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS DSN |
Keywords | Field | DocType |
production system | Graph,Supercomputer,Computer science,Resource contention,Real-time computing,Surprise,Distributed computing,Anomalous behavior | Conference |
ISSN | Citations | PageRank |
1530-0889 | 31 | 1.31 |
References | Authors | |
17 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Adam J. Oliner | 1 | 715 | 51.10 |
Ashutosh V. Kulkarni | 2 | 43 | 1.96 |
Alex Aiken | 3 | 5009 | 461.41 |