Abstract | ||
---|---|---|
Problem determination in a database management system can be a difficult task given the complexity of the system and the large amount of data that must be collected and analyzed. Monitoring the system for this data incurs overhead and has a detrimental effect on application performance. As an alternative to the standard practice of storing the performance data and performing offline analysis, we examine an approach where monitoring data is produced as a continuous data stream and data stream mining techniques are applied. We implement this approach as a prototype system called Tempo on IBM DB2®. Tempo implements Top-K analysis, which is a common task performed by database administrators for problem determination. Top-K analysis typically identifies the set of most frequently occurring events, or the highest consumers of system resources. Our experimental evaluation indicates that Tempo is time and space efficient, incurs low overhead, and produces accurate results. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1145/1923947.1923969 | CASCON |
Keywords | Field | DocType |
lightweight problem determination,continuous data stream,problem determination,offline analysis,system resource,prototype system,performance data,data stream analysis technique,top-k analysis,database management system,data stream mining technique,data incurs overhead,data stream mining | Data mining,Data stream mining,IBM,Computer science,Data stream,Problem determination,Real-time computing,Offline analysis,Database administrator,Data stream analysis,Database,Distributed computing | Conference |
Citations | PageRank | References |
1 | 0.35 | 8 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jing Huang | 1 | 1 | 0.35 |
Patrick Martin | 2 | 1 | 0.35 |
Wendy Powley | 3 | 329 | 28.43 |
Paul Bird | 4 | 119 | 8.78 |
Dmitri Abrashkevich | 5 | 1 | 0.35 |