Title
Continuous profiling: where have all the cycles gone?
Abstract
This article describes the Digital Continuous Profiling Infrastructure, a sampling-based profiling system designed to run continuously on production systems. The system supports multiprocessors, works on unmodified executables, and collects profiles for entire systems, including user programs, shared libraries, and the operating system kernel. Samples are collected at a high rate (over 5200 samples/sec. per 333MHz processor), yet with low overhead (1–3% slowdown for most workloads). Analysis tools supplied with the profiling system use the sample data to produce a precise and accurate accounting, down to the level of pipeline stalls incurred by individual instructions, of where time is bring spent. When instructions incur stalls, the tools identify possible reasons, such as cache misses, branch mispredictions, and functional unit contention. The fine-grained instruction-level analysis guides users and automated optimizers to the causes of performance problems and provides important insights for fixing them.
Year
DOI
Venue
1997
10.1145/265924.265925
Special Interest Group on Operating Systems
Keywords
DocType
Volume
performance-monitoring hardware,profiling,profiling system,fine-grained instruction-level analysis guide,accurate accounting,operating system kernel,production system,continuous profiling,program analysis,analysis tool,digital continuous profiling infrastructure,automated optimizers,performance understanding,entire system,sampling-based profiling system,operating system,functional unit,system design
Journal
15
Issue
ISSN
ISBN
4
0163-5980
0-89791-916-5
Citations 
PageRank 
References 
255
66.94
12
Authors
10
Search Limit
100255
Name
Order
Citations
PageRank
Jennifer M. Anderson1849170.27
Lance M. Berc225566.94
Jeffrey Dean325566.94
Sanjay Ghemawat4132421185.52
Monika Rauch Henzinger54307481.86
Shun-Tak Leung62610356.31
Richard L. Sites7522278.05
Mark T. Vandevoorde832681.95
Carl Waldspurger92003336.72
William E. Weihl102614903.11