Title
REPT: Reverse Debugging of Failures in Deployed Software.
Abstract
Debugging software failures in deployed systems is important because they impact real users and customers. However, debugging such failures is notoriously hard in practice because developers have to rely on limited information such as memory dumps. The execution history is usually unavailable because high-fidelity program tracing is not affordable in deployed systems.In this paper, we present REPT, a practical system that enables reverse debugging of software failures in deployed systems. REPT reconstructs the execution history with high fidelity by combining online lightweight hardware tracing of a programu0027s control flow with offline binary analysis that recovers its data flow. It is seemingly impossible to recover data values thousands of instructions before the failure due to information loss and concurrent execution. REPT tackles these challenges by constructing a partial execution order based on timestamps logged by hardware and iteratively performing forward and backward execution with error correction.We design and implement REPT, deploy it on Microsoft Windows, and integrate it into WinDbg. We evaluate REPT on 16 real-world bugs and show that it can recover data values accurately (92% on average) and efficiently (in less than 20 seconds) for these bugs. We also show that it enables effective reverse debugging for 14 bugs.
Year
Venue
Field
2018
OSDI
Microsoft Windows,Computer science,Control flow,Real-time computing,Error detection and correction,Software,Timestamp,Operating system,Tracing,Debugging,Data flow diagram
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
7
Name
Order
Citations
PageRank
Weidong Cui1118056.04
Xinyang Ge2956.54
Baris Kasikci313312.99
Ben Niu41525.38
Upamanyu Sharma500.34
Ruoyu Wang628216.23
Insu Yun7905.74