Title
OFRewind: enabling record and replay troubleshooting for networks
Abstract
Debugging operational networks can be a daunting task, due to their size, distributed state, and the presence of black box components such as commercial routers and switches, which are poorly instrumentable and only coarsely configurable. The debugging tool set available to administrators is limited, and provides only aggregated statistics (SNMP), sampled data (NetFlow/sFlow), or local measurements on single hosts (tcpdump). In this paper, we leverage split forwarding architectures such as OpenFlow to add record and replay debugging capabilities to networks - a powerful, yet currently lacking approach. We present the design of OFRewind, which enables scalable, multi-granularity, temporally consistent recording and coordinated replay in a network, with fine-grained, dynamic, centrally orchestrated control over both record and replay. Thus, OFRewind helps operators to reproduce software errors, identify datapath limitations, or locate configuration errors.
Year
Venue
Keywords
2011
USENIX Annual Technical Conference
configuration error,daunting task,debugging tool,black box component,commercial routers,local measurement,aggregated statistic,enabling record,leverage split forwarding,datapath limitation,coarsely configurable
Field
DocType
Citations 
Troubleshooting,sFlow,Datapath,NetFlow,Computer science,Real-time computing,OpenFlow,Scalability,Simple Network Management Protocol,Debugging,Distributed computing
Conference
63
PageRank 
References 
Authors
4.33
24
4
Name
Order
Citations
PageRank
Andreas Wundsam135523.00
Dan Levin230118.59
Srini Seetharaman355331.32
Anja Feldmann44935596.02