Course Agenda: NetRADS (Network Support for Reliable
Adaptive
Distributed Systems)
Week 1
17 January
- Martin Luther King's Birthday Holiday--No class meeting
19 January
- Organizational Session
- Discussion of NetRADS NSF Proposal
Week 2
24 January
- What is the Structure of Internet
Services and Why do They Fail? What can we learn from the ROC
project?
- Tutorial on system reliability: J. Gray, "Why do computers stop
and what can be done about it?," Tandem TR 85.7, (June 1985). [PDF]
- E. Brewer, "Lessons from Giant-Scale Services," IEEE Internet Computing, V 5., N.
4, (2001). [PDF]
[Slides]
- A. Fox, D. Patterson, "Self-Repairing Computers," Scientific American, (June 2003). [PDF]
- D. Oppenheimer, A. Ganapathi, D. A. Patterson,"Why Do Internet
Services Fail, and What Can Be Done About It?," USENIX Symp. On
Internet Technologies and Systems, Seattle, WA, (March 2003). [PDF] [Slides]
26 January
- Why (and How) do Networks Fail?
Background and Metrics
- Tutorials on router architectures, ISP topologies and peering, BGP
- General discussion of reliability attributes of networks, network
services, and network equipment
- A. Vahdat, J. Chase, M. Dahlin, “The Perfect Storm: Reliability
Benchmarking for Global-Scale Services,” 2003. [PDF]
- Notes from David Chu: [PDF]
Week 3
31 January
- Why (and How) do Networks Fail?
Early Work on Wide-Area Routing Anomalies and Instabilities
- V. Paxson, "End-to-End Routing Behavior in the Internet," IEEE/ACM Transactions on
Networking, Vol.5, No.5, pp. 601-615, October
1997. [Zipped
PS]
- R. Govindan, A. Reddy, “An Analysis of Internet Interdomain
Topology and Route Stability,” IEEE Infocomm, Kobe, Japan, (April
1997). [PDF]
- R. Govindan, C. Alaettinoglu, G. Eddy, D. Kessens, S. Kumar, W.
S. Lee, "An Architecture for Stable, Analyzable Internet Routing," IEEE
Network Magazine, 13(1), (1999), pp. 29-35. [PDF]
2 February
- Why (and How) do Networks Fail?
Wide-Area Instabilities (continued)
- C. Labovitz, G. R. Malan, F.
Jahanian, “Internet Routing Instability,” IEEE/ACM Trans. On
Networking, V. 6, N. 5, (October 1998), pp. 515-528. Original
paper
appeared in SIGCOMM 1997, Cannes, France, (August 1997). [PDF]
- C. Labovitz, A. Ahuja, F. Jahanian,
“Experimental Study of Internet Stability and Backbone Failures,” Proc.
FTCS-29, Wisconsin, 1999. [PDF]
- A. Shaikh, A. Varma, L. Kalampoukas, R. Dube, “Routing Stability
in Congested Networks: Experimentation and Analysis,” ACM SIGCOMM 2000,
Stockholm, Sweden, (August 2000). [PDF]
[Slides]
Week 4
7 February
- Why (and How) do Networks Fail?
Wide-Area Instabilities: More Recent Work (continued)
- R. Teixeira, T. G. Griffin, G. Voelker, A. Shaikh, "Network
Sensitivity to Hot Potato Disruptions," ACM SIGCOMM 2004, (August
2004), Portland, OR. [PDF]
- R. Teixeira, A. Shaikh, T. Griffin, J. Rexford, "Dynamics of
Hot-Potato Routing in IP Networks," ACM SIGMETRICS 2004, New York, NY,
(June 2004). [PDF]
- M. Roughan, T. G. Griffin, Z. M. Mao, A. Greenberg, B. Freeman,
"IP
Forwarding Anamolies and Improving their Detection Using Multiple Data
Sources," Workshop on Network Troubleshooting: Research, Theory and
Operations Practice Meet Malfunctioning Reality, ACM SIGCOMM, (August
2004), Portland, OR. [PDF]
9 February
- Discussion on What We Have Learned
About Wide-Area Routing Anomalies and Instabilities [PDF]
- Discussion of Possible Course
Research Projects
Week 5
14 February
- Why (and How) do Networks Fail?
Summer 2001: Fires and Worms (continued)
- Nick Weaver's Tutorial on Network Worms [RTF,
PDF]
- Keynote Systems' White Paper [Doc]
and Subsequent Press Releases [HTML,
HTML,
HTML]
- J. Cowie, A. T. Ogielski, B. J. Premore, Y. Yuan, “Global Routing
Instabilities Triggered by Code Red II and Nimda Worm Attacks,” Rensys
Corporation White Paper, Hanover, NH, (December 2001). [HTML] [Slides]
16 February
- Why (and How) do Networks Fail? More
Worms
- D. Moore, C. Shannon, J. Brown, “Code Red: A Case Study on the
Spread and Victims of an Internet Worm,” 2nd ACM Internet Measurement
Workshop, Marseille, France, (November 2002). [PDF]
- L. Wang, Z. Zhao, D. Pei, R. Bush, D. Massey, A. Mankin, S. F.
Wu,
L. Zhang, “Observation and Analysis of BGP Behavior under Stress,” 2nd
ACM Workshop on Internet Measurement, Marseille, France, (2002). [PDF]
[Slides]
- M. Lad, X. Zhao, B. Zhang, D. Massey, L. Zhang, “Analysis of BGP
Update Surge during Slammer Worm Attack,” Fifth International Workshop
on Distributed Computing, Calcutta, India, (December 2003). [PDF]
Week 6
21 February
- Presidents' Day Holiday -- No Class Meeting
23 February
- Tutorial on Programming the IBM
BladeCenter Architecture
Week 7
28 February
- DNS as an Essential but Vulnerable
Internet Service
- Tutorial on DNS
- V. Pappas, Z. Xu, S. Lu, D.
Massey, A. Terzis, L. Zhang, "Impact of Configuration Errors on DNS
Robustness," ACM SIGCOMM 2004, Portland, OR, (August 2004). [PDF]
- V. Pappas, P. Faltstrom, D. Massey, L. Zhang, "Distributed DNS
Troubleshooting," Workshop on Network Troubleshooting: Research, Theory
and Operations Practice Meet Malfunctioning Reality, ACM SIGCOMM,
(August 2004), Portland, OR. [PDF]
2 March
- Configuration and Operator Errors as
a Source of Failure
- A. B. Brown, D. A. Patterson, "Undo for Operators:Building an
Undoable E-mail Store" 2003 USENIX Annual Technical Conference, San
Antonio, TX, (June 2003). [PDF]
- H. J. Wang, J. C. Platt, Y. Chen, R. Zhang, Y-M Wang, "Automatic
Misconfiguration Troubleshooting with PeerPressure," OSDI 2004, San
Francisco, CA, (December 2004). [PDF]
- K. Nagaraja, F. Oliveira, R. Bianchini, R. P. Martin, T. D.
Nguyen, "Understanding and Dealing with Operator Mistakes in Internet
Services," OSDI 2004, San Francisco, CA, (December 2004). [PDF]
Week 8
7 March
- Discussion on What We Have Learned
About Sources of Failures in Networks
- Discussion of Possible Course
Research Projects
9 March
- Active Networks
- D. L. Tennenhouse, J. M. Smith, W. D. Sincoskie, D. J. Wetherall,
G. J. Minden, "A Survey of Active Network Research," IEEE
Communications Magazine, V. 35, N. 1, (January 1997), pp. 80-86.
[Zipped
PS ]
- D. Wetherall, "Active Network Vision and Reality: Lessons from a
Capsule-based System," 17th Symp. on Operating Systems Principles,
Kiawah Island, SC, (December 1999). [PDF]
- B. Schwartz, A. Jackson, W. T. Strayer, W. Zhou, R. D. Rockwell,
C.
Partridge, "Smart Packets: Applying Active Networks to Network
Management," ACM Trans. Computer Systems, V 18, N 1, (2000), pp. 67-88.
[PDF]
Week 9
14 March
- Formal Project Presentations
16 March
- Formal Project Presentations (continued)
Spring Break
Week 10
28 March
- Evolving the Internet
- D. D. Clark, C. Partridge, J. C. Ramming, J. T. Wroclawski, "A
Knowledge Plane for the Internet," ACM SIGCOMM 2003, Karlsruhe,
Germany, (August 2003). [PDF]
- L. Peterson, D. Culler, T. Anderson T. Roscoe, "A Blueprint for
Introducing Disruptive Technology into the Internet", Proc. 1st
Workshop on Hot Topics in Networks (HotNets-I), Princeton, NJ, (October
2002). [PDF]
30 March
- Network Virtualization
- J. Turner, "Virtualizing the Net: A Strategy for Network
De-ossification", HotNets-III 2004 Keynote Presentation, San Diego, CA,
(November 2004). [Slides]
- L. Peterson, S. Shenker, J. Turner, "Overcoming the Internet
Impasse Through Virtualization," HotNets-III, San Diego, CA, (November
2004). [PDF]
Week 11
4 April
- New Architectures
- I. Stoica, D. Adkins, S. Zhuang, S. Shenker, S. Surana, "Internet
Indirection Architecture," ACM SIGCOMM 2002, Pittsburgh, PA, (August
2002), pp. 73-86. [PDF]
- M. Walfish, J. Stribling, M. Krohn, H. Balakrishnan, R. Morris,
S. Shenker, "Middleboxes No Longer Considered Harmful," USENIX
OSDI 2004, San Francisco, CA, (December 2004). [PDF]
6 April
Week 12
11 April
- New Architectures (continued)
- D. R. Cheriton, M. Gritter, "TRIAD: a Scalable Deployable
NAT-based Internet Architecture," (March 2000). [PDF]
- D. G. Andersen, H. Balakrishnan, M. F. Kaashoek, R. Morris,
"Resilient Overlay Networks," 18th ACM SOSP, Banff, Canada, (October
2001), pp. 131-145. [PDF]
13 April
- Project Progress Updates (continued)
Week 13
18 April
- Network Observations
- A. Feldmann, O. Maennel, Z. M. Mao, A. Berger, B. Maggs,
"Locating
Internet Routing Instabilities," SIGCOMM 2004, Portland, OR, (August
2004). [PDF]
- R.
Pang, V. Yegneswaran, P Barford, V. Paxson, L. Peterson,
"Characteristics of Internet Background Radiation," ACM Internet
Measurement Conference 2004, Taormina, Sicily, (October 2004) [PDF]
20 April
- Network Protocol Design and
Verification
- T. Anderson, S. Shenker, I. Stoica, D. Wetherall, "Design
Guidelines for Robust Internet Protocols," SIGCOMM 2002, Princeton,
NJ, (October 2002). [PDF]
- N. Feamster,
"Practical Verification Techniques for Wide-Area Routing," HotNets-II,
Cambridge, MA , (November 2003). [PDF]
Week 14
25 April
- Special Topics
- K. Argyrakiz, P. Maniatisy, D. Cheriton, S. Shenker, "Providing
Packet Obituaries," Proc. HotNets-III, San Diego, CA, (November 2004). [PDF]
[Slides]
- V. N. Padmanabhan, S. Ramabhadran, J. Padhye, "NetProfiler:
Profiling Wide-Area Networks Using Peer Cooperation," Proc. IPTPS 2005,
Ithaca, NY, (February 2005). [PDF]
27 April
- Special Topics (continued)
- K. Xu, Z. Zhang, S. Bhattacharyya, "Profiling Internet Backbone
Traffic: Behavior Models and Applications", Sprint ATL Research Report
RR05-ATL-020777, Burlingame, CA, (April 2005). [PDF]
- K. Xu, J. Chandrashekar, Z. Zhang, "Inferring Major Events from
BGP Update Streams," U. Minnesota Tech Report 04-043, 2004. [PDF]
Week 15
2 May
- General Discussion of Lessons Learned and Course Feedback
4 May
- End of Course Celebrations
Week 16
9 May
- Final Project Presentations
Last updated: 21 April 2005, randy@cs.Berkeley.edu