Tentative Lecture Schedule

Blog resources:



Lecture Date Topic  Reading Comments
1 Jan 21 Overview, Logistics, Goals (notes)
2 Jan 26 Remembering the Past (notes)
  1. An experimental time-sharing system [Andy]
  2. Virtual storage and virtual machine concepts [Ion]
  3. Introduction to Network Computer [YanPei]
3 Jan 28 Datacenter Architecture Examples
  1. An Architecture for Modular Data Centers [Tyson]
  2. Web Search for a Planet: The Google Cluster Architecture [Lucian]
  3. Facebook Architecture Presentation [Ion]
4 Feb 2 Failures in Datacenters
  1. Failure stories: [Pete]
  2. Failure Trends in a Large Disk Drive Population [Ganesh]
  3. Designing a highly availabile directory service[Matei]
5 Feb 4 Datacenter Networking (I)
  1. A Scalable, Commodity Data Center Network Architecture [Lucian]
  2. DCell: A Scalable and Fault-Tolerant Network Structure for Data [Junda]
  3. A Policy-aware Switching Layer for Data Centers [Dilip, invited]
6 Feb 9 eBay Architecture [Franco Travostino, eBay]
  1. eBay Scalling Odyssey
  2. All I Need is Scale!
7 Feb 11 Google Storage Stack
  1. The Google File System [Kristal]
  2. The Chubby Lock Service for Loosely-Coupled Distributed Systems [Ganesh]
  3. Bigtable: A Distributed Storage System for Structured Data [Andy]
Feb 16 Presidents' Day Holiday
8 Feb 18 Dynamo [Peter Vosshall, Amazon]
    Dynamo: Amazon's Highly Available Key-Value Store
9 Feb 23 Initial Project Proposal Presentations
10 Feb 25 Other Structured and Relational Storage
  1. HIVE: Data Warehousing & Analytics on Hadoop [Matei]
  2. PNUTS: Yahoo!'s Hosted Data Serving Platform [Tyson]
  3. SCADS [Michael, invited]
11 March 2 Pig Latin Presentation [Chris Olston, Yahoo! Research]
    Pig Latin: A Not-So-Foreign Language for Data Processing
12 March 4 Computation Models [Mihai Budiu, MSR, presentation on DryadLINQ, .ppt]
  1. MapReduce: Simplified Data Processing on Large Clusters [Andrew]
  2. Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks [Lucian]
  3. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language [Kristal]
13 March 9 Big Data in the Clouds [Jeff Hammerbacher, Cloudera] Global Information Platforms (Evolving the Data Warehouse)
14 March 11 Datacenter Networking (II) [David Maltz, MSR]
15 March 16 Software Platforms
  1. Azure (I), Azure (II) [Kristal]
  2. Comprehensive review of security and vulnerability protections [Yanpei]
  3. AWS [Lucian]
16 March 18 Tracing and Monitoring (I)
  1. Artemis [Todd]
  2. Scribe [Gunho]
  3. Chukwa [Andy]
March 23 Spring break
March 25 Spring break
17 March 30 Tracing and Monitoring (II)
  1. DTrace: Dynamic Instrumentation of Production Systems [Ganesh]
  2. XTrace [Daekyeong]
  3. Friday: Global Comprehension for Distributed Replay [Gautam, invited]
18 April 1 Language Support: Erlang and Scala
  1. Open-source Erlang - White Paper [Matei]
  2. Erlang - A survey of the language and its industrial applications [Gunho]
  3. Generics of a Higher Kind (and an easier to understand presentation) [Todd]
19 April 6 Cancelled [Ion away]
20 April 8 Language Support: Ruby on Rail [Armando Fox]
21 April 13 Check-point Project Presentations.
22 April 15 Cloud Interoperability
  1. Open Cloud Manifesto [Yanpei]
  2. Portable computing , AppDrop [Junda]
  3. Cloud: commodity or proprietary? , LAMP Stack [Ragnar]
23 April 20 From Datacenter to DatacenterS
  1. Scalling Out [Kristal]
  2. Flexible, Wide-Area Storage for Distributed Systems with WheelFS [Lucian]
  3. (...we'll also re-discuss PNUTS [Tyson] in this lecture...)
24 April 22 Cancelled [Ion at EECS Retreat]
25 April 27 CDNs and p2p Systems
  1. Measuring and Evaluating Large-Scale CDNs [Matei]
  2. Incentives Build Robustness in BitTorrent [Tyson]
  3. BOINC: A System for Public-Resource Computing and Storage [Andy]
26 April 29 Cloud Economics [Kevin Lai, HP Research]
27 May 4 Energy Efficiency in Datacenters
  1. MS blog on Date Center Energy Efficiency (part 1, part 2 , part 3) [Yanpei]
  2. Reducing Data Center Energy Consumption [Gunho]
  3. Hamilton's notes on Data Center Energy Summit (1, 2) [Ganesh]
28 May 6 Course Summary / Course Evaluation
29 May 11 Poster Session (6th floor alcove) [Project report due: May 18]