Reading and Schedule

Below are the tentative schedule and reading list for this course.

Date Reading Lead
9/27 Datacenter Overview
The Datacenter as a Computer -- An Introduction to the Design of Warehouse-Scale Machines (Ch 1, 2, 6, 7; briefly Ch 3, 4, 5)
Questions

No need to submit anything for this reading

Additional Readings

  1. Building Large-Scale Internet Services (Google)

Yiying
Slides
10/2 Cloud Overview
Above the Clouds: A Berkeley View of Cloud Computing
Questions

  1. Name three pros and three cons of cloud computing.
  2. Despite the obstacles listed in the paper, cloud computing has happened and is now almost everywhere in our lives. What do you think are the fundamental reasons behind its success?
  3. What do you think is the future of cloud computing?

Additional Readings

  1. Amazon AWS
  2. Microsoft Azure
  3. Google Cloud Platform (GCP)
  4. XaaS article 1
  5. XaaS article 2

Yiying
Slides
10/4 Virtualization
Comet Book Chapter on Virtual Machine Monitors
Questions

  1. During normal application run time (when the application does not cause any traps), does running the application in a VM have any performance overhead?
  2. How can the VMM know when to install a shadow page table entry? What exactly happens when a VM wants to create a new page table entry on a hardware-managed-TLB platform? (A small sketch of shadow-page-table maintenance follows this list.)
  3. Is there any way to reduce the overhead of the return path of a trap (steps 3, 4, 5 in Figure B.3)?
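
For question 2, here is a minimal, simulated sketch of how a VMM might keep a shadow page table in sync with the guest's page table. All names (guest_pt, p2m, shadow_pt, trap_on_guest_pt_write) are hypothetical; a real VMM traps the guest's page-table writes by write-protecting the guest's page-table pages, which is only modeled here.

```python
# A toy model of shadow paging, not any real VMM's implementation.
guest_pt = {}    # guest virtual page -> guest "physical" frame (guest-visible)
p2m = {}         # guest "physical" frame -> host physical frame (VMM-private)
shadow_pt = {}   # guest virtual page -> host physical frame (used by the MMU)

def host_frame_for(gpa):
    # Lazily back a guest frame with a host frame; a stand-in for a real allocator.
    return p2m.setdefault(gpa, "host_frame_%d" % len(p2m))

def trap_on_guest_pt_write(gva, gpa):
    """Invoked when the guest's attempt to install gva -> gpa in its own page
    table faults (because the VMM write-protected the guest's page-table pages).
    The VMM emulates the write and installs the matching shadow entry."""
    guest_pt[gva] = gpa                    # emulate the guest's intended update
    shadow_pt[gva] = host_frame_for(gpa)   # keep the shadow table consistent

# Example: the guest maps virtual page 0x4 to its "physical" frame 0x9.
trap_on_guest_pt_write(0x4, 0x9)
print(shadow_pt[0x4])   # the host frame the hardware MMU will actually use
```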

Additional Readings

  1. Memory Resource Management in VMware ESX Server
  2. Disco: Running Commodity Operating Systems on Scalable Multiprocessors (TOCS'97)
  3. Scale and Performance in the Denali Isolation Kernel
  4. Xen and the Art of Virtualization
  5. Difference Engine: Harnessing Memory Redundancy in Virtual Machines
  6. The Turtles Project: Design and Implementation of Nested Virtualization
  7. vIC: Interrupt Coalescing for Virtual Machine Storage Device IO
  8. ELI: Bare-Metal Performance for I/O Virtualization
  9. A Comparison of Software and Hardware Techniques for x86 Virtualization
  10. Software Techniques for Avoiding Hardware Virtualization Exits
  11. Live Migration of Virtual Machines
  12. Remus: High Availability via Asynchronous Virtual Machine Replication

Yiying Slides
10/9 Container
Understanding and Hardening Linux Containers (mainly Ch 2 to Ch 5; you can ignore many of the details in these chapters. Read Ch 1 for more background on virtualization. Read other chapters if you are interested in security.)
Questions

  1. What types of isolation do Linux containers achieve?
  2. Can one Linux container affect the performance of another Linux container on the same machine (i.e., is there performance isolation)? Why or why not? (See the cgroup sketch after this list.)
  3. Why do you think containers are less "secure" than virtual machines?
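
For question 2, here is a hedged sketch of one mechanism behind performance isolation: cgroup v2 CPU and memory limits. It assumes root privileges, a cgroup v2 hierarchy mounted at /sys/fs/cgroup, and an arbitrary group name "demo"; it is illustrative, not a hardened container runtime.

```python
import os

CG = "/sys/fs/cgroup/demo"   # hypothetical group name; requires cgroup v2 and root
# (The cpu and memory controllers must be enabled in the parent's
#  cgroup.subtree_control for these limit files to exist.)

def setup_cgroup(cpu_quota_us=20000, cpu_period_us=100000, mem_bytes=256 * 2**20):
    os.makedirs(CG, exist_ok=True)
    # cpu.max = "<quota> <period>": at most 20 ms of CPU every 100 ms (~20% of a core).
    with open(os.path.join(CG, "cpu.max"), "w") as f:
        f.write(f"{cpu_quota_us} {cpu_period_us}")
    # memory.max: hard memory cap; the kernel reclaims or OOM-kills beyond it.
    with open(os.path.join(CG, "memory.max"), "w") as f:
        f.write(str(mem_bytes))

def enter_cgroup(pid=None):
    # Writing a PID to cgroup.procs moves that process (and its future children)
    # under the group's limits.
    with open(os.path.join(CG, "cgroup.procs"), "w") as f:
        f.write(str(pid if pid is not None else os.getpid()))

if __name__ == "__main__":
    setup_cgroup()
    enter_cgroup()
    # ... run the workload to be isolated here ...
```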

Additional Readings

  1. LXC/LXD
  2. Docker
  3. Kubernetes
  4. Unikernels: Library Operating Systems for the Cloud
  5. My VM is Lighter (and Safer) than your Container
  6. Borg, Omega, and Kubernetes (Google)
  7. Slacker: Fast Distribution with Lazy Docker Containers
  8. Amazon Fargate
  9. Kata Containers

Yiying Slides
10/11 Serverless
Cloud Programming Simplified: A Berkeley View on Serverless Computing (alternative link)
Questions

  1. Current datacenters use containers as the hosts to run serverless functions. Do you think that is a good approach? Why or why not?
  2. Today's serverless functions are stateless. How do you think different functions can share data and communicate? (See the sketch after this list.)
  3. Can you think of any security threats to serverless computing? Bonus points if you can outline a real threat/attack.
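
For question 2, here is a minimal sketch of why stateless functions must communicate through external services. The handler follows the common Lambda-style def handler(event, context) shape, but FakeStore is only an in-process stand-in for a real shared store (object storage, a key-value service, a queue, etc.).

```python
import json

class FakeStore:
    """Stand-in for an external shared store; in production this would be an
    object store or key-value service, since local function state is not
    guaranteed to survive across invocations."""
    def __init__(self):
        self._blobs = {}
    def get(self, key, default=None):
        return self._blobs.get(key, default)
    def put(self, key, value):
        self._blobs[key] = value

store = FakeStore()   # stands in for the only place state can safely live

def handler(event, context):
    # Anything a later invocation (or a different function) needs must be
    # written to shared storage, not kept in process memory.
    count = store.get("invocations", 0) + 1
    store.put("invocations", count)
    return {"statusCode": 200, "body": json.dumps({"count": count})}

# Local test: two invocations "communicate" only through the store.
print(handler({}, None))
print(handler({}, None))
```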

Additional Readings

  1. Amazon Lambda
  2. Google Cloud Functions
  3. Azure Functions
  4. Amazon Firecracker
  5. Pocket: Elastic Ephemeral Storage for Serverless Analytics (OSDI'18)
  6. Occupy the Cloud: Distributed Computing for the 99% (PyWren)
  7. SAND: Towards High-Performance Serverless Computing
  8. Taking the Cloud-Native Approach with Microservices
  9. Microservices by James Lewis and Martin Fowler
  10. Introduction to Microservices by Nginx

Lihao, Zhipeng, Yihan
10/16 Resource Disaggregation
LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation (OSDI'18)
Questions

  1. What are the major benefits and weaknesses of resource disaggregation?
  2. List the steps that happen in LegoOS when an application allocates new virtual memory (e.g., by calling malloc) and the steps that happen when it first accesses that memory. (A generic sketch of this access path follows this list.)
  3. Do you think it is a good idea to build serverless systems on top of a resource-disaggregated datacenter? Why or why not? (Bonus points for answering "how to build one?")
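
For question 2, here is a generic model (not LegoOS's actual protocol) of the access path on a compute node that keeps a small local cache and fetches missing pages from a remote memory node: allocation only reserves the virtual range, and the first touch pays a network round trip.

```python
PAGE = 4096
allocated = set()     # virtual pages reserved by malloc/mmap (no backing yet)
local_cache = {}      # virtual page -> data cached at the compute node
memory_node = {}      # virtual page -> data held at the remote memory node

def alloc(vaddr, size):
    # Allocation only reserves virtual pages; no memory is committed anywhere.
    for page in range(vaddr // PAGE, (vaddr + size - 1) // PAGE + 1):
        allocated.add(page)

def access(vaddr):
    page = vaddr // PAGE
    assert page in allocated, "fault: unallocated address"
    if page not in local_cache:
        # Local miss: one round trip to the memory node, which allocates the
        # page on first touch and returns it to be cached locally.
        memory_node.setdefault(page, bytearray(PAGE))
        local_cache[page] = memory_node[page]
    return local_cache[page]

alloc(0x10000, 8192)      # "malloc": reserve two virtual pages
access(0x10004)           # first touch: remote allocate + fetch
access(0x10008)           # later touches to the same page hit the local cache
```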

Additional Readings

  1. Disaggregated Memory for Expansion and Sharing in Blade Servers
  2. Scale-Out NUMA
  3. Shoal: A Lossless Network for High-density and Disaggregated Racks
  4. Flash Storage Disaggregation
  5. Understanding Rack-Scale Disaggregated Storage
  6. R2C2: A Network Stack for Rack-scale Computers

Zhiyuan
10/18 Historical
The Amoeba Distributed Operating System - A Status Report

Security
A Systematic Evaluation of Transient Execution Attacks and Defenses
Towards Trusted Cloud Computing
Questions

  1. What are the targeted use cases of Amoeba?
  2. Name at least two advantages and one disadvantage of Amoeba's processor pool + specialized servers model.
  3. Why does Amoeba choose to use "immutable" files? What are the advantages and what type of workloads can benefit from this design?
  1. Choose one of the transient execution attack scenarios listed in the first paper and develop a realistic attack scenario in the cloud on top of it. Specifically, your victim and attacker should both be cloud users. Describe who the victim and attacker are, how the attacker does harm with transient execution, and what the outcome of the attack is. You do not need to develop any real attack; just think of a story.
  2. Choose two defenses discussed in the two papers and discuss their implications for application performance (compared to no defense).
  3. After reading these two papers, do you trust cloud more? Do you think security will remain a key challenge in cloud computing?

Additional Readings

  1. The Sprite Network Operating System
  2. A Comparison of Two Distributed Systems: Amoeba and Sprite
  3. Distributed Shared Memory: A Survey of Issues and Algorithms
  4. Distributed Shared Memory: Concepts and Systems
  5. Shasta: A Low Overhead, Software-Only Approach for Supporting Fine-Grain Shared Memory
  1. Amazon Web Services: Overview of Security Processes (Choose any topics you find interesting to read)
  2. Meltdown: Reading Kernel Memory from User Space
  3. Spectre Attacks: Exploiting Speculative Execution
  4. On the Meltdown & Spectre Design Flaws (by Mark Hill)
  5. Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds
  6. Flipping Bits in Memory Without Accessing Them: An Experimental Study of DRAM Disturbance Errors
  7. Intel SGX Explained
  8. Flush+Reload: a High Resolution, Low Noise, L3 Cache Side-Channel Attack
  9. Flush+Flush: A Fast and Stealthy Cache Attack
  10. Cache Template Attacks: Automating Attacks on Inclusive Last-Level Caches
  11. Cache attacks and countermeasures: the case of AES
  12. Last-Level Cache Side-Channel Attacks are Practical
  13. Throwhammer: Rowhammer Attacks over the Network and Defenses
  14. Pythia: Remote Oracles for the Masses
  15. ObliviStore: High Performance Oblivious Cloud Storage
  16. Shroud: Ensuring Private Access to Large-Scale Data in the Data Center
  17. TaoStore: Overcoming Asynchronicity in Oblivious Data Storage

Audrey, Zhipeng
10/23 Consensus
ZooKeeper: Wait-free coordination for Internet-scale systems (ATC'10)
Questions

  1. What is the goal of ZooKeeper? Why does ZooKeeper want to support asynchronous (or wait-free) requests? (A sketch of the paper's lock recipe follows this list.)
  2. Why is ordering of wait-free requests important in ZooKeeper?
  3. Why does ZooKeeper use replicated databases with snapshots and write-ahead logs? Is it enough to just store everything in memory?
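
For question 1, here is a sketch of the paper's "Simple Locks without Herd Effect" recipe, written against a hypothetical client object zk; the method names (create, get_children, wait_for_delete, delete) are illustrative, not a real client library's API.

```python
def acquire_lock(zk, lock_path="/lock"):
    # 1. Create an ephemeral, sequential znode under the lock node; the server
    #    appends a monotonically increasing sequence number to the name.
    me = zk.create(lock_path + "/lock-", ephemeral=True, sequential=True)
    my_name = me.split("/")[-1]
    while True:
        children = sorted(zk.get_children(lock_path))
        if my_name == children[0]:
            return me                      # lowest sequence number: lock acquired
        # 2. Watch only the znode immediately preceding ours; when it goes away
        #    we re-check, which avoids waking every waiter (no herd effect).
        prev = children[children.index(my_name) - 1]
        zk.wait_for_delete(lock_path + "/" + prev)

def release_lock(zk, me):
    zk.delete(me)   # or let the session expire: ephemeral znodes are removed
```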

Additional Readings

  1. The Chubby Lock Service for Loosely-Coupled Distributed Systems (OSDI'06)
  2. In Search of an Understandable Consensus Algorithm (ATC'14)
  3. Paxos Made Simple
  4. Chain Replication for Supporting High Throughput and Availability (OSDI'04)
  5. The Dangers of Replication and a Solution (SIGMOD'96)
  6. Managing Update Conflicts in Bayou, a Weakly Connected Replicated Storage System (SOSP'95)
  7. The Byzantine Generals Problem
  8. Byzantine Generals in Action: Implementing Fail-Stop Processors
  9. Weighted Voting for Replicated Data
  10. Time, Clocks, and the Ordering of Events in a Distributed System
  11. Viewstamped Replication: A New Primary Copy Method to Support Highly-Available Distributed Systems
  12. Consensus on Transaction Commit

Wenquan, Chih-hung
10/25 Storage
The Google File System (SOSP'03)
Dynamo: Amazon’s Highly Available Key-value Store
Questions

  1. What design decisions in GFS still make sense after a decade? What do you think were bad decisions?
  2. Why does GFS map full path names to metadata instead of maintaining directories the way traditional file systems do?
  3. What is eventual consistency, and why did Amazon choose this consistency level?
  4. How and when does Dynamo resolve conflicts? Why do you think Amazon and Google chose their respective consistency levels and storage models? (A small vector-clock sketch follows this list.)
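
For question 4, here is a small, self-contained sketch of the vector-clock reasoning Dynamo uses to detect conflicting versions; the replica names and merge policy are generic, not Dynamo's exact implementation.

```python
def descends(a, b):
    """True if clock a is a (possibly equal) descendant of clock b."""
    return all(a.get(node, 0) >= count for node, count in b.items())

def reconcile(versions):
    """Keep only versions not strictly dominated by another; anything left is
    a true conflict the application must merge (e.g., union of two carts)."""
    return [v for v in versions
            if not any(descends(w, v) and not descends(v, w) for w in versions)]

# Two replicas accepted writes concurrently: neither clock descends from the other.
v1 = {"replica_A": 2, "replica_B": 1}
v2 = {"replica_A": 1, "replica_B": 2}
print(descends(v1, v2), descends(v2, v1))   # False False -> concurrent versions
print(reconcile([v1, v2]))                  # both survive; the app must merge them
```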

Additional Readings

  1. GFS: Evolution on Fast-forward
  2. Finding a Needle in Haystack: Facebook's Photo Storage
  3. Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency
  4. Fast Crash Recovery in RAMCloud
  5. Cassandra - A Decentralized Structured Storage System
  6. TAO: Facebook’s Distributed Data Store for the Social Graph

Kaiqi, Qianqian, Zhanghan
10/30 Database
Choosing A Cloud DBMS: Architectures and Tradeoffs (VLDB 2019)
Questions

  1. Think of two metrics that you think are valuable to measure but the paper did not measure.
  2. Which database system among the ones tested in this paper do you think fits the serverless computing model best? Why?
  3. If you are building a website and have your web service running on AWS, which database system from the paper would you choose to store customer account information? Which would you choose for shopping cart data? Why?

Additional Readings

  1. Spanner: Google’s Globally-Distributed Database
  2. Bigtable: A Distributed Storage System for Structured Data
  3. Spark SQL: Relational Data Processing in Spark
  4. The Snowflake Elastic Data Warehouse
  5. Transaction Management in the R* Distributed Database Management System

Saurabh
11/1 Networking
A Scalable, Commodity Data Center Network Architecture (SIGCOMM'08)
A Clean Slate 4D Approach to Network Control and Management
Questions

  1. FatTree has many benefits and is thus widely deployed in many datacenters. What do you think is the main reason for its success? Can you think of a disadvantage/limitation of FatTree?
  2. FatTree's two-level routing is largely static (i.e., the path is decided mostly by the destination host IP) and FatTree does not do any congestion control. Do you think this can work well in real datacenters? (A small routing-table sketch follows this list.)
  3. In a way, the SDN approach resembles classical distributed systems (e.g., think of GFS). Which distributed-systems problems does SDN also face? Which distributed-systems problems does SDN not have? And which problems does SDN have that classical distributed systems do not? Give one or two examples for each.
  4. Software-defined datacenters and software-defined storage (and other software-defined things) have been hot topics in recent years (ref: the first two papers in the recommended readings). Do you think "software-defined" is just a buzzword, or do we really have a similar problem to solve and a similar approach to apply in other parts of the datacenter? Give a reason why you do or do not believe there should be "software-defined storage" (some call it SDS).
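
For question 2, here is a sketch of the two-level lookup behind FatTree's mostly static routing: a terminating prefix match keeps intra-pod traffic local, and a suffix match on the host ID deterministically spreads upward traffic across uplinks. The addresses and port numbers are made up for a small k=4 example and are not the paper's exact tables.

```python
import ipaddress

# (terminating prefix, output port): traffic to subnets in this pod stays local.
prefix_table = [
    (ipaddress.ip_network("10.2.0.0/24"), 0),
    (ipaddress.ip_network("10.2.1.0/24"), 1),
]
# (host-ID suffix, output port): everything else goes up, spread by the last
# octet of the destination (hosts are numbered .2 and .3 in a k=4 fat tree).
suffix_table = {2: 2, 3: 3}

def route(dst):
    dst = ipaddress.ip_address(dst)
    for net, port in prefix_table:
        if dst in net:
            return port                    # prefix hit: forward within the pod
    return suffix_table[int(dst) & 0xFF]   # prefix miss: pick an uplink by host ID

print(route("10.2.1.3"))   # intra-pod destination -> port 1
print(route("10.3.0.3"))   # inter-pod destination -> uplink chosen by suffix .3
```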

Additional Readings

  1. PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric
  2. Data Center TCP (DCTCP)
  3. Understanding Lifecycle Management Complexity of Datacenter Topologies
  4. OpenFlow Enabling Innovation in Campus Networks
  5. Onix: A Distributed Control Platform for Large-scale Production Networks
  6. FBOSS: Building Switch Software at Scale (Facebook)
  7. A Large Scale Study of Data Center Network Reliability (Facebook)
  8. TIMELY: RTT-based Congestion Control for the Datacenter
  9. Technology-Driven, Highly-Scalable Dragonfly Topology
  10. Chronos: Predictable Low Latency for Data Center Applications
  11. Jellyfish: Networking Data Centers Randomly
  12. Flattened Butterfly: A Cost-Efficient Topology for High-Radix Networks
  13. U-Net: A User-Level Network Interface for Parallel and Distributed Computing
  14. Helios: A Hybrid Electrical/Optical Switch Architecture for Modular Data Centers
  15. Leveraging Endpoint Flexibility in Data-Intensive Clusters
  16. Software Defined Batteries
  17. Reading list of SDN

Rui, Jiaxiang
11/6 Remote Memory
FaRM: Fast Remote Memory (NSDI'14)
Questions

  1. Why does FaRM use large 2GB pages?
  2. How many network round trips does a (not read-only) transaction take in FaRM?
  3. What are epochs used for in FaRM?

Additional Readings

  1. LITE Kernel RDMA Support for Datacenter Applications
  2. Remote Regions: a Simple Abstraction for Remote Memory
  3. Efficient Memory Disaggregation with Infiniswap
  4. HPE Memory-Driven Computing
  5. Using RDMA Efficiently for Key-value Services
  6. FaSST: Fast, Scalable and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs
  7. Using Onesided RDMA Reads to Build a Fast, CPU-efficient Key-value Store
  8. Datacenter RPCs can be General and Fast (NSDI'19)
  9. A Double-Edged Sword: Security Threats and Opportunities in One-Sided Network Communication
  10. Deconstructing RDMA-enabled Distributed Transactions: Hybrid is Better!

Jie, Yi, Hao
11/8 Resource Management
Large-scale cluster management at Google with Borg (EuroSys'15)
Questions

  1. How does Google use quotas to apply different policies to high- and low-priority jobs?
  2. Would the Borgmaster be a scalability bottleneck?
  3. In general, do you think the resource management problem is hard? Do you think "smarter" mechanisms like machine learning would be a better solution?

Additional Readings

  1. Borg, Omega, and Kubernetes (Google)
  2. Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms (SOSP'17)
  3. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
  4. Apache Hadoop YARN: Yet Another Resource Negotiator
  5. Resource Control @ FB

Haolan, Anmol
11/13 Dataflow
MapReduce: Simplified Data Processing on Large Clusters (OSDI'04)
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing (NSDI'12)
Questions

  1. MapReduce uses a very simple programming model with just two types of functions: map and reduce. Do you think this abstraction is enough to express and implement most datacenter applications? Can you give an example that is difficult to write with MapReduce? (A minimal word-count sketch follows this list.)
  2. MapReduce is probably the most widely adopted idea from a systems paper in the past decade. What do you think are the reasons behind this?
  3. Spark gained a lot of attention and usage in the last few years. Compared to MapReduce, what technical advantages does the Spark system have? What about broader reasons behind its success?
  4. MapReduce was implemented in C++. Hadoop (open source version of MapReduce) was implemented in Java. Spark was implemented in Scala. Why do you think they made the decisions to use these languages?
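
For question 1, here is the paper's canonical word-count example condensed into plain Python, with the shuffle/grouping a real MapReduce runtime performs between the two phases simulated by a dictionary. It is meant to make the two-function model concrete, not to reflect any real framework's API.

```python
from collections import defaultdict

def map_fn(_doc_name, contents):
    # Map: emit an intermediate (word, 1) pair for every word in the document.
    for word in contents.split():
        yield word, 1

def reduce_fn(word, counts):
    # Reduce: sum all counts emitted for the same word.
    yield word, sum(counts)

def run_mapreduce(inputs):
    # "Shuffle": group intermediate values by key, as the framework would do
    # across machines before invoking the reducers.
    groups = defaultdict(list)
    for doc_name, contents in inputs:
        for word, count in map_fn(doc_name, contents):
            groups[word].append(count)
    return dict(pair for word, counts in groups.items()
                for pair in reduce_fn(word, counts))

print(run_mapreduce([("doc1", "the cloud the datacenter"), ("doc2", "the cloud")]))
# -> {'the': 3, 'cloud': 2, 'datacenter': 1}
```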

Additional Readings

  1. Dryad: Distributed Data-parallel Programs from Sequential Building Blocks (Microsoft)
  2. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language

Minxiang, Li-An
11/15 Systems and Machine Learning
A Berkeley View of Systems Challenges for AI
Questions

  1. Can you think of a way to run composable AI on a serverless computing platform? What would you put into a serverless function? What data (state) needs to be communicated/stored?
  2. Could dataflow systems like Hadoop and Spark be used to implement ML training/inference? Do you think that is a good idea?
  3. Other than the challenges mentioned in the paper, could you list two other challenges of AI/ML?

Additional Readings

  1. TensorFlow: A System for Large-Scale Machine Learning (OSDI'16)
  2. TVM: An Automated End-to-End Optimizing Compiler for Deep Learning (OSDI'18)
  3. Ray: A Distributed Framework for Emerging AI Applications
  4. Project Adam: Building an Efficient and Scalable Deep Learning Training System
  5. Large Scale Distributed Deep Networks
  6. Scaling Distributed Machine Learning with the Parameter Server
  7. Mastering the game of Go with deep neural networks and tree search
  8. Deepmind Publications
  9. Playing Atari with Deep Reinforcement Learning

Side, Siman, Palash
11/20 Streaming / Video
SVE: Distributed Video Processing at Facebook Scale
Questions

  1. The paper does not have any consistency discussion. Can there be any consistency issues (e.g., with concurrent data accesses or parallel processing)? Why or why not?
  2. During encoding, there is one step that cannot be parallelized. What is it?
  3. Why is livestreaming a mismatch for SVE?
  4. Do you think video processing is a good use case of serverless computing? How would you design a video processing system on a serverless computing platform?

Additional Readings

  1. Popularity Prediction of Facebook Videos for Higher Quality Streaming
  2. Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads
  3. Discretized Streams: Fault-Tolerant Streaming Computation at Scale (Spark Streaming)
  4. Taiji: Managing Global User Traffic for Large-Scale Internet Services at the Edge (Facebook)
  5. StreamScope: Continuous Reliable Distributed Processing of Big Data Streams (Microsoft)

Ryan, Jingwen, Weiwei
11/22 Hardware
A Cloud-Scale Acceleration Architecture (Microsoft FPGA)
Amazon Nitro (esp. the video talk on that page)
Questions

  1. One of the unique designs of Catapult V2 (this paper) is to place the FPGA as a "bump in the wire". Discuss the pros and cons of this design vs. 1) a design that swaps the locations of the NIC and the FPGA, and 2) a design without a NIC.
  2. Microsoft uses FPGAs for its own datacenter services but does not offer them directly to cloud customers. On the other hand, Amazon and Alibaba both offer FPGAs as a cloud service (called F1 and F3, respectively). Do you think FPGAs should be used only for datacenter-internal purposes (like Microsoft), only as a cloud offering, or both? Choose one of these three options to advocate for and briefly discuss its potential challenges and benefits.
  3. With Amazon Nitro, virtualization functions are mostly offloaded to hardware. Do we still need a hypervisor (or an OS)? Can everything just run in user space and interact with the Nitro cards directly?
  4. Instead of building different ASIC cards for different functionalities (the approach Amazon is taking with Nitro), one could also use the same FPGA cards configured differently for different functionalities (the Microsoft approach). Discuss the pros and cons of the two approaches (e.g., performance, dollar cost, etc.).

Additional Readings

  1. A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services (Microsoft Catapult V1)
  2. In-Datacenter Performance Analysis of a Tensor Processing Unit (ISCA'17)
  3. KV-Direct: High-Performance In-Memory Key-Value Store with Programmable NIC
  4. Azure Accelerated Networking: SmartNICs in the Public Cloud
  5. FPGAs in the Cloud: Should you Rent or Buy FPGAs for Development and Deployment?

Shu-Ting, Yizhou, Xuhao
12/4 Case Study: Databricks and an Interview with Ali Ghodsi (Databricks CEO)
Course Summary
Hints for Computer System Design -- Butler Lampson
Questions

Read the "Hints for Computer System Design" paper and summarize what you have learned over the course. Feel free to write about anything else you want to comment on the course.

Additional Readings

  1. Databricks Blog

Yiying
12/6 Project Presentations