Blog
A repository of reviews of research papers I found interesting. Many of the reviews date from 2018 and reflect my views from when I was still at CMU. However, after a year of working with some of the smartest minds in data systems at LinkedIn, I have a newfound perspective on how to think about systems at scale. I plan to re-review some of the major papers and to add reviews of new papers I have been reading (mostly on large-scale data and machine learning systems). This blog also contains our technical discussions from the Mountain View CS Reading Groups.
- 2019
- MTV CS Reading Group: Data structures that power your database
- 2018
- An Updated Performance Comparison of Virtual Machines and Linux Containers: Review
- Xen and the Art of Virtualization: Review
- The Evolution of an x86 Virtual Machine Monitor: Review
- The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines: Review
- Large-scale cluster management at Google with Borg: Review
- Log-structured Memory for DRAM-based Storage: Review
- TAO: Facebook's Distributed Data Store for the Social Graph: Review
- Scaling Memcache at Facebook: Review
- Spanner: Google's Globally-Distributed Database: Review
- Bigtable: A Distributed Storage System for Structured Data: Review
- The Chubby Lock Service for Loosely-Coupled Distributed Systems: Review
- Paxos Made Live - An Engineering Perspective: Review
- Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing: Review
- TensorFlow: A system for large-scale machine learning: Review
- The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing: Review
- MapReduce: Simplified Data Processing on Large Clusters: Review
- S4: Distributed Stream Computing Platform: Review
- The Google File System: Review
- Combined reviews for (1) The Anatomy of a Large-Scale Hypertextual Web Search Engine and (2) The PageRank Citation Ranking: Bringing Order to the Web
- Measuring the capacity of a Web server under realistic loads: Review
- kqueue versus epoll: A Performance Comparison: Review
- Comparing the Performance of Web Server Architectures: Review
- Flash: An efficient and portable Web server: Review
- End-to-End Arguments in System Design: Review