I am a PhD student at UC Berkeley in the AMPLab and the Database Group, advised by Michael Franklin. Prior to Berkeley, I had three short engineering stints at Google, IBM, and Altera. I enjoy traveling and playing badminton and squash.
Below is a list of projects that I have worked on during my PhD:
Shark: An open-source SQL query engine. It uses Spark as the physical execution engine and can run HiveQL queries up to 100x faster than Hive without losing the fault-tolerance and scale-out properties of MapReduce.
GraphX: A distributed graph computation framework built on top of Spark. Its concise APIs enable users to express graph algorithms such as PageRank in 5 lines of code. It supports both interactive graph mining and efficient graph computations in a single runtime.
CrowdDB: A pioneering database system that incorporates crowd-sourced query processing. The project presents a vision in which humans are simply resources that database systems can use to answer queries.
Readings in Databases: I maintain an online list of papers essential to understanding database systems.