Matei Zaharia
I'm a third year PhD student at UC Berkeley in the RAD Lab, working with Scott Shenker and Ion Stoica. I'm interested in computer systems and networking.
I'm currently working on Nexus, a common substrate for cluster computing, as well as projects in Internet architecture and security.
In the past, I've worked on several research projects involving Hadoop, the open-source MapReduce framework from Apache. In 2007-2008, I worked on tracing Hadoop to detect hardware and software faults. More recently, I worked on algorithms for speculative execution to mitigate slow nodes (paper at OSDI 08). In the summer of 2008, I interned at Facebook, where I built the Hadoop fair scheduler. I also became a committer on Hadoop in January 2009.
I got my Bachelor's degree from the University of Waterloo, in Canada. In undergrad, I worked with Srinivasan Keshav on peer-to-peer systems, opportunistic wireless communication, and low-cost, delay-tolerant networking for developing regions (the KioskNet project).
Contact
493B Soda Hall,
Berkeley, CA 94720-1776
Publications
2009
B. Hindman, A. Konwinski, M. Zaharia and I. Stoica, A Common Substrate for Cluster Computing, HotCloud 2009, June 2009.
R. Luk, M. Zaharia, M. Ho, B. Levine and P. Aoki, ICTD for Healthcare in Ghana: Two Parallel Case Studies, ICTD 2009, April 2009.
2008
- M. Zaharia, A. Konwinski, A.D. Joseph, R. Katz and I. Stoica, Improving MapReduce Performance in Heterogeneous Environments, OSDI 2008, December 2008.
2007
- S. Guo, M.H. Falaki, E.A. Oliver, S. Ur Rahman, A. Seth, M. Zaharia, U. Ismail, and S. Keshav, Design and Implementation of the KioskNet System, ICTD 2007, December 2007.
- S. Guo, M.H. Falaki, E.A. Oliver, S. Ur Rahman, A. Seth, M. Zaharia, and S. Keshav, Very Low-Cost Internet Access Using KioskNet, ACM Computer Communication Review, October 2007.
- M. Zaharia and S. Keshav, Gossip-based Search Selection in Hybrid Peer-to-Peer Networks, J. Concurrency and Computation: Practice and Experience, 2007.
- M. Zaharia, A. Chandel, S. Saroiu, and S. Keshav, Finding Content in File-Sharing Networks When You Can't Even Spell, Proc. IPTPS, February 2007.
2006
- A. Seth, D. Kroeker, M. Zaharia, S. Guo, S. Keshav, Low-cost Communication for Rural Internet Kiosks Using Mechanical Backhaul, Proc. MOBICOM 2006, September 2006.
- M. Zaharia and S. Keshav, Gossip-Based Search Selection in Hybrid Peer-to-Peer Networks, Proc. IPTPS, February 2006.
Technical Reports
- B. Hindman, A. Konwinski, M. Zaharia, A. Ghodsi, A.D. Joseph, S. Shenker, and I. Stoica, Nexus: A Common Substrate for Cluster Computing, UC Berkeley Technical Report UCB/EECS-2009-158, November 2009.
- G. Ananthanarayanan, K. Heimerl, M. Zaharia, M. Demmer, T. Koponen, A. Tavakoli, S. Shenker, and I. Stoica, Enabling Innovation Below the Communication API, UC Berkeley Technical Report UCB/EECS-2009-141, October 2009.
- M. Zaharia, D. Borthakur, J. Sen Sarma, K. Elmeleegy, S. Shenker, and I. Stoica, Job Scheduling for Multi-User MapReduce Clusters, UC Berkeley Technical Report UCB/EECS-2009-55, April 2009.
- M. Armbrust, A. Fox, R. Griffith, A.D. Joseph, R.H. Katz, A. Konwinski, G. Lee, D.A. Patterson, A. Rabkin, I. Stoica and M. Zaharia, Above the Clouds: A Berkeley View of Cloud Computing, UC Berkeley Technical Report UCB/EECS-2009-28, February 2009.
- S. Guo, M.H. Falaki, U. Ismail, E.A. Oliver, S. Ur Rahman, A. Seth, M. Zaharia, and S. Keshav, Design and Implementation of the KioskNet System (Extended Version), University of Waterloo Technical Report CS-2007-40, November 2007.
- M. Zaharia and S. Keshav, Fast and Optimal Scheduling Over Multiple Network Interfaces, University of Waterloo Technical Report CS-2007-36, October 2007.
- M. Zaharia and S. Keshav, Adaptive Peer-to-Peer Search, University of Waterloo Technical Report 2004-55, November 2004.
Talks
- Cloud Computing with MapReduce and Hadoop (ppt, pdf), Berkeley Par Lab Boot Camp 2009, Berkeley, CA, August 2009.
- Job Scheduling with the Fair and Capacity Schedulers (ppt, pdf), Hadoop Summit 2009, Santa Clara, CA, June 2009.
- Job Scheduling for MapReduce (ppt, pdf), Microsoft Research Silicon Valley, Mountain View, CA, January 2009.
- Improving MapReduce Performance in Heterogeneous Environments (ppt, pdf), OSDI 2008, San Diego, CA, December 2008.
Other Activities
Starting in high school, I've participated in a number of programming contests, including the International Olympiad in Informatics and the ACM International Collegiate Programming Contest. I've now stopped doing contests, but I still love algorithmic and mathematical problems.
In undergrad, I contributed to the open-source realtime strategy game 0 A.D., where I worked on gameplay logic, random map generation, water rendering, and multiplayer networking.
I enjoy reading, nature, and food that is either good or free.