Welcome

I am a PostDoc in the Computer Science Devision at UC Berkeley, working in the AMP Lab on large-scale data management applications and infrastructure. Currently, my research focuses on the intersection between data management for analytics, machine learning, and crowd sourcing. During my PhD at ETH Zurich inside the Systems Group, I have been working on data management in the cloud and stream processing. I co-authored one of the first works taking a database perspective on cloud computing. Since then, I worked on various topics related to large-scale data management in the cloud, such as Consistency Rationing (an economic model trading consistency against cost), Smoky (a scalable and adaptable cloud streaming system) and a new database architecture and consistency protocols which utilize existing cloud services

Research Topics

  • Data management in the cloud
  • Hybrid human-machine data management systems
  • Infrastructure for cloud-scale analytics and machine learning
  • New consistency and concurrency control models
  • Data streams/continuous analytics
  • XML query processing

Research Projects

In the following a list of my current and past research projects:

  • CrowdDB - Answering Queries with Crowdsourcing
  • AMPLab - Algorithms, Machines & People
  • Cloudy/Smoky - a distributed storage and streaming service in the cloud
  • Building a database on cloud infrastructure
  • CloudBench - a benchmark for the cloud
  • Zorba - a general purpose XQuery processor implementing in C++
  • MXQuery - A lightweight, full-featured Java XQuery Engine
  • Mapping Data to Queries (MDQ) - data integration with XQuery
  • XQIB - XQuery In the Browser

University of California at Berkeley
Computer Science Division, EECS
465 Soda Hall #1776
Berkeley, CA 94720

Email:
Phone: +1 (510) 926-5856
Fax: +1 (510) 643-7352

News/Events