Resume
5th Year, Ph.D. Candidate
Electrical Engineering and Computer Sciences
University of California, Berkeley
RADlab 465 Soda Hall, #1776
Berkeley, CA 94720-1776
Homepage: http://www.cs.berkeley.edu/~daisyw/

Advisors
Projects
- BayesStore: A data management system that natively supports SML (statistical machine learning) models for scalable advanced data analytics!
- DataSpace: Pay-as-you-go Data Integration System
News
- Sep/11/09 DBSeminar Fall 2009, with a Scalable Data Analytics theme, has kicked off! – I am playing the host!
- Sep/09/09 Our Declarative Information Extraction paper got accepted to ICDE2010 as a short paper!
- Aug/28/09 I received the 2009 Stonebraker/Wong Fellowship, which “recognizes outstanding students conducting database research, or research in successor fields in information science”!
- June/27/09 I am at SIGMOD 2009 in Providence. I am giving a talk at WebDB – hope to see you there!
- May/09/09 I gave several talks about BayesStore at Berkeley and Stanford to both the DB and ML communities! – Got some enthusiastic feedback from the Berkeley ML community!
- Mar/06/09 I started a blog, Data+Model+View, dedicated to Systems, Algorithms and Visualizations for Scalable Data Analytics using SML! – It’s about time!
Talks
- WebDB, June 2009, Functional Dependency Generation and Applications in Pay-As-You-Go Data Integration Systems webdb09slides
- Berkeley Machine Learning Tea, 8th May 2009, BayesStore: Supporting Statistical Models in Probabilistic Databases
- Stanford Info Lunch, 1st May 2009, Declarative Information Extraction in a Probabilistic Database System stanford09slides
- VLDB08, August 2008, BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models vldb08slides
- Berkeley Database Seminar, 2006, Probabilistic Complex Event Triggering (PCET)
Selected Papers
Declarative Information Extraction in a Probabilistic Database System icde10 TR-pdb-ie
Proceedings of ICDE (short paper), 2010
Daisy Zhe Wang, Eirinaios Michelakis, Minos Garofalakis, Michael J. Franklin, and Joseph M. Hellerstein
Functional Dependency Generation and Applications in Pay-as-you-go Data Integration Systems webdb09 webdb09slides TR-probFDgen
Proceedings of SIGMOD WebDB, 2009
Daisy Zhe Wang, Luna Dong, Anish Das Sarma, Michael J. Franklin, and Alon Halevy
BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models vldb08a vldb08slides
Proceedings of VLDB, 2008
Daisy Zhe Wang, Eirinaios Michelakis, Minos Garofalakis, and Joseph M. Hellerstein
WebTables: Exploring the Power of Tables on the Web vldb08b
Proceedings of VLDB, 2008
Michael Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, Yang Zhang
Supervising Students
Michael Zhang (M.S. student, fall 2009)
- designing an interface to BayesStore for application and model developers
Dwight Crow (undergrad, summer 2009), Long Wei (undergrad, fall 2009)
- working together on the DataSpace project to cluster millions of HTML table schemas
Open Source Projects
PG-ML (with Milenko Petrovic): a PostgreSQL wrapper for statistical machine learning libraries
A Parable of Modern Research
Bob has lost his keys in a room which is dark except for one brightly lit corner.
“Why are you looking under the light? You lost them in the dark!”
“I can only see here.”