Eugene Wu


Eugene Wu is broadly interested in technologies that help users play with their data. His goal is for users at all technical levels to effectively and quickly make sense of their information. He is interested in solutions that ultimately improve the interface between users and data, and uses techniques borrowed from fields such as data management, systems, crowd sourcing, visualization, and HCI. Eugene Wu received his Ph.D. from MIT, B.S. from Cal, and was a postdoc in the AMPLab. A profile, an obit.

Eugene Wu has received the VLDB 2018 10-year test of time award, best-of-conference citations at ICDE and VLDB, the SIGMOD 2016 best demo award, the NSF CAREER, and the Google, Adobe, and Amazon faculty awards.

The WuLab Website & Blog

Overview of My Research and Teaching

Joining The Lab
PhDs and Postdocs: email me thoughts on extensions/project ideas, include "bananas" in subject line.
Interns + UGrad + Masters: please contact and talk to the graduate students in the lab.

All Publications (Show Selected)

  1. The Fast and the Private: Task-based Dataset Search
    Zezhou Huang, Jiaxiang Liu, Haonan Wang, Eugene Wu
    CIDR 2024
  2. JoinBoost: Grow Trees Over Normalized Data Using Only SQL
    Zezhou Huang, Rathijit Sen, Jiaxiang Liu, Eugene Wu
    VLDB 2023
  3. DIG: The Data Interface Grammar
    Yiru Chen, Jeffrey Tao, Eugene Wu
    HILDA at SIGMOD 2023
  4. PI2: Generating Visual Analysis Interfaces From Queries
    Yiru Chen, Eugene Wu
    SIGMOD 2022
  5. View Composition Algebra for Ad Hoc Comparisons
    Eugene Wu
    TVCG 2022
  6. Complaint-Driven Training Data Debugging at Interactive Speeds
    Lampros Flokas, Young Wu, Jiannan Wang, Nakul Verma, Eugene Wu
    SIGMOD 2022
  7. Explaining SQL-ML Queries with Bayesian Optimization
    Brandon Lockhard, Jiannan Wang, Eugene Wu
    VLDB 2021
  8. From Cleaning Before ML to Cleaning For ML
    Felix Neutatz, Binger Chen, Ziawasch Abedjan, Eugene Wu
    Invited, IEEE Data Engineering Bulletin 2021
  9. Continuous Prefetch for Interactive Data Applications
    Haneen Mohammed, Ziyun Wei, Ravi Netravali, Eugene Wu
    VLDB 2020 Talk Video Blogpost
  10. Complaint-driven Training Data Debugging for Query 2.0
    Young Wu, Lampros Flokas, Jiannan Wang, Eugene Wu
    SIGMOD 2020 Talk Video Blogpost
  11. Monte Carlo Tree Search for Generating Interactive Data Analysis Interfaces
    Yiru Chen, Eugene Wu
    Intelligent Process Automation (IPA) 2020
  12. AlphaClean: Automatic Generation of Data Cleaning Pipelines
    Sanjay Krishnan, Eugene Wu
    ArXiv 2019
  13. Towards Democratizing Relational Data Visualization
    Nan Tang, Eugene Wu, Guoliang Li
    SIGMOD 2019 Tutorial
  14. Precision Interfaces
    Qianrui Zhang, Haoci Zhang, Viraj Rai, Thibault Sellam, Eugene Wu
    SIGMOD 2019
  15. DeepBase: Deep Inspection of Neural Networks
    Thibault Sellam, Kevin Lin, Ian Yiran Huang, Michelle Yang, Carl Vondrick, Eugene Wu
    SIGMOD 2019
  16. Ten Years of Web Tables
    Michael Cafarella, Alon Halevy, Daisy Zhe Wang, Hongrae Lee, Jayant Madhavan, Cong Yu, Eugene Wu
    PVLDB 2018 Invited Paper,
  17. Provenance in Interactive Visualizations
    Fotis Psallidas, Eugene Wu
    HILDA 2018
  18. Leveraging Quality Prediction Models for Automatic Writing Feedback
    Hamed Nilforoshan, Eugene Wu
    ICWSM 2018
  19. Smoke: Fine-grained Lineage at Interactive Speeds
    Fotis Psallidas, Eugene Wu
    VLDB 2018
  20. Combining Design and Performance in a Data Visualization Management System
    Eugene Wu, Fotis Psallidas, Zhengjie Miao, Haoci Zhang, Laura Rettig, Yifan Wu, Thibault Sellam
    CIDR 2017
  21. QFix: Diagnosing errors through query histories
    Xiaolan Wang, Alexandra Meliou, Eugene Wu
    SIGMOD 2017
  22. PFunk-H: Approximate Query Processing using Perceptual Models
    Daniel Alabi, Eugene Wu
    HILDA 2016
  23. ActiveClean: Interactive Data Cleaning While Learning Convex Loss Models
    Sanjay Krishnan, Jiannan Wang, Eugene Wu, Michael J. Franklin, Ken Goldberg
    Arxiv 2016
  24. Explaining Data in Visual Analytic Systems
    Eugene Wu
    Doctoral Thesis 2015
  25. The Case for Data Visualization Management Systems
    Eugene Wu, Leilani Battle, Samuel Madden
    VLDB 2014
  26. Scorpion: Explaining Away Outliers in Aggregate Queries
    Eugene Wu, Samuel Madden
    VLDB 2013 (Best-of) Slides
  27. SubZero: a Fine-Grained Lineage System for Scientific Databases
    Eugene Wu, Samuel Madden, Michael Stonebraker
    ICDE 2013 (Best-of)
  28. Human-powered Sorts and Joins
    Adam Marcus, Eugene Wu, David Karger, Samuel Madden, Robert Miller
    VLDB 2012
  29. Relational Cloud: A Database-as-a-Service for the Cloud
    Carlo Curino, Evan Jones, Raluca Popa, Nirmesh Malviya, Eugene Wu, Sam Madden, Hari Balakrishnan, Nickolai Zeldovich
    CIDR 2011
  30. High-performance complex event processing over streams
    Eugene Wu, Yanlei Diao, Shariq Rizvi
    SIGMOD 2006