Luis Gravano's Curriculum Vitae

Last updated: March 14, 2024

Contact Information


Education

Professional Employment

Honors and Awards

Grants and Gifts

Patents

  1. Systems and Methods for Using Anchor Text as Parallel Corpora for Cross-Language Information Retrieval, L. Gravano and M. Henzinger, United States Patent 8,631,010, issued January 14, 2014 (continuation of United States Patents 7,146,358, 7,814,103, 7,996,402, and 8,190,608)
  2. Systems and Methods for Using Anchor Text as Parallel Corpora for Cross-Language Information Retrieval, L. Gravano and M. Henzinger, United States Patent 8,190,608, issued May 29, 2012 (continuation of United States Patents 7,146,358, 7,814,103, and 7,996,402)
  3. Systems and Methods for Using Anchor Text as Parallel Corpora for Cross-Language Information Retrieval, L. Gravano and M. Henzinger, United States Patent 7,996,402, issued August 9, 2011 (continuation of United States Patents 7,146,358 and 7,814,103)
  4. Systems and Methods for Using Anchor Text as Parallel Corpora for Cross-Language Information Retrieval, L. Gravano and M. Henzinger, United States Patent 7,814,103, issued October 12, 2010 (continuation of United States Patent 7,146,358)
  5. String Predicate Selectivity Estimation, S. Chaudhuri, V. Ganti, and L. Gravano, United States Patent 7,149,735, issued December 12, 2006
  6. Systems and Methods for Using Anchor Text as Parallel Corpora for Cross-Language Information Retrieval, L. Gravano and M. Henzinger, United States Patent 7,146,358, issued December 5, 2006
  7. Method of Building Multidimensional Workload-Aware Histograms, S. Chaudhuri, N. Bruno, and L. Gravano, United States Patent 7,007,039, issued February 28, 2006
  8. Method for Cost-Based Optimization over Multimedia Repositories, S. Chaudhuri and L. Gravano, United States Patent 5,806,061, issued September 8, 1998
  9. Method of Packet Routing in Torus Networks with Two Buffers per Edge, R. Cypher and L. Gravano, United States Patent 5,444,701, issued August 22, 1995

Editorships

Program Committees

Invited Talks

Invited Panels and Working Groups

Other Professional Activities

Papers in Refereed Journals

  1. Discovering Foodborne Illness in Online Restaurant Reviews, T. Effland, A. Lawson, S. Balter, K. Devinney, V. Reddy, H. Waechter, L. Gravano, and D. Hsu, in Journal of the American Medical Informatics Association, vol. 25, no. 12, pages 1586-1592, Dec. 2018.
  2. Fast and Accurate Time-Series Clustering, J. Paparrizos and L. Gravano, in ACM Transactions on Database Systems, vol. 42, no. 2, June 2017.
  3. Sampling Strategies for Information Extraction over the Deep Web, P. Barrio and L. Gravano, in Information Processing & Management, vol. 53, no. 2, pages 309-331, Mar. 2017.
  4. Predicting the Impact of Scientific Concepts Using Full-Text Features, K. McKeown and many others, in Journal of the Association for Information Science and Technology, vol. 67, no. 11, pages 2684-2696, Nov. 2016.
  5. Answering General Time-Sensitive Queries, W. Dakka, L. Gravano, and P. Ipeirotis, in IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 2, pages 220-235, Feb. 2012.
  6. Hip and Trendy: Characterizing Emerging Trends on Twitter, M. Naaman, H. Becker, and L. Gravano, in Journal of the American Society for Information Science and Technology, vol. 62, no. 5, pages 902-918, May 2011.
  7. Classification-Aware Hidden-Web Text Database Selection, P. Ipeirotis and L. Gravano, in ACM Transactions on Information Systems, vol. 26, no. 2, art. 6 (66 pages), Mar. 2008.
  8. Towards a Query Optimizer for Text-Centric Tasks, P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano, in ACM Transactions on Database Systems, vol. 32, no. 4, art. 21 (46 pages), Nov. 2007.
  9. Modeling and Managing Changes in Text Databases, P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano, in ACM Transactions on Database Systems, vol. 32, no. 3, art. 14 (38 pages), Aug. 2007.
  10. Optimizing Top-k Selection Queries over Multimedia Repositories, S. Chaudhuri, L. Gravano, and A. Marian, in IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 8, pages 992-1009, Aug. 2004.
  11. Evaluating Top-k Queries over Web-Accessible Databases, A. Marian, N. Bruno, and L. Gravano, in ACM Transactions on Database Systems, vol. 29, no. 2, pages 319-362, June 2004.
  12. Learning to Find Answers to Questions on the Web, E. Agichtein, S. Lawrence, and L. Gravano, in ACM Transactions on Internet Technology, vol. 4, no. 2, pages 129-162, May 2004.
  13. QProber: A System for Automatic Classification of Hidden-Web Databases, L. Gravano, P. Ipeirotis, and M. Sahami, in ACM Transactions on Information Systems, vol. 21, no. 1, pages 1-41, Jan. 2003.
  14. Top-k Selection Queries over Relational Databases: Mapping Strategies and Performance Evaluation, N. Bruno, S. Chaudhuri, and L. Gravano, in ACM Transactions on Database Systems, vol. 27, no. 2, pages 153-187, Jun. 2002.
  15. GlOSS: Text-Source Discovery over the Internet, L. Gravano, H. Garcia-Molina, and A. Tomasic, in ACM Transactions on Database Systems, vol. 24, no. 2, pages 229-264, Jun. 1999.
  16. The Stanford Digital Library Metadata Architecture, M. Baldonado, C.-C. K. Chang, L. Gravano, and A. Paepcke, in International Journal on Digital Libraries, vol. 1, no. 2, pages 108-121, Sep. 1997.
  17. Data Structures for Efficient Broker Implementation, A. Tomasic, L. Gravano, C. Lue, P. Schwarz, and L. Haas, in ACM Transactions on Information Systems, vol. 15, no. 3, pages 223-253, Jul. 1997.
  18. Storage-Efficient, Deadlock-Free Packet Routing Algorithms for Torus Networks, R. Cypher and L. Gravano, in IEEE Transactions on Computers, vol. 43, no. 12, pages 1376-1385, Dec. 1994.
  19. Requirements for Deadlock-Free, Adaptive Packet Routing, R. Cypher and L. Gravano, in SIAM Journal on Computing, vol. 23, no. 6, pages 1266-1274, Dec. 1994.
  20. Adaptive Deadlock- and Livelock-Free Routing with All Minimal Paths in Torus Networks, L. Gravano, G. Pifarre, P. Berman, and J. Sanz, in IEEE Transactions on Parallel and Distributed Systems, vol. 5, no. 12, pages 1233-1251, Dec. 1994.
  21. Adaptive Deadlock- and Livelock-Free Routing in the Hypercube Network, G. Pifarre, L. Gravano, G. Denicolay, and J. Sanz, in IEEE Transactions on Parallel and Distributed Systems, vol. 5, no. 11, pages 1121-1139, Nov. 1994.
  22. Fully Adaptive Minimal Deadlock-Free Packet Routing in Hypercubes, Meshes, and Other Networks: Algorithms and Simulations, G. Pifarre, L. Gravano, S. Felperin, and J. Sanz, in IEEE Transactions on Parallel and Distributed Systems, vol. 5, no. 3, pages 247-263, Mar. 1994.

Book Chapter

  1. XML & Data Streams, N. Bruno, L. Gravano, N. Koudas, and D. Srivastava. Chapter 4 in "Stream Data Management," edited by N. Chaudhry, K. Shaw, and M. Abdelguerfi, Series: Advances in Database Systems, Volume 30, pages 59-81, Springer, 2005.

Papers in Refereed Conferences

  1. Geospatial and Geosocial Dimensions of Foodborne Illness as Reflected in Yelp Restaurant Reviews, E. Shaveet, S. Chowdhury, D. Hsu, and L. Gravano, accepted to the 2024 International Conference on Social Media & Society, 2024.
  2. Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher, G. Karamanolakis, D. Hsu, and L. Gravano, in Proc. of Findings of the 2020 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP 2020), 2020.
  3. Leveraging Just a Few Keywords for Fine-Grained Aspect Detection Through Weakly Supervised Co-Training, G. Karamanolakis, D. Hsu, and L. Gravano, in Proc. of the 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019), 2019 (23.8% accepted).
  4. Ranking Deep Web Text Collections for Scalable Information Extraction, P. Barrio, L. Gravano, and C. Develder, in Proc. of the 24th ACM Conference on Information and Knowledge Management (CIKM 2015), 2015 (18% accepted in "long paper" category in Knowledge Management Track).
  5. k-Shape: Efficient and Accurate Clustering of Time Series, J. Paparrizos and L. Gravano, in Proc. of the 2015 ACM SIGMOD International Conference on Management of Data, 2015.
  6. Learning to Rank Adaptively for Scalable Information Extraction, P. Barrio, G. Simões, H. Galhardas, and L. Gravano, in Proc. of the 18th International Conference on Extending Database Technology (EDBT 2015), pages 241-252, 2015.
  7. When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms, G. Simões, H. Galhardas, and L. Gravano, in Proc. of the VLDB Endowment, vol. 6, no. 13, pages 1462-1473, 2013.
  8. Identifying Content for Planned Events Across Social Media Sites, H. Becker, D. Iter, M. Naaman, and L. Gravano, in Proc. of the 2012 ACM International Conference on Web Search and Data Mining (WSDM 2012), pages 533-542, 2012 (20.7% accepted; one of 30 papers, or 8.3% of submissions, selected for full-length plenary-session presentation).
  9. Beyond Trending Topics: Real-World Event Identification on Twitter, H. Becker, M. Naaman, and L. Gravano, in Proc. of the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), pages 438-441, 2011 (short 4-page "poster" paper).
  10. Selecting Quality Twitter Content for Events, H. Becker, M. Naaman, and L. Gravano, in Proc. of the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), pages 442-445, 2011 (short 4-page "poster" paper).
  11. Learning Similarity Metrics for Event Identification in Social Media, H. Becker, M. Naaman, and L. Gravano, in Proc. of the 2010 ACM International Conference on Web Search and Data Mining (WSDM 2010), pages 291-300, 2010 (15.5% accepted).
  12. Join Optimization of Information Extraction Output: Quality Matters!, A. Jain, P. Ipeirotis, A. Doan, and L. Gravano, in Proc. of the 25th IEEE International Conference on Data Engineering (ICDE 2009), pages 186-197, 2009 (16.8% accepted in "long paper" category).
  13. Answering General Time-Sensitive Queries, W. Dakka, L. Gravano, and P. Ipeirotis, in Proc. of the 17th ACM Conference on Information and Knowledge Management (CIKM 2008), pages 1437-1438, 2008 (short 2-page "poster" paper; 16% accepted in "poster" paper category).
  14. Optimizing SQL Queries over Text Databases, A. Jain, A. Doan, and L. Gravano, in Proc. of the 24th IEEE International Conference on Data Engineering (ICDE 2008), pages 636-645, 2008 (12.1% accepted in "full presentation" category).
  15. Efficient Summarization-Aware Search for Online News Articles, W. Dakka and L. Gravano, in Proc. of the 2007 ACM+IEEE Joint Conference on Digital Libraries (JCDL 2007), pages 63-72, 2007.
  16. Efficient Keyword Search Across Heterogeneous Relational Databases, M. Sayyadian, H. LeKhac, A. Doan, and L. Gravano, in Proc. of the 23rd IEEE International Conference on Data Engineering (ICDE 2007), pages 346-355, 2007 (19% accepted).
  17. SQL Queries Over Unstructured Text Databases, A. Jain, A. Doan, and L. Gravano, in Proc. of the 23rd IEEE International Conference on Data Engineering (ICDE 2007), pages 1255-1257, 2007 (short 3-page "poster" paper).
  18. To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks, P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano, in Proc. of the 2006 ACM SIGMOD International Conference on Management of Data, pages 265-276, 2006 ("Best Paper" Award; 13% accepted).
  19. Modeling and Managing Content Changes in Text Databases, P. Ipeirotis, A. Ntoulas, J. Cho, and L. Gravano, in Proc. of the 21st IEEE International Conference on Data Engineering (ICDE 2005), pages 606-617, 2005 ("Best Paper" Award; 13% accepted).
  20. When one Sample is not Enough: Improving Text Database Selection Using Shrinkage, P. Ipeirotis and L. Gravano, in Proc. of the 2004 ACM SIGMOD International Conference on Management of Data, pages 767-778, 2004 (16% accepted).
  21. Selectivity Estimation for String Predicates: Overcoming the Underestimation Problem, S. Chaudhuri, V. Ganti, and L. Gravano, in Proc. of the 20th IEEE International Conference on Data Engineering (ICDE 2004), pages 227-238, 2004 (14% accepted).
  22. Categorizing Web Queries According to Geographical Locality, L. Gravano, V. Hatzivassiloglou, and R. Lichtenstein, in Proc. of the 12th ACM Conference on Information and Knowledge Management (CIKM 2003), pages 325-333, 2003 (15% accepted).
  23. Efficient IR-Style Keyword Search over Relational Databases, V. Hristidis, L. Gravano, and Y. Papakonstantinou, in Proc. of the 29th International Conference on Very Large Data Bases (VLDB 2003), pages 850-861, 2003 (15% accepted).
  24. Text Joins in an RDBMS for Web Data Integration, L. Gravano, P. Ipeirotis, N. Koudas, and D. Srivastava, in Proc. of the 12th International World Wide Web Conference (WWW 2003), pages 90-101, 2003 (13% accepted).
  25. Querying Text Databases for Efficient Information Extraction, E. Agichtein and L. Gravano, in Proc. of the 19th IEEE International Conference on Data Engineering (ICDE 2003), pages 113-124, 2003 ("Best Student Paper" Award; 14% accepted).
  26. Navigation- vs. Index-Based XML Multi-Query Processing, N. Bruno, L. Gravano, N. Koudas, and D. Srivastava, in Proc. of the 19th IEEE International Conference on Data Engineering (ICDE 2003), pages 139-150, 2003 (14% accepted).
  27. Text Joins for Data Cleansing and Integration in an RDBMS, L. Gravano, P. Ipeirotis, N. Koudas, and D. Srivastava, in Proc. of the 19th IEEE International Conference on Data Engineering (ICDE 2003), pages 729-731, 2003 (short 3-page "poster" paper).
  28. Distributed Search over the Hidden-Web: Hierarchical Database Sampling and Selection, P. Ipeirotis and L. Gravano, in Proc. of the 28th International Conference on Very Large Data Bases (VLDB 2002), pages 394-405, 2002 (16% accepted).
  29. Evaluating Top-k Queries over Web-Accessible Databases, N. Bruno, L. Gravano, and A. Marian, in Proc. of the 18th IEEE International Conference on Data Engineering (ICDE 2002), pages 369-380, 2002 (19% accepted).
  30. Extending SDARTS: Extracting Metadata from Web Databases and Interfacing with the Open Archives Initiative, P. Ipeirotis, T. Barry, and L. Gravano, in Proc. of the Second ACM+IEEE Joint Conference on Digital Libraries (JCDL 2002), pages 162-170, 2002 (33% accepted).
  31. Approximate String Joins in a Database (Almost) for Free, L. Gravano, P. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava, in Proc. of the 27th International Conference on Very Large Data Bases (VLDB 2001), pages 491-500, 2001 (17% accepted).
  32. Probe, Count, and Classify: Categorizing Hidden Web Databases, P. Ipeirotis, L. Gravano, and M. Sahami, in Proc. of the 2001 ACM SIGMOD International Conference on Management of Data, pages 67-78, 2001 (15% accepted).
  33. STHoles: A Multidimensional Workload-Aware Histogram, N. Bruno, S. Chaudhuri, and L. Gravano, in Proc. of the 2001 ACM SIGMOD International Conference on Management of Data, pages 211-222, 2001 (15% accepted).
  34. SDLIP + STARTS = SDARTS: A Protocol and Toolkit for Metasearching, N. Green, P. Ipeirotis, and L. Gravano, in Proc. of the First ACM+IEEE Joint Conference on Digital Libraries (JCDL 2001), pages 207-214, 2001.
  35. PERSIVAL, a System for Personalized Search and Summarization over Multimedia Healthcare Information, K. McKeown, S.-F. Chang, J. Cimino, S. Feiner, C. Friedman, L. Gravano, V. Hatzivassiloglou, S. Johnson, D. Jordan, J. Klavans, A. Kushniruk, V. Patel, and S. Teufel, in Proc. of the First ACM+IEEE Joint Conference on Digital Libraries (JCDL 2001), pages 331-340, 2001.
  36. Learning Search Engine Specific Query Transformations for Question Answering, E. Agichtein, S. Lawrence, and L. Gravano, in Proc. of the 10th International World Wide Web Conference (WWW10), pages 169-178, 2001 (20% accepted).
  37. Computing Geographical Scopes of Web Resources, J. Ding, L. Gravano, and N. Shivakumar, in Proc. of the 26th International Conference on Very Large Data Bases (VLDB'00), pages 545-556, 2000 (15% accepted).
  38. An Investigation of Linguistic Features and Clustering Algorithms for Topical Document Clustering, V. Hatzivassiloglou, L. Gravano, and A. Maganti, in Proc. of the 23rd ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'00), pages 224-231, 2000 (25% accepted).
  39. Snowball: Extracting Relations from Large Plain-Text Collections, E. Agichtein and L. Gravano, in Proc. of the 5th ACM International Conference on Digital Libraries (DL'00), pages 85-94, 2000 (<33% accepted).
  40. Evaluating Top-k Selection Queries, S. Chaudhuri and L. Gravano, in Proc. of the 25th International Conference on Very Large Data Bases (VLDB'99), pages 399-410, 1999 (15% accepted).
  41. Merging Ranks from Heterogeneous Internet Sources, L. Gravano and H. Garcia-Molina, in Proc. of the 23rd International Conference on Very Large Data Bases (VLDB'97), pages 196-205, 1997 (15% accepted).
  42. Metadata for Digital Libraries: Architecture and Design Rationale, M. Baldonado, C.-C. K. Chang, L. Gravano, and A. Paepcke, in Proc. of the 2nd ACM International Conference on Digital Libraries (DL'97), pages 47-56, 1997 (27% accepted).
  43. STARTS: Stanford Proposal for Internet Meta-Searching, L. Gravano, C.-C. K. Chang, H. Garcia-Molina, and A. Paepcke, in Proc. of the 1997 ACM SIGMOD International Conference on Management of Data, pages 207-218, 1997 (21% accepted).
  44. dSCAM: Finding Document Copies across Multiple Databases, H. Garcia-Molina, L. Gravano, and N. Shivakumar, in Proc. of the 4th International Conference on Parallel and Distributed Information Systems (PDIS'96), pages 68-79, 1996 (18% accepted).
  45. Optimizing Queries over Multimedia Repositories, S. Chaudhuri and L. Gravano, in Proc. of the 1996 ACM SIGMOD International Conference on Management of Data, pages 91-102, 1996 (16% accepted).
  46. Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies, L. Gravano and H. Garcia-Molina, in Proc. of the 21st International Conference on Very Large Data Bases (VLDB'95), pages 78-89, 1995.
  47. Precision and Recall of GlOSS Estimators for Database Discovery, L. Gravano, H. Garcia-Molina, and A. Tomasic, in Proc. of the 3rd International Conference on Parallel and Distributed Information Systems (PDIS'94), pages 103-106, 1994 (short paper).
  48. The Effectiveness of GlOSS for the Text-Database Discovery Problem, L. Gravano, H. Garcia-Molina, and A. Tomasic, in Proc. of the 1994 ACM SIGMOD International Conference on Management of Data, pages 126-137, 1994 (15% accepted).
  49. Requirements for Deadlock-Free, Adaptive Packet Routing, R. Cypher and L. Gravano, in Proc. of the 11th ACM Symposium on Principles of Distributed Computing (PODC '92), pages 25-33, 1992.
  50. Adaptive, Deadlock-Free Packet Routing in Torus Networks with Minimal Storage, R. Cypher and L. Gravano, in Proc. of the 1992 International Conference on Parallel Processing (ICPP '92), pages 204-211, 1992 ("Most Original Paper" Award; 13% accepted).
  51. Adaptive Deadlock- and Livelock-Free Routing with All Minimal Paths in Torus Networks, P. Berman, L. Gravano, G. Pifarre, and J. Sanz, in Proc. of the 4th Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA '92), pages 3-12, 1992.
  52. Adaptive Deadlock-Free Worm-Hole Routing in Hypercubes, L. Gravano, G. Pifarre, G. Denicolay, and J. Sanz, in Proc. of the 6th International Parallel Processing Symposium (IPPS '92), pages 512-515, 1992 (short paper).
  53. Fully-Adaptive Routing: Packet Switching Performance and Worm-Hole Algorithms, S. Felperin, L. Gravano, G. Pifarre, and J. Sanz, in Proc. of Supercomputing '91, pages 654-663, 1991.
  54. Fully-Adaptive Minimal Deadlock-Free Packet Routing in Hypercubes, Meshes, and Other Networks, G. Pifarre, L. Gravano, S. Felperin, and J. Sanz, in Proc. of the 3rd Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA '91), pages 278-290, 1991 (19% accepted).

Papers in Refereed Workshops and Demonstration Sessions, and Other Refereed Publications and Presentations

  1. Quantifying the Effects of COVID-19 on Restaurant Reviews, I. Cao, Z. Liu, G. Karamanolakis, D. Hsu, and L. Gravano, in Proc. of the 9th International Workshop on Natural Language Processing for Social Media (SocialNLP@NAACL 2021), 2021.
  2. Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only, Z. Liu, G. Karamanolakis, D. Hsu, and L. Gravano, in Proc. of the 11th International Workshop on Health Text Mining and Information Analysis (LOUHI@EMNLP 2020), 2020.
  3. Weakly Supervised Attention Networks for Fine-Grained Opinion Mining and Public Health, G. Karamanolakis, D. Hsu, and L. Gravano, in Proc. of the 5th Workshop on Noisy User-Generated Text (W-NUT 2019), 2019.
  4. Training Neural Networks for Aspect Extraction Using Descriptive Keywords Only, G. Karamanolakis, D. Hsu, and L. Gravano, in Proc. of the 2nd Learning from Limited Labeled Data Workshop (LLD 2019), 2019.
  5. Detecting Foodborne Disease Outbreaks Using Social Media (demonstration), F. Psallidas, L. Gravano, and many others, in NYC Media Lab's Annual Summit, 2014.
  6. Information Extraction from Social Media for Public Health, N. Elhadad, L. Gravano, D. Hsu, S. Balter, V. Reddy, and H. Waechter, in KDD at Bloomberg Workshop, Data Frameworks Track (KDD 2014), 2014.
  7. REEL: A Relation Extraction Learning Framework (poster), P. Barrio, G. Simões, H. Galhardas, and L. Gravano, in Proc. of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2014), 2014.
  8. Using Online Reviews by Restaurant Patrons to Identify Unreported Cases of Foodborne Illness — New York City, 2012–2013, C. Harrison, M. Jorder, H. Stern, F. Stavinsky, V. Reddy, H. Hanson, H. Waechter, L. Lowe, L. Gravano, and S. Balter, in Centers for Disease Control and Prevention Morbidity and Mortality Weekly Report (CDC MMWR), vol. 63, no. 20, pages 441-445, May 2014.
  9. Quality Impact of Value Matching and Scoring in Top-k Entity Attribute Extraction, M. Solomon, L. Gravano, and C. Yu, in Proc. of the 5th International Workshop on Ranking in Databases (DBRank 2011), 2011.
  10. Automatic Identification and Presentation of Twitter Content for Planned Events (demonstration), H. Becker, F. Chen, D. Iter, M. Naaman, and L. Gravano, in Proc. of the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM 2011), pages 655-656, 2011.
  11. Popularity-Guided Top-k Extraction of Entity Attributes, M. Solomon, C. Yu, and L. Gravano, in Proc. of the ACM SIGMOD Workshop on the Web and Databases (WebDB 2010), 6 pages, 2010 (32% accepted).
  12. Exploiting Social Links for Event Identification in Social Media (poster), H. Becker, B. Xiao, M. Naaman, and L. Gravano, in Proc. of the 3rd Annual Workshop on Search in Social Media (SSM 2010), 2 pages, 2010.
  13. Event Identification in Social Media, H. Becker, M. Naaman, and L. Gravano, in Proc. of the ACM SIGMOD Workshop on the Web and Databases (WebDB 2009), 6 pages, 2009 (33% accepted).
  14. Modeling Query-Based Access to Text Databases, E. Agichtein, P. Ipeirotis, and L. Gravano, in Proc. of the ACM SIGMOD Workshop on the Web and Databases (WebDB 2003), pages 87-92, 2003 (25% accepted).
  15. QXtract: A Building Block for Efficient Information Extraction from Text Databases (demonstration), E. Agichtein and L. Gravano, in Proc. of the 2003 ACM SIGMOD International Conference on Management of Data, page 663, 2003 (30% accepted).
  16. Snowball: A Prototype System for Extracting Relations from Large Text Collections (demonstration), E. Agichtein, L. Gravano, J. Pavel, V. Sokolova, and A. Voskoboynik, in Proc. of the 2001 ACM SIGMOD International Conference on Management of Data, page 612, 2001 (~50% accepted).
  17. PERSIVAL Demo: Categorizing Hidden-Web Resources (demonstration), P. Ipeirotis, L. Gravano, and M. Sahami, in Proc. of the First ACM+IEEE Joint Conference on Digital Libraries (JCDL 2001), page 454, 2001.
  18. Automatic Classification of Text Databases through Query Probing, P. Ipeirotis, L. Gravano, and M. Sahami, in Proc. of the ACM SIGMOD Workshop on the Web and Databases (WebDB'00), pages 117-122, 2000 (29% accepted). Also in LNCS Series no. 1997, Springer, pages 245-255, 2001.
  19. Combining Strategies for Extracting Relations from Text Collections, E. Agichtein, E. Eskin, and L. Gravano, in Proc. of the ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2000), pages 86-95, 2000 (58% accepted).
  20. Exploiting Geographical Location Information of Web Pages, O. Buyukkokten, J. Cho, H. Garcia-Molina, L. Gravano, and N. Shivakumar, in Proc. of the ACM SIGMOD Workshop on the Web and Databases (WebDB'99), pages 91-96, 1999 (29% accepted).

Invited Papers

  1. k-Shape: Efficient and Accurate Clustering of Time Series, J. Paparrizos and L. Gravano, in SIGMOD Record, Special Issue on "2015 ACM SIGMOD Research Highlights," vol. 45, no. 1, pages 69-76, March 2016.
  2. Effective Event Identification in Social Media, F. Psallidas, H. Becker, M. Naaman, and L. Gravano, in IEEE Data Engineering Bulletin, vol. 36, no. 3, pages 42-50, September 2013.
  3. Building Query Optimizers for Information Extraction: The SQoUT Project, A. Jain, P. Ipeirotis, and L. Gravano, in SIGMOD Record, Special Issue on "Managing Information Extraction," vol. 37, no. 4, pages 28-34, December 2008.
  4. Query- vs. Crawling-based Classification of Searchable Web Databases, L. Gravano, P. Ipeirotis, and M. Sahami, in IEEE Data Engineering Bulletin, vol. 25, no. 1, pages 43-50, March 2002.
  5. Using q-grams in a DBMS for Approximate String Processing, L. Gravano, P. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, L. Pietarinen, and D. Srivastava, in IEEE Data Engineering Bulletin, vol. 24, no. 4, pages 28-34, December 2001.
  6. Simplifying Data Access: The Energy Data Collection Project, J. L. Ambite, Y. Arens, E. Hovy, A. Philpot, L. Gravano, V. Hatzivassiloglou, and J. Klavans, in IEEE Computer, vol. 34, no. 2, pages 47-54, February 2001.
  7. Database Research at Columbia University, S.-F. Chang, L. Gravano, G. Kaiser, K. Ross, and S. Stolfo, in SIGMOD Record, vol. 27, no. 3, pages 75-80, September 1998.
  8. Mediating and Metasearching on the Internet, L. Gravano and Y. Papakonstantinou, in IEEE Data Engineering Bulletin, vol. 21, no. 2, pages 28-36, June 1998.
  9. The Stanford InfoBus and Its Service Layers: Augmenting the Internet with Higher-Level Information Management Protocols, M. Roscheisen, M. Baldonado, C.-C. K. Chang, L. Gravano, S. Ketchpel, and A. Paepcke, in Digital Libraries in Computer Science: The MeDoc Approach, LNCS Series no. 1392, Springer, pages 213-230, 1998.
  10. Optimizing Queries over Multimedia Repositories, S. Chaudhuri and L. Gravano, in IEEE Data Engineering Bulletin, vol. 19, no. 4, pages 45-52, December 1996.
  11. Routing Techniques for Massively Parallel Communication, S. Felperin, L. Gravano, G. Pifarre, and J. Sanz, in Proceedings of the IEEE, vol. 79, no. 4, pages 488-503, April 1991.

Position Papers, Meeting Reports, and Miscellaneous Publications

  1. Using Restaurant Review Websites to Identify Unreported Complaints of Foodborne Illness, C. Harrison, M. Jorder, H. Stern, F. Stavinsky, V. Reddy, L. Gravano, and S. Balter. Poster in 2013 CSTE Annual Conference, Pasadena, California, June 2013.
  2. Characterizing Web Resources for Improved Search, L. Gravano. Position paper for the First NSF-DELOS Workshop on Information Seeking, Searching, and Querying in Digital Libraries, Zurich, Switzerland, December 2000.
  3. Resource Indexing and Discovery In a Globally Distributed Digital Library, L. Gravano. Position paper for the NSF-EU Digital Library Collaboratory Working Group, Budapest, Hungary, November 1997.
  4. Informal Internet Standards at Stanford, L. Gravano, C.-C. K. Chang, H. Garcia-Molina, and A. Paepcke. Position paper for the 1996 World-Wide Web Consortium (W3C) Distributed Indexing/Searching Workshop, May 1996.

Ph.D. Thesis Advising

Bridge to the Ph.D. Program Advising

Teaching at Columbia University

Other Educational Activities

University Service

Computer Science Department

School of General Studies

Continuing Education and Special Programs

Columbia College