Course |
Credit |
Prerequisites |
E6901-005 |
Flexible: 1-12 units |
COMS
W3137/9 (Data Structures and Algorithms) |
W3998-005 |
Flexible: 1-3 units |
COMS
W3137/9 (Data Structures and Algorithms) |
W4901-005 |
Flexible: 1-3 units |
COMS
W3137/9 (Data Structures and Algorithms) |
Note: 1 unit of credit corresponds roughly to 3 hours of work per week.
Instructor |
Luis Gravano |
Office |
706 Schapiro CEPSR |
Telephone |
(212) 939-7064 |
Home page |
Distributed search over text databases and web resources in general; heterogeneous database integration; information extraction from web resources; text mining. Research projects developed in my group include:
GeoSearch, a geographically-aware search engine.
Snowball, an information-extraction system.
QProber, a system for automatically classifying "hidden-web" text databases.
SDARTS, a protocol and toolkit for metasearching.
Possible projects generally include developing various tools for searching for information on the Internet. Please email me (gravano@cs.columbia.edu) to schedule a meeting if you are interested. I am also open to suggestions. This is an exciting area to work in, and one of the most exciting things about it is that new problems and challenges appear every day. If you want to suggest a new problem/project to work on, you are more than welcome to come talk to me!