6998 Section 3, NLP for the Web

Given the large amount of unstructured information on the web, whether text or spoken, natural language processing has the potential to have a large impact on accessing and harvesting information available on the web. In this class,we will focus on applications using natural language processing that either have already been developed or are currently topics of research. Some of these applications aim to make it easier for end users to navigate the web (e.g., summarization and question answering) while others aim to make it easier to more accurately process information on the web (e.g., paraphrasing and entailment). The class will cover the following topics:

This is a seminar style class and will focus on reading of research papers related to the class topics. Classes will alternate presentation with discussion; a list of questions for discussion will be provided before each class.

Students will be required to help in the presentation of one class. This may include presentation of a topic, submission of discussion questions for a paper, or leading discussion. In addition, students will design and carry out a semester long project. A list of possible projects will be provided by the professor, but students may also propose projects of their own, provided they are approved by the professor. Throughout the semester, students will submit incremental versions of their project. There may be a smaller assignment early in the semester. There will be no midterms or finals.

Students must have taken either Artificial Intelligence, Natural Language Processing, or Search Engine Technology as a pre-requisite.