Instructor: John Hewitt — Columbia University — Department of Computer Science
The first three weeks will consist of an intensive review of the mathematics and technical aspects of large language models---their architecture, pretraining, and alignment---as well as attempts to understand them. Then, students will present research papers to the rest of the class, which will jointly assess, critique, and extend those papers.
Lectures: Fridays, 13:10PM-2:00PM
Location: tbd
We'll use Ed for discussion forums, and Gradescope for assignment submission. You should have been added automatically to both. If you just enrolled, ping us to sync the Canvas roster.
This grading breakdown is provisional and subject to change.
Letter grades will be determined by the teaching staff as a function of the following breakdown; cutoffs for each letter grade will be decided at the end of the class, not by pre-set cutoffs. All written elements of the assignments, as well as the final project writeups, must be written in LaTeX and submitted as PDF.Attendance is required. In general, I expect you to be at effectively every lecture. However, I dislike grading on attendance, so there's no penalty for not attending, and I understand that everyone will need to miss a lecture or two.
Please see the grading section for our policies on AI tools in this class. Otherwise, please refer to the Faculty Statement on Academic Integrity and the Columbia University Undergraduate Guide to Academic Integrity.
The teaching team is committed to accomodating students with disabilities in line with the Faculty Statement on Disability Accommodations.