Stephen A. Edwards Columbia University Crown
COMS W4115
Programming Languages and Translators
Fall 2018

Lectures

Class meets Mondays and Wednesdays 4:10 - 5:25 PM 451 CSB.

Staff

Name Email Office hours Location
Prof. Stephen A. Edwards sedwards@cs.columbia.edu see my home page 462 CSB
John Hui jzh2106@columbia.edu T 3:30-5:30 487 CSB
Jennifer Bi jb3495@columbia.edu M 2-4 487 CSB
Mark Mazel mm4764@columbia.edu Th 5-7 487 CSB
Lauren Arnett lba2138@columbia.edu M 10-12 487 CSB
Dean Deng dd2563@columbia.edu Th 3-5 487 CSB
Justin Wong jw3354@columbia.edu T 12-2 487 CSB

Overview

The goal of PLT is to teach you both about the structure of computer programming languages and the basics of implementing compilers for such languages.

The course will focus mostly on traditional imperative and object-oriented languages, but will also cover functional and logic programming, concurrency issues, and some aspects of scripting languages. Homework and tests will cover language issues. You will design and implement a language of your own design in a semester-long team project.

While few of you will ever implement a full commercial compiler professionally, the concepts, techniques, and tools you will learn have broad application.

Prerequisites

COMS W3157 Advanced Programming: You will be dividing into teams to build a compiler, so you need to have some idea how to keep this under control. Quick test: you need to know about Makefiles and source code control systems.

COMS W3261 Computability and Models of Computation: You will need an understanding of formal languages and grammar to build the parser and lexical analyzer. Quick test: you must know about regular expressions, context-free grammars, and NFAs.

Schedule

Date Session Lecture Notes Reading Due
Wed Sep 5 1
Intro. to Languages
pdf
Ch 1, 2
Mon Sep 10 2
Language Processors
pdf
Ch. 2
Wed Sep 12 3
Some Outstanding Projects
pdf

Mon Sep 17 4
Programming in OCaml
pdf

Wed Sep 19 5
"

Proposal
Mon Sep 24 6
"

Wed Sep 26 7
Syntax and Parsing
pdf
Ch. 3, 4
Mon Oct 1 8
"

pdf HW1
Wed Oct 3 9
Guest: Prof. Ray
pdf

Mon Oct 8 10
"

Wed Oct 10 11
"

Thu Oct 11
(turn in homework only)

pdf HW2
Mon Oct 15 12
Review for Midterm
pdf

LRM, Parser
Wed Oct 17 Midterm Exam
Mon Oct 22 13
The MicroC Compiler
pdf
App. A
Wed Oct 24 14
"

Mon Oct 29 15
"

Wed Oct 31 16
"

Mon Nov 5 Election Day Holiday
Wed Nov 7 17
Runtime Environments
pdf
Ch. 7
Mon Nov 12 18
"

Wed Nov 14 19
Types and Static Semantics
pdf
Sec. 6.5
Hello World
Mon Nov 19 20
Code Generation
pdf

Wed Nov 21 Thanksgiving Holiday
Mon Nov 26 21
The Lambda Calculus
pdf

Wed Nov 28 22
"

Mon Dec 3 23
Prolog
pdf

Tue Dec 4
(turn in homework only)

pdf HW3
Wed Dec 5 24
Review for Final
pdf

Mon Dec 10 Final Exam
Wed Dec 19 Project Reports Due

Suggested Text

Alfred V. Aho, Monica Lam, Ravi Sethi, and Jeffrey D. Ullman.
Compilers: Principles, Techniques, and Tools.
Addison-Wesley, 2006. Second Edition.

The first edition was long the standard text on compilers; the second edition of the "dragon book" has now been updated and continues to be one of the more readable books on the topic. Columbia's own Prof. Al Aho is one of the authors.

Cover of the Dragon Book 2nd edition

Related Texts

Michael L. Scott.
Programming Language Pragmatics
Morgan Kaufmann, 2006. Second Edition.

A broad-minded book about languages in general, but has less on practical details of compiler construction.

Cover of Programming Language Pragmatics 2nd edition

Andrew W. Appel.
Modern Compiler Implementation in ML.
Cambridge University Press, 1998.

The opposite of Scott: focuses on compiler construction, not language design issues.
It uses the functional language ML, which is closely related to O'Caml, but just different enough to be annoying.

Cover of Appel

Lawrence C. Paulson
ML for the Working Programmer.
Cambridge University Press, 1996. Second edition.

A book about functional programming. It's written for the ML language, not O'Caml, but the two are closely related.

Cover of Paulson

Steven S. Muchnick
Advanced Compiler Design and Implementation.
Morgan Kaufmann, 1997.

A very extensive book on many aspects of compiler design. Starts about halfway through Appel and goes much farther. Recommended for serious compiler hackers only.

Cover of Muchnick

Objective Caml Resources

webpage The Caml Language Homepage. Compiler downloads and documentation. Start here.
webpage The Objective Caml System. Documentation and User's Manual for the whole system, including documentation for ocamllex, ocamlyacc, ocamldep, ocamldebug, and all the standard libraries.
webpage Emmanuel Chailloux, Pascal Manoury, and Bruno Pagano, Developing Applications with Objective Caml. An online book translated from the French (O'Reilly).
webpage Objective CAML Tutorial
.tar.gz file OCaml source for the four-function calculator.
.tar.gz file OCaml source and test cases for the MicroC language, which generates LLVM IR.

The Project

The focus of 4115 is the design and implementation of a little language. You will divide into teams and design the goals, syntax, and semantics of your language, and implement a compiler for your language.

Exception: CVN students will do the project individually.

Final Report Outline

This is a critical part of the project and will be a substantial fraction of the grade.

Include the following sections:

  1. Introduction
  2. Language Tutorial
  3. Language Manual
  4. Project Plan
  5. Architectural Design
  6. Test Plan
  7. Lessons Learned
  8. Appendix

Project Resources

pdf A two-page introduction to the CVS version control system. I strongly suggest you keep your project under some version control system.
pdf An excellent final report: the Funk language by 4115 students Naser AlDuaij, Senyao Du, Noura Farra, Yuan Kang, and Andrea Lottarini.
pdf An excellent final report: the Sheets language by 4115 students Benjamin Barg, Gabriel Blanco, Amelia Brunner, and Ruchir Khaitan.

Language Reference Manuals

pdf Dennis M. Ritchie, C Reference Manual
pdf Kernighan & Ritchie, The C Programming Language
pdf The C Language Reference Manual (SGI)
pdf Stroustrup, The C++ Programming Language
pdf The Java Language Specification
pdf The C# Language Specification

Projects

AP++: List language (MM)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Amit Patel
BitTwiddler: A language for binary data parsers (SE)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Bruno Martins
Casper: MicroC with strings (SE)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Michael Makris
Coral: Python-like language with gradual typing (LA) star
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Jacob Austin, Matthew Bowers, Rebecca Cawkwell, and Sanford Miller
FIRE: Fire Input Reinterpretation Language (JH)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Ayer Chan, Jason Konikow, Graham Patterson, Frank Spano, and Christopher Thomas
Fli-O: File I/O language (JH)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Matthew Chan, Gideon Cheruiyot, Justin Gross, and Eyob Tefera
GaE: Graphs Ain't Easy (DD)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Andrew Jones, Samara Nebel, and Kevin Zeng
Grape: Graph Manipulation Language (DD)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
James Kolsby, Nick Krasnoff, Po-Yu Wu, and Hyun Bin Yoo
Graphiti: Graph Language (DD)
pdfProposal pdfLRM pdfFinal Report UNRECOGNIZED FILETYPEProject Files
Emily Hao, Sydney Lee, Michal Porubcin, Andrew Quijano, and Alice Thum
Hippograph: High-Performance Parsing of Graphs (JB)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Benjamin Lewinter, Irina Mateescu, Harry Smith, and Yasunari Watanabe
IRIs: Workflow Description Language (JH)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Hanzhou Gu, Xuheng Li, Shulan TANG, and Pinxi Tai
MATRX: Matrix Language (DD)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Alana Anderson, Pearce Kieser, Katherine Pfleger, Julia Sheth, and Nicholas Sparks
MMM: Image Matrix Manipulation Language (LA)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Shenghao Jiang, Yixiong Ren, and Shikun Wang
MathLight: Matrix manipulation language (JB)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Mingye Chen, Chunli Fu, Yuli Han, and Boya Song
ProCSV: CSV File Processing Languages (JW)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Tahiya Chowdhury, Tabara Nosiba, and Tahsina Saosun
PyLit: Python-like language (JW)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Ryan Loprete
SCoLang: Contract Fulfillment Language (JB)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Sambhav Anand, Jackson Chen, Sushanth Raman, Varun Varahabhotla, and Kanishk Vashisht
SOSL: Set Operation Simplification Language (LA)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Ryan Chun, Garrison Grogan, Ryan Koning, and Trisha Maniar
SSOL: Simple Shape Oriented Language (MM)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Jeevan Farias, Daniel Mesko, and Madeleine Tipp
Shoo: Imperative and functional language (JW) star
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Claire Adams, Samurdha Jayasinghe Mudi, Cindy Le, and Crystal Ren
Tree++: Tree Manipulation Language (JW)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Allison Costa, Laura Matos, Jacob Penn, and Laura Smerling
Typescript-on-LLVM: Statically-typed scripting without the browser (SE)
pdfProposal pdfLRM pdfFinal Report ArchiveProject Files
Ratheet Pandya
Uni-Corn: Hardware Description Language (SE)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Maryam Aly, Adiza Awwal, David Lalo, Dan Sendik, and Gael Zendejas
VSCOde: Image Manipulation (MM)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Jessica Cheng, Anna Lu, Hana Mizuta, Spencer Yen, and Kenneth Yuan
ZEN: Graphics Language (MM)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Zoe Gordin, Eleanor Murguia, and Nadia Saleh
bawk: bad awk (JH)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Ashley An, Christine Hsu, Mel Sawyer, and Victoria Yang
text++: PDF Generation Language (SE)
pdfProposal pdfLRM pdfFinal Report pdfSlides ArchiveProject Files
Joi Anderson, Maria Javier, and Klarizsa Padilla

star My favorites

Grading

40 % Project
20 % Midterm
30 % Final
10 % Homework

Collaboration

You will collaborate with your own small team on the programming project, but you may not collaborate with others on homeworks. Teams may share ideas about the programming assignments, but not code. Any two teams found submitting similar code will receive zero credit for the whole assignment, and repeat offenses will be referred to the dean. See the Columbia CS department academic policies for more details.

Other

Valid HTML 4.01Valid CSS