Compilers
Principles, Techniques, & Tools
Second Edition
This page intentionally left blank
Compilers
Principles, Techniques, & Tools
Second Edition
Alfred V. Aho
Columbia University
Monica S. Lam
Stanford University
Ravi Sethi
Avaya
.
Jeffrey D. Ullman
Stanford University
Publisher
executive Editor
acquisitions Editor
project Editor
associate Managing Editor
cover Designer
digital Assets Manager
media Producer
senior Marketing Manager
marketing Assistant
senior Author Support/
technology Specialist
Senior Manufacturing Buyer
Greg Tobin
Michael Hirsch
Matt Goldstein
Katherine Harutunian
Jeffrey Holcomb
Joyce Cosentino Wells
Marianne Groth
Bethany Tidd
Michelle Brown
Sarah Milmore
cover Image
Scott Ullman of Strange Tonic Productions
www.strangetonic.com)
Joe Vetere
Carol Melville
any of the designations used by manufacturers and sellers to distinguish their
products are claimed as trademarks. Where those designations appear in this
book, and Addison-Wesley was aware of a trademark claim, the designations
have been printed in initial caps or all caps.
this interior of this book was composed in LATEX.
Library of Congress Cataloging-in-Publication Data
Compilers : principles, techniques, and tools / Alfred V. Aho ... [et al.]. -- 2nd ed.
. cm.
ev. ed. of: Compilers, principles, techniques, and tools / Alfred V. Aho, Ravi
ethi, Jeffrey D. Ullman. 1986.
SBN 0-321-48681-1 (alk. paper)
. Compilers (Computer programs) I. Aho, Alfred V. II. Aho, Alfred V.
Compilers, principles, techniques, and tools.
A76.76.C65A37 2007
05.4'53--dc22
006024333
copyright © 2007 Pearson Education, Inc. All rights reserved. No part of this
publication may be reproduced, stored in a retrieval system, or transmitted, in
any form or by any means, electronic, mechanical, photocopying, recording, or
otherwise, without the prior written permission of the publisher. Printed in the
United States of America. For information on obtaining permission for use of
material in this work, please submit a written request to Pearson Education,
inc., Rights and Contracts Department, 75 Arlington Street, Suite 300, Boston,
CA 02116, fax your request to 617-848-7047, or e-mail at
http://www.pearsoned.com/legal/permissions.htm.
 2 3 4 5 6 7 8 9 10—CW—10 09 08 07 06
Preface
in the time since the 1986 edition of this book, the world of compiler design
has changed signicantly. Programming languages have evolved to present new
compilation problems. Computer architectures oer a variety of resources of
which the compiler designer must take advantage. Perhaps most interestingly,
the venerable technology of code optimization has found use outside compilers.
it is now used in tools that nd bugs in software, and most importantly, nd
security holes in existing code. And much of the \front-end" technology |
grammars, regular expressions, parsers, and syntax-directed translators | are
still in wide use.
thus, our philosophy from previous versions of the book has not changed.
we recognize that few readers will build, or even maintain, a compiler for a
major programming language. Yet the models, theory, and algorithms associated with a compiler can be applied to a wide range of problems in software
design and software development. We therefore emphasize problems that are
most commonly encountered in designing a language processor, regardless of
the source language or target machine.
use of the Book
it takes at least two quarters or even two semesters to cover all or most of the
material in this book. It is common to cover the rst half in an undergraduate
course and the second half of the book | stressing code optimization | in
 second course at the graduate or mezzanine level. Here is an outline of the
chapters:
chapter 1 contains motivational material and also presents some background
issues in computer architecture and programming-language principles.
chapter 2 develops a miniature compiler and introduces many of the important concepts, which are then developed in later chapters. The compiler itself
appears in the appendix.
chapter 3 covers lexical analysis, regular expressions, nite-state machines, and
scanner-generator tools. This material is fundamental to text-processing of all
ports.
vi
PREFACE
chapter 4 covers the major parsing methods, top-down (recursive-descent, LL)
and bottom-up (LR and its variants).
chapter 5 introduces the principal ideas in syntax-directed denitions and
intermediate code for a typical programming language.
chapter 7 covers run-time environments, especially management of the run-time
stack and garbage collection.
chapter 8 is on object-code generation. It covers construction of basic blocks,
generation of code from expressions and basic blocks, and register-allocation
techniques.
chapter 9 introduces the technology of code optimization, including 
ow graphs,
ata-
ow frameworks, and iterative algorithms for solving these frameworks.
chapter 10 covers instruction-level optimization. The emphasis is on the extraction of parallelism from small sequences of instructions and scheduling them
in single processors that can do more than one thing at once.
chapter 11 talks about larger-scale parallelism detection and exploitation. Here,
the emphasis is on numeric codes that have many tight loops that range over
multidimensional arrays.
hapter 12 is on interprocedural analysis. It covers pointer analysis, aliasing,
and data-
ow analysis that takes into account the sequence of procedure calls
that reach a given point in the code.
courses from material in this book have been taught at Columbia, Harvard,
and Stanford. At Columbia, a senior/rst-year graduate course on programming languages and translators has been regularly oered using material from
the rst eight chapters. A highlight of this course is a semester-long project
in which students work in small teams to create and implement a little language of their own design. The student-created languages have covered diverse
application domains including quantum computation, music synthesis, computer graphics, gaming, matrix operations and many other areas. Students use
compiler-component generators such as ANTLR, Lex, and Yacc and the syntaxdirected translation techniques discussed in chapters two and ve to build their
compilers. A follow-on graduate course has focused on material in Chapters 9
through 12, emphasizing code generation and optimization for contemporary
machines including network processors and multiprocessor architectures.
at Stanford, a one-quarter introductory course covers roughly the material in Chapters 1 through 8, although there is an introduction to global code
optimization from Chapter 9. The second compiler course covers Chapters 9
through 12, plus the more advanced material on garbage collection from Chapter 7. Students use a locally developed, Java-based system called Joeq for
implementing data-
ow analysis algorithms.
PREFACE
ii
Prerequisites
the reader should possess some \computer-science sophistication," including
it least a second course on programming, and courses in data structures and
discrete mathematics. Knowledge of several dierent programming languages
is useful.
exercises
the book contains extensive exercises, with some for almost every section. We
indicate harder exercises or parts of exercises with an exclamation point. The
hardest exercises have a double exclamation point.
gradiance On-Line Homeworks
 feature of the new edition is that there is an accompanying set of on-line
homeworks using a technology developed by Gradiance Corp. Instructors may
assign these homeworks to their class, or students not enrolled in a class may
enroll in an \omnibus class" that allows them to do the homeworks as a tutorial
without an instructor-created class). Gradiance questions look like ordinary
questions, but your solutions are sampled. If you make an incorrect choice you
are given specic advice or feedback to help you correct your solution. If your
instructor permits, you are allowed to try again, until you get a perfect score.
 subscription to the Gradiance service is oered with all new copies of this
text sold in North America. For more information, visit the Addison-Wesley
web site www.aw.com/gradiance or send email to computing@aw.com.
support on the World Wide Web
the book's home page is
dragonbook.stanford.edu
there, you will nd errata as we learn of them, and backup materials. We hope
to make available the notes for each oering of compiler-related courses as we
teach them, including homeworks, solutions, and exams. We also plan to post
descriptions of important compilers written by their implementers.
Acknowledgements
cover art is by S. D. Ullman of Strange Tonic Productions.
Jon Bentley gave us extensive comments on a number of chapters of an
Earlier draft of this book. Helpful comments and errata were received from:
viii
PREFACE
Domenico Bianculli, Peter Bosch, Marcio Buss, Marc Eaddy, Stephen Edwards,
Vibhav Garg, Kim Hazelwood, Gaurav Kc, Wei Li, Mike Smith, Art Stamness,
Krysta Svore, Olivier Tardieu, and Jia Zeng. The help of all these people is
gratefully acknowledged. Remaining errors are ours, of course.
in addition, Monica would like to thank her colleagues on the SUIF compiler team for an 18-year lesson on compiling: Gerald Aigner, Dzintars Avots,
Saman Amarasinghe, Jennifer Anderson, Michael Carbin, Gerald Cheong, Amer
Diwan, Robert French, Anwar Ghuloum, Mary Hall, John Hennessy, David
Heine, Shih-Wei Liao, Amy Lim, Benjamin Livshits, Michael Martin, Dror
Maydan, Todd Mowry, Brian Murphy, Jerey Oplinger, Karen Pieper, Martin Rinard, Olatunji Ruwase, Constantine Sapuntzakis, Patrick Sathyanathan,
Michael Smith, Steven Tjiang, Chau-Wen Tseng, Christopher Unkel, John
Whaley, Robert Wilson, Christopher Wilson, and Michael Wolf.
