Sarah Ita Levitan

sarahita [AT] cs [DOT] columbia [DOT] edu

I am a 5th year PhD student in the Department of Computer Science at Columbia University. I am part of the Spoken Language Processing Group, directed by Dr. Julia Hirschberg. My current research involves identifying spoken cues to deception, and examining cultural and gender differences in the way that people communicate and perceive lies. I am currently funded through an NSF-GRFP fellowship and I am an IGERT fellow.

Last Spring I co-taught a course called Computational Models of Speech and Language, along with psychologist Dr. Michelle Levine.


Identifying Deceptive Speech Across Cultures

Thesis Project

The goal of this work is to automatically detect deception using acoustic-prosodic and lexical-syntactic cues. We are interested in exploring the factors that play a role in deception and deception detection, such as culture, gender, and personality. Toward that end, we have collected a large corpus of deceptive and non-deceptive speech, comprised of conversations between adult native speakers of American English and of Mandarin Chinese. We are applying machine learning techniques to automatically identify deceptive statements, and exploring individual differences between cultures, genders, and personalities in deceptive behavior.

Collaborators: Julia Hirschberg, Andrew Rosenberg, Michelle Levine, Guozhen An

September 2013 - present

Automatic Gender Identification from Speech

Interactions LLC

Automatic identification of speaker traits such as gender, age and emotional state from speech is an important problem for personalized speech-driven services. In this work, we present a novel approach that leverages pitch feature trajectories with the goal of identifying the speaker’s gender with as little speech as possible.

We use the f0 (fundamental frequency) trajectory, the most discriminative feature between male and female speech, but instead of computing summary statistics of the f0 trajectory, we use the entire trajectory as input to the classifier. We model these trajectories as “text” input with each token corresponding to the binned f0 value. Our results show that the trajectory approach can be useful for obtaining fairly accurate gender predictions with as little as one second of speech.

Collaborators: Taniya Mishra, Srinivas Bangalore

May 2015 - August 2015

Entrainment in Supreme Court Oral Arguments

CRA-W Distributed REU, Columbia University

In conversation, people tend to become similar to their dialogue partner by adopting lexical, acoustic, prosodic, and syntactic characteristics of the interlocutor’s speech. Research shows that this phenomenon, known as entrainment, is associated with task success and dialogue quality. We studied entrainment patterns in the Supreme Court corpus, and examined relationships between trial success and entrainment between lawyers and justices. We used Amazon Mechanical Turk to preprocess the data and excise noisy areas in the audio files that skew the analysis process. We found that lawyers entrain more than justices, supporting the theory that the less dominant interlocutor is more likely to entrain to the more dominant speaker.

Collaborators: Julia Hirschberg, Rivka Levitan

May 2013 - August 2013



  • Individual Differences in Deception and Deception Detection in Spoken Dialogue
    Sarah Ita Levitan and Julia Hirschberg
    Mid-Atlantic Student Colloquium on Speech, Language and Learning (MASC-SLL) 2016

  • Novel Feature Representation for Automatic Gender Identification from Speech
    Sarah Ita Levitan, Taniya Mishra and Srinivas Bangalore
    10th Annual Machine Learning Symposium, 2016

  • Entrainment in Supreme Court Oral Arguments
    Sarah Ita Levitan, Rivka Levitan and Julia Hirschberg
    Grace Hopper Conference 2012