I am a fourth-year PhD student in the Department of Computer Science at Columbia University. I am affiliated with the Natural Language Processing Group and the Center for Computational Learning Systems. My advisor is Dr. Mona Diab. I received my B.S. in computer science from Sun Yat-sen University in 2007 and my M.S. from Columbia University in 2009.


News

  • 05/12/2013 Released the data and code for my ACL 2013 paper.
  • 04/24/2013 Gave a guest lecture at Indiana University, in my friend Muhammad Abdul-Mageed's class on social media mining.
  • 04/07/2013 One paper accepted at ACL 2013.
  • 02/14/2013 One paper accepted at NAACL 2013.
  • 01/21/2013 Started interning at Honda Research.
  • 11/19/2012 Passed the candidacy exam!
  • 10/19/2012 Talk at UMD, College Park.
  • 10/13/2012 Released the code for Weighted Textual Matrix Factorization from my ACL 2012 paper.


Research

My research interests lie in natural language processing and machine learning. They fall into the following topics:

  • topic models and matrix factorization
  • lexical semantics (word sense disambiguation)
  • social media analysis

The projects I have worked on are:

  • Enriching a tweet by linking it to a relevant news article in the weighted matrix factorization framework
  • Weighted matrix factorization for more nuanced and robust short text/sentence semantics
  • Subgroup detection in online discussion forums
  • Incorporating lexical semantics in topic models
  • Word sense disambiguation using multilingual evidence


Publications


Links

Semantic Textual Similarity (STS) Workshop

Personal

In my spare time I enjoy skiing, billiards, and watching Arsenal games. I am the most authoritative expert on Arsenal FC in the US :)