Shay Cohen

Postdoctoral fellow

Department of Computer Science
Columbia University


Contact Information

scohen [strudel] cs.columbia.edu
724 CESPR
Phone: 412-657-1377


About Me
My broad interests are in the intersection of computational linguistics and machine learning. I am interested in developing ways for reasoning about compositional structures such as parse trees through the use of formalisms such as probabilistic grammars.

I work with Michael Collins. I received my doctoral degree from Carnegie Mellon University, where I worked with my advisor, Noah Smith. Before that, I completed a bachelor's degree (mathematics and computer science) and a master's degree (computer science) in Tel-Aviv University. I am now supported by an NSF/CRA CI Fellowship.

Here is a link to my dissertation, titled Computational Learning of Probabilistic Grammars in the Unsupervised Setting.

I am on the job market. My CV is available here.



Publications [show all abstracts]
  • The Effect of Non-tightness on Bayesian Estimation of PCFGs, Shay B. Cohen and Mark Johnson, ACL 2013 (accepted)
  • Experiments with Spectral Learning of Latent-Variable PCFGs, Shay B. Cohen, Karl Stratos, Michael Collins, Dean P. Foster and Lyle Ungar, In NAACL 2013 (accepted) [pdf]
  • Approximate PCFG Parsing Using Tensor Decomposition, Shay B. Cohen, Giorgio Satta and Michael Collins, In NAACL 2013 (accepted) [pdf]
  • Tensor Decomposition for Fast Latent-Variable PCFG Parsing, Shay B. Cohen and Michael Collins, In NIPS 2012 [pdf] [abstract] [bibtex]
  • Elimination of Spurious Ambiguity in Transition-Based Dependency Parsing, Shay B. Cohen, Carlos Gómez-Rodríguez and Giorgio Satta, In arXiv (1206.6735), 2012 [pdf] [abstract] [bibtex]
  • Spectral Learning of Latent-Variable PCFGs, Shay B. Cohen, Karl Stratos, Michael Collins, Dean P. Foster and Lyle Ungar, In ACL 2012 [pdf] [longer version, stronger model] [abstract] [bibtex]
  • Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning, Shay B. Cohen and Noah A. Smith, Computational Linguistics (2012) [pdf] [abstract] [bibtex]
  • Unsupervised Structure Prediction with Non-Parallel Multilingual Guidance, Shay B. Cohen, Dipanjan Das and Noah A. Smith, In EMNLP 2011[pdf] [abstract] [bibtex]
  • Exact Inference for Generative Probabilistic Non-Projective Dependency Parsing, Shay B. Cohen, Carlos Gómez-Rodríguez and Giorgio Satta, In EMNLP 2011 [pdf] [abstract] [bibtex]
  • Products of Weighted Logic Programs, Shay B. Cohen, Robert J. Simmons and Noah A. Smith, In Theory and Practice of Logic Programming, 2011 [pdf] [abstract] [bibtex]
  • Empirical Risk Minimization with Approximations of Probabilistic Grammars, Shay B. Cohen and Noah A. Smith, In NIPS, 2010 [pdf] [appendix - pdf] [abstract] [bibtex]
  • Covariance in Unsupervised Learning of Probabilistic Grammars, Shay B. Cohen and Noah A. Smith, In JMLR, 2010 [pdf] [abstract] [bibtex]
  • Viterbi Training for PCFGs: Hardness Results and Competitiveness of Uniform Initialization, Shay B. Cohen and Noah A. Smith, In ACL 2010 [pdf] [abstract] [bibtex]
  • Variational Inference for Adaptor Grammars, Shay B. Cohen, David M. Blei and Noah A. Smith, In NAACL 2010 [pdf] [abstract] [bibtex]
  • Variational Inference for Grammar Induction with Prior Knowledge, Shay B. Cohen and Noah A. Smith, In ACL 2009 (short paper track) [pdf] [abstract] [bibtex]
  • Shared Logistic Normal Distributions for Soft Parameter Tying in Unsupervised Grammar Induction, Shay B. Cohen and Noah A. Smith, In NAACL 2009 [pdf] [abstract] [bibtex]
  • Logistic Normal Priors for Unsupervised Probabilistic Grammar Induction, Shay B. Cohen, Kevin Gimpel and Noah A. Smith, In NIPS 2008 [pdf] [code] [abstract] [bibtex]
  • Dynamic Programming Algorithms as Products of Weighted Logic Programs, Shay B. Cohen, Robert J. Simmons and Noah A. Smith, In ICLP 2008 (best student paper award) [springer] [journal-version] [abstract] [bibtex]
  • Joint Morphological and Syntactic Disambiguation, Shay B. Cohen and Noah A. Smith, In EMNLP 2007 [pdf] [abstract] [bibtex]
  • Feature Selection Via Coalitional Game Theory, Shay B. Cohen, Gideon Dror and Eytan Ruppin, In Neural Computation 19:7, 2007 [pdf] [bibtex]
  • Feature Selection Based on the Shapley Value, Shay B. Cohen, Gideon Dror and Eytan Ruppin, In IJCAI 2005 [pdf] [bibtex]

Technical Reports and Others
Teaching
Software
  • dageem - code for unsupervised grammar induction using logistic normal prior
    News: the code has been completely rewritten in Java, and includes some extensions. You can check it out from the Google code repository here.
  • variational inference for adaptor grammars (soon - email me if you want to be notified when put online.)



None Last modified: June, 2012.