Daniel Hsu – Papers

The price of multi-group transductive learning
Noah Bergam, Samuel Deng, Daniel Hsu.
Preprint, 2026.
[ external link | bibtex ]

Fixed universal transformers
Jingwen Liu, Alexandr Andoni, Daniel Hsu.
Preprint, 2026.
[ external link | bibtex ]

ShakyPrepend: a multi-group learner with improved sample complexity
Lujing Zhang, Daniel Hsu, Sivaraman Balakrishnan.
Preprint, 2026.
[ external link | bibtex ]

Panprediction: optimal predictions for any downstream task and loss
Sivaraman Balakrishnan, Nika Haghtalab, Daniel Hsu, Brian Lee, Eric Zhao.
In Twenty-Ninth International Conference on Artificial Intelligence and Statistics, 2026.
[ external link | bibtex ]

Prior makes it possible: from sublinear graph algorithms to LLM test-time methods
Avrim Blum, Daniel Hsu, Cyrus Rashtchian, Donya Saless.
In Twenty-Ninth International Conference on Artificial Intelligence and Statistics, 2026.
[ external link | bibtex ]

Time-aware synthetic control
Saeyoung Rho, Cyrus Illick, Samhitha Narasipura, Alberto Abadie, Daniel Hsu, Vishal Misra.
In Twenty-Ninth International Conference on Artificial Intelligence and Statistics, 2026.
[ external link | bibtex ]

Group-realizable multi-group learning by minimizing empirical risk
Navid Ardeshir, Samuel Deng, Daniel Hsu, Jingwen Liu.
In Thirty-Seventh International Conference on Algorithmic Learning Theory, 2026.
[ external link | bibtex ]

Fast attention mechanisms: a tale of parallelism
Jingwen Liu, Hantao Yu, Clayton Sanford, Alexandr Andoni, Daniel Hsu.
In Advances in Neural Information Processing Systems 38, 2025.
[ external link | bibtex ]

Survey on algorithms for multi-index models
Joan Bruna, Daniel Hsu.
Statistical Science, 40(3):378-391, 2025.
[ external link | arxiv link | bibtex ]

Efficient estimation of the central mean subspace via smoothed gradient outer products
Gan Yuan, Mingyue Xu, Samory Kpotufe, Daniel Hsu.
SIAM Journal on Mathematics of Data Science, 7(3):1241-1264, 2025.
[ external link | arxiv link | bibtex ]

Learning compositional functions with transformers from easy-to-hard data
Zixuan Wang, Eshaan Nichani, Alberto Bietti, Alex Damian, Daniel Hsu, Jason D. Lee, Denny Wu.
In Thirty-Eighth Annual Conference on Learning Theory, 2025.
[ external link | talk by Eshaan | bibtex ]

Learning Gaussian multi-index models with gradient flow: time complexity and directional convergence
Berfin Şimşek, Amire Bendjeddou, Daniel Hsu.
In Twenty-Eighth International Conference on Artificial Intelligence and Statistics, 2025.
[ external link | bibtex ]

The piranha problem: large effects swimming in a small pond
Christopher Tosh, Philip Greengard, Ben Goodrich, Andrew Gelman, Aki Vehtari, Daniel Hsu.
Notices Amer. Math. Soc. 72(1):15-25, 2025.
[ external link | bibtex ]

Interactive machine teaching by labeling rules and instances
Giannis Karamanolakis, Daniel Hsu, Luis Gravano.
Transactions of the Association for Computational Linguistics, 12:1441-1459, 2024.
[ external link | tacl link | bibtex ]

Group-wise oracle-efficient algorithms for online multi-group learning
Samuel Deng, Daniel Hsu, Jingwen Liu.
In Advances in Neural Information Processing Systems 37, 2024.
[ external link | bibtex ]

One-layer transformers fail to solve the induction heads task
Clayton Sanford, Daniel Hsu, Matus Telgarsky.
Preprint, 2024.
[ external link | bibtex ]

Transformers provably learn sparse token selection while fully-connected nets cannot
Zixuan Wang, Stanley Wei, Daniel Hsu, Jason D. Lee.
In Forty-First International Conference on Machine Learning, 2024.
[ external link | pmlr link | bibtex ]

Transformers, parallel computation, and logarithmic depth
Clayton Sanford, Daniel Hsu, Matus Telgarsky.
In Forty-First International Conference on Machine Learning, 2024.
[ external link | talk slides | pmlr link | bibtex ]

Multi-group learning for hierarchical groups
Samuel Deng, Daniel Hsu.
In Forty-First International Conference on Machine Learning, 2024.
[ external link | pmlr link | bibtex ]

On the sample complexity of parameter estimation in logistic regression with normal design
Daniel Hsu, Arya Mazumdar.
In Thirty-Seventh Annual Conference on Learning Theory, 2024.
[ external link | talk slides | pmlr link | talk by Arya | bibtex ]

Distribution-specific auditing for subgroup fairness
Daniel Hsu, Jizhou Huang, Brendan Juba.
In Fifth Symposium on Foundations of Responsible Computing, 2024.
[ external link | arxiv link | bibtex ]

Statistical-computational trade-offs in tensor PCA and related problems via communication complexity
Rishabh Dudeja, Daniel Hsu.
The Annals of Statistics, 52(1):131-156, 2024.
[ local pdf file | external link | talk slides | aos link | bibtex ]

Representational strengths and limitations of transformers
Clayton Sanford, Daniel Hsu, Matus Telgarsky.
In Advances in Neural Information Processing Systems 36, 2023.
[ external link | talk slides | bibtex ]

Intrinsic dimensionality and generalization properties of the \(\mathcal{R}\)-norm inductive bias
Navid Ardeshir, Daniel Hsu, Clayton Sanford.
In Thirty-Sixth Annual Conference on Learning Theory, 2023.
[ external link | pmlr link | bibtex ]

Masked prediction: a parameter identifiability view
Bingbin Liu, Daniel Hsu, Pradeep Ravikumar, Andrej Risteski.
In Advances in Neural Information Processing Systems 35, 2022.
[ external link | bibtex ]

Unbiased estimators for random design regression
Michał Dereziński, Manfred K. Warmuth, Daniel Hsu.
Journal of Machine Learning Research, 23(167):1-46, 2022.
[ external link | arxiv link | bibtex ]

Near-optimal statistical query lower bounds for agnostically learning intersections of halfspaces with Gaussian marginals
Daniel Hsu, Clayton Sanford, Rocco A. Servedio, Emmanouil-Vasileios Vlatakis-Gkaragkounis.
In Thirty-Fifth Annual Conference on Learning Theory, 2022.
[ external link | pmlr link | bibtex ]

Learning tensor representations for meta-learning
Samuel Deng, Yilin Guo, Daniel Hsu, Debmalya Mandal.
In Twenty-Fifth International Conference on Artificial Intelligence and Statistics, 2022.
[ external link | bibtex ]

Simple and near-optimal algorithms for hidden stratification and multi-group learning
Christopher Tosh, Daniel Hsu.
In Thirty-Ninth International Conference on Machine Learning, 2022.
[ external link | talk slides | errata | pmlr link | bibtex ]

Contrastive estimation reveals topic posterior information to linear models
Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu.
Journal of Machine Learning Research, 22(281):1-31, 2021.
[ external link | talk slides | bibtex ]

Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz, Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu, Thodoris Lykouris, Miroslav Dudík, Robert E. Schapire.
In Advances in Neural Information Processing Systems 34, 2021.
[ external link | bibtex ]

Support vector machines and linear regression coincide with very high-dimensional features
Navid Ardeshir, Clayton Sanford, Daniel Hsu.
In Advances in Neural Information Processing Systems 34, 2021.
[ external link | bibtex ]

Classification vs regression in overparameterized regimes: Does the loss function matter?
Vidya Muthukumar, Adhyyan Narang, Vignesh Subramanian, Mikhail Belkin, Daniel Hsu, Anant Sahai.
Journal of Machine Learning Research, 22(222):1-69, 2021.
[ external link | arxiv link | bibtex ]

On the approximation power of two-layer networks of random ReLUs
Daniel Hsu, Clayton Sanford, Rocco A. Servedio, Emmanouil-Vasileios Vlatakis-Gkaragkounis.
In Thirty-Fourth Annual Conference on Learning Theory, 2021.
[ external link | talk slides | note about kernels | pmlr link | blog post | bibtex ]

Statistical query lower bounds for tensor PCA
Rishabh Dudeja, Daniel Hsu.
Journal of Machine Learning Research, 22(83):1-51, 2021.
[ external link | jmlr link | bibtex ]

Generalization bounds via distillation
Daniel Hsu, Ziwei Ji, Matus Telgarsky, Lan Wang.
In Ninth International Conference on Learning Representations, 2021.
[ external link | bibtex ]

On the proliferation of support vectors in high dimensions
Daniel Hsu, Vidya Muthukumar, Ji Xu.
In Twenty-Fourth International Conference on Artificial Intelligence and Statistics, 2021.
[ external link | reviews | response | pmlr link | bibtex ]

Contrastive learning, multi-view redundancy, and linear models
Christopher Tosh, Akshay Krishnamurthy, Daniel Hsu.
In Thirty-Second International Conference on Algorithmic Learning Theory, 2021.
[ external link | pmlr link | blog post | bibtex ]

Cross-lingual text classification with minimal resources by transferring a sparse teacher
Giannis Karamanolakis, Daniel Hsu, Luis Gravano.
In Conference on Empirical Methods in Natural Language Processing: Findings, 2020.
[ external link | aclweb link | bibtex ]

Interpreting deep learning models for weak lensing
José Manuel Zorrilla Matilla, Manasi Sharma, Daniel Hsu, Zoltán Haiman.
Phys. Rev. D, 102:123506, Dec 2020.
[ external link | aps link | bibtex ]

Ensuring fairness beyond the training data
Debmalya Mandal, Samuel Deng, Suman Jana, Jeannette M. Wing, Daniel Hsu.
In Advances in Neural Information Processing Systems 33, 2020.
[ external link | bibtex ]

Diameter-based interactive structure discovery
Christopher Tosh, Daniel Hsu.
In Twenty-Third International Conference on Artificial Intelligence and Statistics, 2020.
[ external link | pmlr link | bibtex ]

Two models of double descent for weak features
Mikhail Belkin, Daniel Hsu, Ji Xu.
SIAM Journal on Mathematics of Data Science, 2(4):1167–1180, 2020.
[ external link | arxiv link | bibtex ]

Kernel approximation methods for speech recognition
Avner May, Alireza Bagheri Garakani, Zhiyun Lu, Dong Guo, Kuan Liu, Aurélien Bellet, Linxi Fan, Michael Collins, Daniel Hsu, Brian Kingsbury.
Journal of Machine Learning Research, 20(59):1-36, 2019.
[ external link | bibtex ]

On the number of variables to use in principal component regression
Ji Xu, Daniel Hsu.
In Advances in Neural Information Processing Systems 32, 2019.
[ external link | bibtex ]

Leveraging just a few keywords for fine-grained aspect detection through weakly supervised co-training
Giannis Karamanolakis, Daniel Hsu, Luis Gravano.
In Conference on Empirical Methods in Natural Language Processing, 2019.
[ external link | talk slides | bibtex ]

Privacy accounting and quality control in the Sage differentially private ML platform
Mathias Lecuyer, Riley Spahn, Kiran Vodrahalli, Roxana Geambasu, Daniel Hsu.
In Twenty-Seventh ACM Symposium on Operating Systems Principles, 2019.
[ external link | bibtex ]

Weak lensing cosmology with convolutional neural networks on noisy data
Dezső Ribli, Bálint Ármin Pataki, José Manuel Zorrilla Matilla, Daniel Hsu, Zoltán Haiman, István Csabai.
Monthly Notices of the Royal Astronomical Society, 490(2):1843–1860, 2019.
[ external link | arxiv link | bibtex ]

Reconciling modern machine learning practice and the bias-variance trade-off
Mikhail Belkin, Daniel Hsu, Siyuan Ma, Soumik Mandal.
Proceedings of the National Academy of Sciences, 116(32):15849-15854, 2019.
[ local pdf file | pnas link | arxiv link | bibtex ]

Mixing time estimation in reversible Markov chains from a single sample path
Daniel Hsu, Aryeh Kontorovich, David A. Levin, Yuval Peres, Csaba Szepesvári, Geoffrey Wolfer.
The Annals of Applied Probability, 29(4):2439–2480, 2019.
[ local pdf file | aap link | bibtex ]

Using a machine learning approach to determine the space group of a structure from the atomic pair distribution function
Chia-Hao Liu, Yunzhe Tao, Daniel Hsu, Qiang Du, Simon J.L. Billinge.
Acta Crystallographica Section A, 75(4):633–643, 2019.
[ external link | arxiv link | bibtex ]

Teaching a black-box learner
Sanjoy Dasgupta, Daniel Hsu, Stefanos Poulis, Xiaojin Zhu.
In Thirty-Sixth International Conference on Machine Learning, 2019.
[ local pdf file | pmlr link | bibtex ]

A gradual, semi-discrete approach to generative network training via explicit Wasserstein minimization
Yucheng Chen, Matus Telgarsky, Chao Zhang, Bolton Bailey, Daniel Hsu, Jian Peng.
In Thirty-Sixth International Conference on Machine Learning, 2019.
[ external link | pmlr link | bibtex ]

Certified robustness to adversarial examples with differential privacy
Mathias Lecuyer, Vaggelis Atlidakis, Roxana Geambasu, Daniel Hsu, Suman Jana.
In IEEE Symposium on Security and Privacy, 2019.
[ external link | bibtex ]

Correcting the bias in least squares regression with volume-rescaled sampling
Michał Dereziński, Manfred K. Warmuth, Daniel Hsu.
In Twenty-Second International Conference on Artificial Intelligence and Statistics, 2019.
[ local pdf file | arxiv link | pmlr link | bibtex ]

Attribute-efficient learning of monomials over highly-correlated variables
Alexandr Andoni, Rishabh Dudeja, Daniel Hsu, Kiran Vodrahalli.
In Thirtieth International Conference on Algorithmic Learning Theory, 2019.
[ local pdf file | external link | bibtex ]

Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate
Mikhail Belkin, Daniel Hsu, Partha Mitra.
In Advances in Neural Information Processing Systems 31, 2018.
[ local pdf file | external link | short version | talk slides | bibtex ]

Leveraged volume sampling for linear regression
Michał Dereziński, Manfred K. Warmuth, Daniel Hsu.
In Advances in Neural Information Processing Systems 31, 2018.
[ local pdf file | external link | bibtex ]

Benefits of over-parameterization with EM
Ji Xu, Daniel Hsu, Arian Maleki.
In Advances in Neural Information Processing Systems 31, 2018.
[ external link | bibtex ]

Learning single-index models in Gaussian space
Rishabh Dudeja, Daniel Hsu.
In Thirty-First Annual Conference on Learning Theory, 2018.
[ local pdf file | pmlr link | bibtex ]

Non-Gaussian information from weak lensing data via deep learning
Arushi Gupta, José Manuel Zorrilla Matilla, Daniel Hsu, Zoltán Haiman.
Phys. Rev. D, 97:103515, May 2018.
[ external link | aps link | bibtex ]

Discovering foodborne illness in online restaurant reviews
Thomas Effland, Anna Lawson, Sharon Balter, Katelynn Devinney, Vasudha Reddy, HaeNa Waechter, Luis Gravano, and Daniel Hsu.
Journal of the American Medical Informatics Association, 25(12):1586-1592, 2018.
[ external link | bibtex ]

Coding sets with asymmetric information
Alexandr Andoni, Javad Ghaderi, Daniel Hsu, Dan Rubenstein, Omri Weinstein.
Preprint, 2017.
[ external link | bibtex ]

Linear regression without correspondence
Daniel Hsu, Kevin Shi, Xiaorui Sun.
In Advances in Neural Information Processing Systems 30, 2017.
[ external link | talk slides | bibtex ]

Subregional nowcasts of seasonal influenza using search trends
Sasikiran Kandula, Daniel Hsu, and Jeffrey Shaman.
Journal of Medical Internet Research, 19(11):e370, 2017.
[ external link | bibtex ]

Greedy approaches to symmetric orthogonal tensor decomposition
Cun Mu, Daniel Hsu, Donald Goldfarb.
SIAM Journal on Matrix Analysis and Applications, 38(4):1210-1226, 2017.
[ local pdf file | arxiv link | siam link | bibtex ]

Parameter identification in Markov chain choice models
Arushi Gupta, Daniel Hsu.
In Twenty-Eighth International Conference on Algorithmic Learning Theory, 2017.
[ external link | pmlr link | bibtex ]

Correspondence retrieval
Alexandr Andoni, Daniel Hsu, Kevin Shi, Xiaorui Sun.
In Thirtieth Annual Conference on Learning Theory, 2017.
[ local pdf file | talk slides | pmlr link | bibtex ]

FairTest: discovering unwarranted associations in data-driven applications
Florian Tramer, Vaggelis Atlidakis, Roxana Geambasu, Daniel Hsu, Jean-Pierre Hubaux, Mathias Humbert, Ari Juels, Huang Lin.
In Second IEEE European Symposium on Security and Privacy, 2017.
[ external link | slides from privacycon | bibtex ]

Kernel ridge vs. principal component regression: minimax bounds and the qualification of regularization operators
Lee H. Dicker, Dean P. Foster, Daniel Hsu.
Electronic Journal of Statistics, 1(1):1022–1047, 2017.
[ local pdf file | ejs link | bibtex ]

Greedy bi-criteria approximations for \(k\)-medians and \(k\)-means
Daniel Hsu, Matus Telgarsky.
Preprint, 2016.
[ external link | bibtex ]

Search improves label for active learning
Alina Beygelzimer, Daniel Hsu, John Langford, Chicheng Zhang.
In Advances in Neural Information Processing Systems 29, 2016.
[ local pdf file | arxiv link | video advert | bibtex ]

Global analysis of Expectation Maximization for mixtures of two Gaussians
Ji Xu, Daniel Hsu, Arian Maleki.
In Advances in Neural Information Processing Systems 29, 2016.
[ local pdf file | short version | summary | arxiv link | bibtex ]

Do dark matter halos explain lensing peaks?
José Manuel Zorrilla Matilla, Zoltán Haiman, Daniel Hsu, Arushi Gupta, Andrea Petri.
Phys. Rev. D, 94:083506, Oct 2016.
[ external link | aps link | bibtex ]

Unsupervised part-of-speech tagging with anchor hidden Markov models
Karl Stratos, Michael Collins, Daniel Hsu.
Transactions of the Association for Computational Linguistics, 4:245–257, 2016.
[ external link | bibtex ]

Compact kernel models for acoustic modeling via random feature selection
Avner May, Michael Collins, Daniel Hsu, Brian Kingsbury.
In Forty-First IEEE International Conference on Acoustics, Speech and Signal Processing, 2016.
[ external link | bibtex ]

Loss minimization and parameter estimation with heavy tails
Daniel Hsu, Sivan Sabato.
Journal of Machine Learning Research, 17(18):1–40, 2016.
[ external link | slides for related talk | bibtex ]

Mixing time estimation in reversible Markov chains from a single sample path
Daniel Hsu, Aryeh Kontorovich, Csaba Szepesvari.
In Advances in Neural Information Processing Systems 28, 2015.
[ external link | talk slides | bibtex ]

Efficient and parsimonious agnostic active learning
Tzu-Kuo Huang, Alekh Agarwal, Daniel Hsu, John Langford, Robert E. Schapire.
In Advances in Neural Information Processing Systems 28, 2015.
[ external link | bibtex ]

Sunlight: fine-grained targeting detection at scale with statistical confidence
Mathias Lecuyer, Riley Spahn, Yannis Spiliopoulos, Augustin Chaintreau, Roxana Geambasu, Daniel Hsu.
In Twenty-Second ACM Conference on Computer and Communications Security, 2015.
[ local pdf file | project website | bibtex ]

Model-based word embeddings from decompositions of count matrices
Karl Stratos, Michael Collins, Daniel Hsu.
In Fifty-Third Annual Meeting of the Association for Computational Linguistics, 2015.
[ local pdf file | acl link | bibtex ]

When are overcomplete topic models identifiable?
Anima Anandkumar, Daniel Hsu, Majid Janzamin, Sham M. Kakade.
Journal of Machine Learning Research, 16(Dec):2643–2694, 2015.
[ external link | bibtex ]

Successive rank-one approximations for nearly orthogonally decomposable symmetric tensors
Cun Mu, Daniel Hsu, Donald Goldfarb.
SIAM Journal on Matrix Analysis and Applications, 36(4):1638–1659, 2015.
[ external link | siam link | bibtex ]

A spectral algorithm for latent Dirichlet allocation
Anima Anandkumar, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Yi-Kai Liu.
Algorithmica, 72(1):193–214, 2015.
[ local pdf file | springer link | bibtex ]

Learning sparse low-threshold linear classifiers
Sivan Sabato, Shai Shalev-Shwartz, Nathan Srebro, Daniel Hsu, Tong Zhang.
Journal of Machine Learning Research, 16(Jul):1275–1304, 2015.
[ external link | bibtex ]

Scalable nonlinear learning with adaptive polynomial expansions
Alekh Agarwal, Alina Beygelzimer, Daniel Hsu, John Langford, Matus Telgarsky.
In Advances in Neural Information Processing Systems 27, 2014.
[ external link | bibtex ]

The large margin mechanism for differentially private maximization
Kamalika Chaudhuri, Daniel Hsu, Shuang Song.
In Advances in Neural Information Processing Systems 27, 2014.
[ external link | bibtex ]

A spectral algorithm for learning class-based \(n\)-gram models of natural language
Karl Stratos, Do-kyum Kim, Michael Collins, Daniel Hsu.
In Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014.
[ local pdf file | auai link | code by Karl | more code by Karl | bibtex ]

Taming the monster: a fast and simple algorithm for contextual bandits
Alekh Agarwal, Daniel Hsu, Satyen Kale, John Langford, Lihong Li, Robert E. Schapire.
In Thirty-First International Conference on Machine Learning, 2014.
[ local pdf file | talk slides | arxiv link | bibtex ]

Heavy-tailed regression with a generalized median-of-means
Daniel Hsu, Sivan Sabato.
In Thirty-First International Conference on Machine Learning, 2014.
[ external link | arxiv link | bibtex ]

Tensor decompositions for learning latent variable models
Anima Anandkumar, Rong Ge, Daniel Hsu, Sham M. Kakade, Matus Telgarsky.
Journal of Machine Learning Research, 15(Aug):2773–2831, 2014.
[ local pdf file | tutorial slides | jmlr link | bibtex ]

Random design analysis of ridge regression
Daniel Hsu, Sham M. Kakade, Tong Zhang.
Foundations of Computational Mathematics, 14(3):569–600, 2014.
[ local pdf file | springer link | arxiv link | bibtex ]

A tensor approach to learning mixed membership community models
Anima Anandkumar, Rong Ge, Daniel Hsu, Sham M. Kakade.
Journal of Machine Learning Research, 15(Jun):2239–2312, 2014.
[ external link | arxiv link | bibtex ]

When are overcomplete topic models identifiable?
Anima Anandkumar, Daniel Hsu, Majid Janzamin, Sham M. Kakade.
In Advances in Neural Information Processing Systems 26, 2013.
[ external link | bibtex ]

Contrastive learning using spectral methods
James Zou, Daniel Hsu, David Parkes, Ryan P. Adams.
In Advances in Neural Information Processing Systems 26, 2013.
[ local pdf file | bibtex ]

A tensor spectral approach to learning mixed membership community models
Anima Anandkumar, Rong Ge, Daniel Hsu, Sham M. Kakade.
In Twenty-Sixth Annual Conference on Learning Theory, 2013.
[ external link | journal version with better title | arxiv link | bibtex ]

Learning linear Bayesian networks with latent variables
Anima Anandkumar, Daniel Hsu, Adel Javanmard, Sham M. Kakade.
In Thirtieth International Conference on Machine Learning, 2013.
[ local pdf file | pmlr link | bibtex ]

Learning mixtures of spherical Gaussians: moment methods and spectral decompositions
Daniel Hsu, Sham M. Kakade.
In Fourth Innovations in Theoretical Computer Science, 2013.
[ local pdf file | talk slides | arxiv link | video advert | bibtex ]

Stochastic convex optimization with bandit feedback
Alekh Agarwal, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Alexander Rakhlin.
SIAM Journal on Optimization, 23(1):213–240, 2013.
[ local pdf file | arxiv link | siam link | bibtex ]

A spectral algorithm for latent Dirichlet allocation
Anima Anandkumar, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Yi-Kai Liu.
In Advances in Neural Information Processing Systems 25, 2012.
[ external link | journal version | springer link | bibtex ]

Learning mixtures of tree graphical models
Anima Anandkumar, Daniel Hsu, Furong Huang, Sham M. Kakade.
In Advances in Neural Information Processing Systems 25, 2012.
[ external link | bibtex ]

Identifiability and unmixing of latent parse trees
Daniel Hsu, Sham M. Kakade, Percy Liang.
In Advances in Neural Information Processing Systems 25, 2012.
[ local pdf file | arxiv link | bibtex ]

Random design analysis of ridge regression
Daniel Hsu, Sham M. Kakade, Tong Zhang.
In Twenty-Fifth Annual Conference on Learning Theory, 2012.
[ external link | journal version | arxiv link | bibtex ]

A method of moments for mixture models and hidden Markov models
Anima Anandkumar, Daniel Hsu, Sham M. Kakade.
In Twenty-Fifth Annual Conference on Learning Theory, 2012.
[ external link | talk slides | slides for related talk | pmlr link | bibtex ]

Convergence rates for differentially private statistical estimation
Kamalika Chaudhuri, Daniel Hsu.
In Twenty-Ninth International Conference on Machine Learning, 2012.
[ local pdf file | bibtex ]

Tail inequalities for sums of random matrices that depend on the intrinsic dimension
Daniel Hsu, Sham M. Kakade, Tong Zhang.
Electronic Communications in Probability, 17(14):1–13, 2012.
[ local pdf file | errata | ecp link | bibtex ]

A spectral algorithm for learning hidden Markov models
Daniel Hsu, Sham M. Kakade, Tong Zhang.
Journal of Computer and System Sciences, 78(5):1460–1480, 2012.
[ local pdf file | errata | jcss link | arxiv link | bibtex ]

A tail inequality for quadratic forms of subgaussian random vectors
Daniel Hsu, Sham M. Kakade, Tong Zhang.
Electronic Communications in Probability, 17(52):1–6, 2012.
[ local pdf file | note about lower tail | ecp link | bibtex ]

Stochastic convex optimization with bandit feedback
Alekh Agarwal, Dean P. Foster, Daniel Hsu, Sham M. Kakade, Alexander Rakhlin.
In Advances in Neural Information Processing Systems 24, 2011.
[ external link | journal version | siam link | bibtex ]

Spectral methods for learning multivariate latent tree structure
Anima Anandkumar, Kamalika Chaudhuri, Daniel Hsu, Sham M. Kakade, Le Song, Tong Zhang.
In Advances in Neural Information Processing Systems 24, 2011.
[ local pdf file | arxiv link | bibtex ]

Sample complexity bounds for differentially private learning
Kamalika Chaudhuri, Daniel Hsu.
In Twenty-Fourth Annual Conference on Learning Theory, 2011.
[ local pdf file | pmlr link | bibtex ]

Efficient optimal learning for contextual bandits
Miroslav Dudik, Daniel Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, Tong Zhang.
In Twenty-Seventh Conference on Uncertainty in Artificial Intelligence, 2011.
[ local pdf file | bibtex ]

Robust matrix decomposition with sparse corruptions
Daniel Hsu, Sham M. Kakade, Tong Zhang.
IEEE Transactions on Information Theory, 57(11):7221–7234, 2011.
[ local pdf file | arxiv link | ieee link | bibtex ]

Agnostic active learning without constraints
Alina Beygelzimer, Daniel Hsu, John Langford, Tong Zhang.
In Advances in Neural Information Processing Systems 23, 2010.
[ local pdf file | arxiv link | bibtex ]

An online learning-based framework for tracking
Kamalika Chaudhuri, Yoav Freund, Daniel Hsu.
In Twenty-Sixth Conference on Uncertainty in Artificial Intelligence, 2010.
[ external link | bibtex ]

Algorithms for active learning
Daniel Hsu.
Ph.D. dissertation, UC San Diego, 2010.
[ local pdf file | bibtex ]

A parameter-free hedging algorithm
Kamalika Chaudhuri, Yoav Freund, Daniel Hsu.
In Advances in Neural Information Processing Systems 22, 2009.
[ local pdf file | note about \(\epsilon\)-quantile regret | bibtex ]

Multi-label prediction via compressed sensing
Daniel Hsu, Sham M. Kakade, John Langford, Tong Zhang.
In Advances in Neural Information Processing Systems 22, 2009.
[ local pdf file | talk slides | arxiv link | bibtex ]

A spectral algorithm for learning hidden Markov models
Daniel Hsu, Sham M. Kakade, Tong Zhang.
In Twenty-Second Annual Conference on Learning Theory, 2009.
[ external link | journal version | errata | bibtex ]

Hierarchical sampling for active learning
Sanjoy Dasgupta, Daniel Hsu.
In Twenty-Fifth International Conference on Machine Learning, 2008.
[ local pdf file | bibtex ]

On-line estimation with the multivariate Gaussian distribution
Sanjoy Dasgupta, Daniel Hsu.
In Twentieth Annual Conference on Learning Theory, 2007.
[ local pdf file | bibtex ]

A concentration theorem for projections
Sanjoy Dasgupta, Daniel Hsu, Nakul Verma.
In Twenty-Second Conference on Uncertainty in Artificial Intelligence, 2006.
[ local pdf file | bibtex ]