Topic modeling

Topic models are a suite of algorithms that uncover the hidden thematic structure in document collections. These algorithms help us develop new ways to search, browse and summarize large archives of texts.
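
As a rough illustration of what "uncovering hidden thematic structure" can look like in practice, here is a minimal sketch of one well-known topic model, latent Dirichlet allocation, fit by collapsed Gibbs sampling. It is a toy written only for this example and is not the code our group releases: the function names (`fit_lda`, `top_words`), the hyperparameter values, and the four-document corpus are all assumptions.

```python
import random


def fit_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Toy collapsed Gibbs sampler for LDA (illustrative, not optimized).

    docs is a list of token lists. Returns (topic_word, doc_topic, vocab),
    where the first two are raw count tables, not normalized probabilities.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]                # tokens in doc d assigned to topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # times word v is assigned to topic k
    topic_total = [0] * num_topics                              # total tokens assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    # Repeatedly resample each token's topic given all other assignments.
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    return topic_word, doc_topic, vocab


def top_words(topic_word, vocab, n=3):
    """Return the n most frequently assigned words for each topic."""
    return [
        [vocab[v] for v in sorted(range(len(vocab)), key=lambda v: -row[v])[:n]]
        for row in topic_word
    ]


# Hypothetical four-document corpus, already tokenized.
docs = [
    "gene dna genetic gene sequence".split(),
    "dna genome gene sequence genetic".split(),
    "court judge law trial court".split(),
    "law trial judge jury court".split(),
]

topic_word, doc_topic, vocab = fit_lda(docs, num_topics=2)
for k, words in enumerate(top_words(topic_word, vocab)):
    print(f"topic {k}: {words}")
```

Each row of `doc_topic` counts how many tokens of a document were assigned to each topic, which is the kind of summary that makes it possible to browse a collection by theme. On a corpus this small the result depends heavily on the random seed and the hyperparameters, so treat the printed topics as a demonstration only.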

Below, you will find links to introductory materials and to open-source topic modeling software from my research group.

Introductory materials

Topic modeling software

There are many open-source packages available for topic modeling.

Our research group regularly releases the code associated with our papers through a GitHub organization. (If you are not familiar with GitHub, see this table for some of the code.)