Topic Modeling
Introduction and BackgroundAs our collective knowledge continues to be digitized and stored in on-line documents, we need new tools for organizing and annotating them. "Topic modeling," uses a suite of new machine learning algorithms that examine texts to provide new methods of navigating digitized information. With topic models, we can search and explore a collection of documents based on the themes that run through it. We can "zoom in" and "zoom out" to find specific or broader themes; we can look at how those themes changed through time; we can see how themes are connected to each other. Topic models enable us to organize and summarize electronic archives at a scale that is impossible by human annotation. This work has been developed in collaboration with David Blei an assistant professor in the Computer Science department at Princeton University. His research interests include:
DescriptionA small example of the relationships created via topic modeling for the JSTOR Political Science discipline contents. You can explore the topics and relationships via a simple search interface or word index.
|
Contact Information
|


