All Projects → cligs → tmw

cligs / tmw

Licence: other
Topic Modeling Workflow in Python

Programming Languages

python
139335 projects - #7 most used programming language

tmw - Topic Modeling Workflow

10.5281/zenodo.32490

What is tmw?

tmw is a python module for topic modeling, including some preprocessing of texts and some postprocessing of topic model data. This set of functions is experimental in nature and quality.

Requirements

  • tmw has been developed for and tested only on Linux (Ubuntu 14.04).
  • Python 3 (tested with 3.4), Mallet (tested with 2.0.7) and TreeTagger with the desired parameter files.
  • Python 3 packages numpy, pandas, matplotlib, lxml, scipy, seaborn, wordcloud.

Usage

Acknowledgements

Most of the code has been written by Christof Schöch, with significant contributions by Daniel Schlör. The project is inspired by Allan Riddell's TaTOM tutorials. We are grateful to all the existing packages and code this project reuses.

Contact

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].