Setting up the Termite Data Server: A New Walkthrough

A Termite Topic Model Visualization of the Green Party’s Website from September 2007.

A Termite Topic Model Visualization of the Green Party’s Website from September 2007.

We’ve been working on various visualizations for our web archives collections. One bottleneck was topic modeling using MALLET: both due to limitations on just how fast we can get it running, but also into how to make the results usable for the average user.

Termite was one such option. While it has decent documentation, it can be difficult to munge data into it.

Shawn Dickinson, one of my RAs in the Web Archives for Historical Research Group, wrote up some great code that takes a directory of text files and prepares them for Termite.

As with all our walkthroughs, it is available in our GitHub repository as Setting up Termite Visualizations on OS X”. Feedback is always appreciated, either here or by submitting a Pull request.

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s