New Chapter: “Welcome to the Web: The Online Community of GeoCities during the early years of the World Wide Web”

Well, I certainly won’t win any awards for “most concise chapter title,” but my latest publication “Welcome to the Web: The Online Community of GeoCities during the early years of the World Wide Web,” is now available in the open-access publication The Web as History. This book, edited by Niels Brügger and Ralph Schroeder, has been … Continue reading New Chapter: “Welcome to the Web: The Online Community of GeoCities during the early years of the World Wide Web”

Finding Popular Images within a Web Archive: Exploring GeoCities

I’m back from the great annual Digital Humanities conference in Krakow (and a nice, two-week follow-up vacation), and have returned to the always growing warcbase platform. One of our research assistants, Youngbin Kim, has been working on some image extraction commands and I was looking forward to putting it to the test. Finding popular images … Continue reading Finding Popular Images within a Web Archive: Exploring GeoCities

Exploring the GeoCities Web Archive with Warcbase & Spark: Links (or how we can use warcbase to find amazing sites to ask historical questions!)

In my last post, we left off with scripts running to extract all URLs and a link diagram. They finished decently quickly – about three days on our rho server at York University, or about 30 minutes on our roaringly-fast cluster. Given that hopefully we will be running these only once or twice at first, … Continue reading Exploring the GeoCities Web Archive with Warcbase & Spark: Links (or how we can use warcbase to find amazing sites to ask historical questions!)

Exploring the GeoCities Web Archive with Warcbase & Spark: Getting Started

Nick Ruest and I had some great news a few weeks ago: a collection of GeoCities WARCs was on its way on a few hard drives. I’ve previously done quite a bit of work on the GeoCities torrent, but as we’ve been doing parallel development on warcbase while working with the torrent, it’s been difficult … Continue reading Exploring the GeoCities Web Archive with Warcbase & Spark: Getting Started

Herrenhausen Big Data Podcast: Coding History on GeoCities

Last post (three day conference deserves three posts, right?) for my trip to Hannover, Germany for the “Big Data in a Transdisciplinary Perspective” conference. I had the opportunity to sit down with Daniel Meßner, who hosts a podcast called Coding History. I really enjoyed our conversation, and wanted to link to it here. You can … Continue reading Herrenhausen Big Data Podcast: Coding History on GeoCities

Herrenhausen Big Data Lightning Talk: Finding Community in the Ruins of GeoCities

I was fortunate to receive a travel grant to present my research in a short, three-minute slot plus poster at the Herrenhäuser Konferenz: Big Data in a Transdisciplinary Perspective in Hanover, Germany. Here’s what I’ll be saying (pretty strictly) in my slot this afternoon. Some of it is designed to respond to the time format … Continue reading Herrenhausen Big Data Lightning Talk: Finding Community in the Ruins of GeoCities

Testing Cohesiveness in GeoCities Neighbourhoods by Extracting and Plotting Locations

This weekend, I went back to my old GeoCities archive to play around with the methods I experimented with in my last post on the Wide Web Scrape. One question that I’ve been curious about was whether GeoCities was a community (drawing on an old debate that waged in the 1990s and beyond about virtual … Continue reading Testing Cohesiveness in GeoCities Neighbourhoods by Extracting and Plotting Locations

Exploring the Neighbourhoods of GeoCities

[a random blog post of some thoughts, mostly just to keep myself thinking about stuff] With GeoCities, I have been particularly interested in the neighbourhoods and communities that have formed. How cohesive were they? Were they virtual communities (in this, getting into the debates kicked off by folks like Howard Rheingold, Lori Kendall, and Constance … Continue reading Exploring the Neighbourhoods of GeoCities