Skip to main content

Archive Team

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.



rss RSS

1,247,596
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Backup of all the webpages of the Home of the Underdogs site when it was hosted as www.the-underdogs.info, found at the ed2k network. (10/03/2008)
Topics: home of the underdogs hotu www, the-underdogs, info backup webpages
The Archive Team Just In Time Grabs
web

eye 154

favorite 1

comment 0

A static web pages backup of the Home of the Underdogs found at the ed2k network (formerly at www.the-underdogs.info), one of the largest abandonware websites on the Internet. Creative Commons license: Attribution-Noncommercial-Share Alike 3.0
Topics: Videogame, Home of the Underdogs, HotU, Metadata, Abandonware
urlteam Web Crawls
Apr 4, 2011 badcheese
software

eye 124

favorite 0

comment 0

urlteam Web Crawls
Apr 5, 2011
data

eye 62

favorite 0

comment 0

urlteam Web Crawls
Apr 5, 2011
data

eye 193

favorite 0

comment 0

urlteam Web Crawls
Apr 29, 2011
web

eye 217

favorite 0

comment 0

geocitiestorrent
The Archive Team Just In Time Grabs
May 3, 2011
web

eye 4,341

favorite 4

comment 0

Founded in 2004, Encyclopedia Dramatica (ED) was a free-form Wiki dedicated to collecting all manner of internet subculture, including illustrations, descriptions and histories, especially as related to trolling, troll activities, and internet drama. Unlike many mainstream wikis such as Wikipedia, ED was intentionally indecent, obtuse, and inaccurate - the actual information related to a situation would come with further research, not by simply reading the related ED article. Constantly on the...
Topics: encyclopedia dramatica, anonymous, trolling, wiki, wikidump, lulz
The Archive Team Geocities Valhalla
web

eye 7,177

favorite 5

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 1,114

favorite 1

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 1,648

favorite 1

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 2,196

favorite 2

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 1,550

favorite 1

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 1,790

favorite 1

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 1,325

favorite 1

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Geocities Valhalla
web

eye 1,246

favorite 1

comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). This collection was put together by nearly 100 folks assembling at the news of the death of Geocities, a website that allowed free hosting of web pages...
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 834

favorite 3

comment 0

From the README: This is a collection of mirrors maintained by gopher.quux.org. These mirrors were taken offline in 2006 due to bandwidth constraints. This collection prepared April 2010 by John Goerzen -------------------------------------------------- mirrors.tar.bz2 -------------------------------------------------- Compressed size: 1.6GB Uncompressed size: 3.8GB File count: 102736 The content includes: boombox.micro.umn.edu /pub/gopher from the FTP site, including various historic Gopher...
Topics: gopher, quux, archiveteam, usenet
The Archive Team Just In Time Grabs
May 11, 2011 boingboing.net / individual authors
web

eye 288

favorite 0

comment 0

Two collections of Boing Boing postings provided by the cultural website boingboing.net on its 5th and 11th anniversaries. Includes the HTML/text aspects of the postings, along with various author and creation information. From the 2011 BoingBoing.net posting: "Having very recently celebrated Boing Boing's eleventh bloggaversary, we're releasing an update of our previous archival release of Boing Boing posts. "This time, we're releasing a 120.3MB XML file (38.3MB zip) of 63,999 posts...
Topics: BoingBoing, posts archive, blogging
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 917

favorite 0

comment 0

Explanatory file included with this archive, with slight edits: On Monday 24th January 2011 the BBC announced [1] that it would be restructuring its online department - with 360 job losses and the deletion of 200 of its top level directories (including the websites that live under them - eg http://www.bbc.co.uk/blast). 172 of of those top level directories [2] were due to be deleted within the coming 12 months. Most of these sites are already 'mothballed' [3], which means that the BBC has...
The Archive Team Just In Time Grabs
web

eye 10,486

favorite 0

comment 0

This dataset is a collection of scraped public twitter updates used in coordination with an academic project to study the geolocation data related to twittering. From the explanatory PDF in the dataset collection: We provide both training set and test set (collected from September 2009 to January 2010) in the paper You Are Where You Tweet: A Content-Based Approach to Geo-locating Twitter Users in CIKM 2010. The training set contains 115,886 Twitter users and 3,844,612 updates from the users....
Topics: academic paper, twitter, tweets, location, geolocation, archiveteam
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 297

favorite 0

comment 0

EtherPad was a web-based collaborative real-time editor, allowing authors to simultaneously edit a text document, and see all of the participants' edits in real-time, with the ability to display each author's text in their own color. Very popular and in use by educators, businesses, and developers, Etherpad gained a strong following, but was later purchased by Google. With the introduction of the competing Wave application, Google announced a shutdown of Etherpad in favor of Wave. To outcry,...
Topics: etherpad, archiveteam, archive
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 735

favorite 0

comment 0

(BudhaM0nk) i want hard drives so small i can snort them up like powder and increase my brain capacity Comprising the wit, wisdom, brilliance and buffoonery of thousands of individuals over decades, Quote Databases have provided easy access to amusing snatches of conversation from IRC and other online gathering places. While many of these sites are still up, this package of compressed archives allow easy access to the full collections of quotes from various sources. This collection was built in...
Topics: quotes, qdb, archiveteam
The Archive Team Just In Time Grabs
web

eye 12,895

favorite 0

comment 0

Facebook data scrape related to paper "The Social Structure of Facebook Networks", by Amanda L. Traud, Peter J. Mucha, Mason A. Porter. "We study the social structure of Facebook "friendship" networks at one hundred American colleges and universities at a single point in time, and we examine the roles of user attributes - gender, class year, major, high school, and residence - at these institutions. We investigate the influence of common attributes at the dyad level in...
Topics: arcademic, facebook, facebook networks, networks, archiveteam
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 231

favorite 0

comment 0

American Powerblogs was a blog hosting service that provided ease-of-use access to blogging software. Allowing its users the ability to create their own subdomains and presentation style, Powerblogs was used by a relatively small but energetic community of bloggers. This is a 108-blog snapshot of the final month of Powerblogs, before their shutdown.
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 534

favorite 0

comment 0

Voluntary dataset on affinities of 60,000+ Reddit users, recorded in 2010. From the enclosed readme file: "I filtered the list of votes for the list of users that gave us permission to use their data. For the curious, that's 67,059 users: 62,763 with "public votes" and 6,726 with "allow my data to be used for research"...I'm trying to use it to build a recommender, and I've got some preliminary source code. I'm looking for feedback on all of these steps, since I'm not...
Topics: reddit, database, archiveteam, affinities, mysql
The Archive Team Just In Time Grabs
May 11, 2011
web

eye 286

favorite 0

comment 0

This is a panic download of the starwars.yahoo.com forums and profiles, done before the closure of same by Yahoo on December 15, 2009. This includes as many messages, profiles, and pages related to the site as could be easily brought in.
Topics: archiveteam, archive, starwars.yahoo.com, yahoo, use the force
The Archive Team Just In Time Grabs
May 14, 2011 John Goerzen
movies

eye 3,367

favorite 11

comment 1

Original Curator's statement by John Goerzen: Back in the early 1990s, before there was a World Wide Web, there was the Internet Gopher. It was a distributed information system in the same sense as the web, but didn’t use hypertext and was text-based. Gopher was popular back then, as it made it easy to hop from one server to the next in a way that FTP didn’t. Gopher has hung on over the years, and is still clinging to life in a way. Back in 2007, I was disturbed at the number of old famous...
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: gopher, gopherspace
The Archive Team Just In Time Grabs
movies

eye 287

favorite 0

comment 0

This is the working directory of the OpenAMD project's estimator from The Next Hope, as well as the complete packet dump for obtained during our collection. The collection starts at Friday 12:23:34.714999 and ends at Sunday 16:04:53.403616. It contains, all told, 200123338 packets. Almost all of these are well-formed packets from TNH badges; there are known to be some TLH badges in there (which the localizer knows how to decrypt) and maybe some surprises. Just for clarity, the aggregator was...
Topic: OpenAMD
The Archive Team Just In Time Grabs
Jun 8, 2011 Jeff Atwood, Stackoverflow.com
web

eye 912

favorite 1

comment 1

Stack Overflow / Stack Exchange Creative Commons data dump, to start of April 2011. Includes - http://stackoverflow.com - http://serverfault.com - http://superuser.com - http://meta.stackoverflow.com - http://meta.serverfault.com - http://meta.superuser.com - http://stackapps.com And any other public (non-beta) website and its corresponding meta site at http://stackexchange.com/sites The original torrent of this material was provided hosting by ClearBits.
favoritefavorite ( 1 reviews )
Topics: Stackoverflow, serverfault.com, stackoverflow.com, superuser.com
The Archive Team Friendster Snapshot Collection
Jul 5, 2011 Archiveteam
web

eye 491

favorite 0

comment 0

Before its relaunch as a gaming website, Friendster was a social networking website that allowed users to connect with their friends. One of the elements of the site were the groups that members could join. This dataset contains the group memberships of all Friendster groups. It is the result of an extensive crawl of Friendster.com at the end of June 2011. It was performed as part of the ArchiveTeam project to archive part of the Friendster data before the service relaunched. The data files...
Topics: Friendster, Groups, Group Lists, Membership Lists, Archive Team
The Archive Team Friendster Snapshot Collection
Jul 5, 2011 Archiveteam
web

eye 6,920

favorite 1

comment 0

Before its relaunch as a gaming website, Friendster was a social networking website that allowed users to connect with their friends. The central element of the site was the 'friends list', showing the contacts of the user. This dataset contains the connections between all Friendster users. It is the result of an extensive crawl of Friendster.com at the end of June 2011. It was performed as part of the ArchiveTeam project to archive part of the Friendster data before the service relaunched. The...
Topics: Friendster, Friends, Friend Lists, Membership Lists, Archive Team
The Archive Team Just In Time Grabs
Jul 5, 2011 Archive Team
web

eye 497

favorite 0

comment 0

In May of 2011, Salon announced the deletion of the Table Talk message base, with 30 days notice, after 16 years of operation. With little attempt to find a new home for the site, and with little reason given, the site was ultimately deleted in June of 2011 and replaced with an article reminiscing on the history of Table Talk. Archive Team has downloaded the full public threads of Table Talk, excluding group threads that had a semi-private setting, predating most search engines.
Topics: Table Talk, Discussions, Salon, Archive Team
The Archive Team Just In Time Grabs
Jul 25, 2011
web

eye 63

favorite 0

comment 0

WGET grab of WELL.COM user websites conducted in November of 2008. Includes every externally-findable userpage and subdirectory of Google, for purposes of historical research and archiving.
The Archive Team Just In Time Grabs
Jul 28, 2011
web

eye 470

favorite 0

comment 0

A web archive of News of the World before it closed its doors. This is a copy of *most of* the www.newsoftheworld.co.uk website. I haven't checked everything but from doing some quick exploring around the data myself, I only found a few missing pages. This mirror was started about 2-3 days before the site went down so I can't be sure if everything made it or not, any pages you find that 404 were probably scraped after the site went down. I was also away on holidays at the time this all happened...
Topics: News of The World, NOTW
The Archive Team Just In Time Grabs
data

eye 4,435

favorite 9

comment 1

From Yoann Padioleau, a merging of various GIT repositories containing the full history of the Linux Kernel from 0.01 to 2.6. From his description: It's built from 3 other git repositories: - the one from Dave Jones from 0.01 to 2.4.0, - the one from tglx from 2.4.0 to 2.6.12, - the one from Linus Torvalds from 2.6.12 to now. I used the "graft" feature of git (thanks to Junio and people on #git for the tip) to link them together. I also modified (via a git-filter-branch) the dates of...
favoritefavoritefavorite ( 1 reviews )
Topics: Linux, GIT Repository, Linux History, Code History, Kernel history
The Archive Team Just In Time Grabs
Aug 1, 2011
web

eye 303

favorite 0

comment 0

Billed as "Twitpic for Audio", the Twaud.io service (Twitter Audio) allowed a short-form URL to post audio snippets on Twitter. Launched in May of 2009 by Massive Robot, Twaud.io was one of a number of third-party services bringing rich content access to Twitter streams. With a limit of 10 megabytes, no limit on content or approach, and an easy to use API, Twaud.io seemed poised for some level of success. In 2011, Twaud.io announced it was shutting down, and gracefully cut off...
Topics: mp3, twaudio, twaud.io, Massive Robot, audio
The Archive Team Just In Time Grabs
Aug 3, 2011 THE ARCHiVERS
web

eye 534

favorite 0

comment 0

Ripped bY THE ARCHiVERS for our brothers Archive Team. Visit us: http://w4r3zh4ck.blogspot.com/
Topics: archive, team, archive team, AT, archiveteam, the archivers, archivers, site rip, rip, w4r3zh4ck,...
The Archive Team Just In Time Grabs
Aug 10, 2011
web

eye 45,695

favorite 0

comment 0

This is a download of http://forum.nos.nl/, the online discussion forums of Dutch public broadcaster NOS. It contains messages posted in 2005, 2006 and 2007 by visitors of the NOS website. Discussion topics include news, politics and NOS programmes. Downloaded June 2011. -- The archive is available in several formats: * a copy of the HTML page of each topic (including every message posted on the forum) * an XML file for each topic, providing the messages extracted from the HTML * a wget...
Topics: forum.nos.org, NOS, Forum, Archive
The Archive Team Just In Time Grabs
web

eye 27,379

favorite 0

comment 0

This is a Heritrix crawl of http://llink.nl/, the website of Dutch public broadcasting association LLiNK, made on 23 and 24 June 2011. It includes the main website as well as the programme-specific websites of LLiNK radio and television programmes. The crawl logs and order file are available in llink-20110624-crawl-logs.tar.bz2 -- The MD5 checksums of the files are: 050b714c6df98a29bdb6c1ff077c6953 llink-20110623100606-00000.warc.gz 49de8110b71ba8da7607eafbfe80fd50...
Topics: LLiNK, Dutch, Archive, Webgrab, NPO
The Archive Team Just In Time Grabs
Aug 17, 2011
web

eye 256

favorite 1

comment 0

Billed as "Twitpic for Audio", the Twaud.io service (Twitter Audio) allowed a short-form URL to post audio snippets on Twitter. Launched in May of 2009 by Massive Robot, Twaud.io was one of a number of third-party services bringing rich content access to Twitter streams. With a limit of 10 megabytes, no limit on content or approach, and an easy to use API, Twaud.io seemed poised for some level of success. In 2011, Twaud.io announced it was shutting down, and gracefully cut off...
Topics: mp3, twaudio, twaud.io, Massive Robot, audio
The Archive Team Just In Time Grabs
web

eye 552

favorite 1

comment 0

This is a Heritrix web archive of www.jana-news.ly, the site of the official state news agency in Libya. The archive was made on August 22, 2011, when the site was still online. The last article was published on August 21, 2011. On August 25 this was still the most recent information on the site. The site has sections in Arabic, English and French. From Wikipedia's description of JANA: -- The Jamahiriya News Agency, also known as JANA, was the official state news agency in Libya. It was founded...
The Archive Team Just In Time Grabs
Sep 2, 2011
web

eye 732

favorite 2

comment 0

Thingiverse is a website dedicated to the sharing of user-created digital design files. Providing primarily open source hardware designs licensed under the GNU General Public License or Creative Commons licenses, users choose the type of user license they wish to attach to the designs they share. 3D printers, laser cutters, milling machines and many other technologies can be used to physically create the files shared by the users on Thingiverse. Thingiverse is widely used in the DIY technology...
The Archive Team Just In Time Grabs
web

eye 260

favorite 0

comment 0

This is a Heritrix web crawl of VKBlog.nl, or Volkskrantblog, the weblogging service of Dutch newspaper De Volkskrant. Subscribers of the newspaper could use the service to run their own blogs, which some 18.000 of them actually did. Started since 2005, the service was hoped to increase the use of citizen journalism at De Volkskrant. In January 2011, De Volkskrant announced the closure of the service later in the year. The actual shut down was postponed several times while De Volkskrant...
The Archive Team Friendster Snapshot Collection
The Archive Team Friendster Snapshot Collection
collection
143
ITEMS
45,156
VIEWS
Sep 20, 2011
collection

eye 45,156

Founded in 2002 by Jonathan Abrams and Peter Chin, Friendster was one of the more popular social networking sites, predating later services like Facebook and MySpace. It provided a singular platform to share music, writings, photographs and profiles between a growing amount of users, which grew to roughly 112 million over 9 years. Unlike previous sites like Geocities, Angelfire and Tripod, Friendster allowed larger spaces for upload of data, resulting in exponential growth and a leading...
The Archive Team Friendster Snapshot Collection
web

eye 31

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 40

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 333

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 52

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 40

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 25

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 22

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 33

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 45

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 27

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 30

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 22

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 35

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 27

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 20

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 40

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 129

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 69

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 28

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 26

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 54

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 35

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 26

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 33

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 22

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 47

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 49

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 44

favorite 0

comment 0

The Archive Team Friendster Snapshot Collection
web

eye 21

favorite 0

comment 0