Skip to main content

9,549
UPLOADS


More right-solid

More right-solid

Show sorted alphabetically

More right-solid

Show sorted alphabetically

More right-solid

More right-solid
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Internet Archive Research Publication Crawls
Internet Archive Research Publication Crawls
collection
21,054
ITEMS
99.6M
VIEWS
by Internet Archive Web Group
collection

eye 99.6M

A series of open web crawls targeting journal articles, technical memos, essays, datasets, and other research publications. This collection contains WARC and CDX files that end up in Wayback ( https://web.archive.org ). See also bibliographic metadata corpuses at  https://archive.org/details/ia_biblio_metadata
OAI-PMH-CRAWL-2020-06
OAI-PMH-CRAWL-2020-06
collection
2,946
ITEMS
4.7M
VIEWS
by Internet Archive Web Group
collection

eye 4.7M

UNPAYWALL-PDF-CRAWL-2018-07
UNPAYWALL-PDF-CRAWL-2018-07
collection
1,241
ITEMS
14.3M
VIEWS
by Internet Archive Web Group
collection

eye 14.3M

Web archive data from a crawl of open access PDF URLs provided by Unpaywall.
OA-JOURNAL-CRAWL-2020-07
OA-JOURNAL-CRAWL-2020-07
collection
1,923
ITEMS
9.5M
VIEWS
by Internet Archive Web Group
collection

eye 9.5M

MSAG-PDF-CRAWL-2017
collection
1,855
ITEMS
11.5M
VIEWS
by Internet Archive Web Group
collection

eye 11.5M

Microsoft Academic Graph public corpus (Feb 2016) PDF URLs, filtered to remove large sites (pubmed, citeseerx, arxiv) and already-crawled URLs.
Topics: papers, journals
Open Access Journal Test Crawl (2018)
Open Access Journal Test Crawl (2018)
collection
794
ITEMS
10.6M
VIEWS
by Internet Archive Web Group
collection

eye 10.6M

UNPAYWALL-PDF-CRAWL-2019-04
UNPAYWALL-PDF-CRAWL-2019-04
collection
641
ITEMS
5.2M
VIEWS
by Internet Archive Web Group
collection

eye 5.2M

MAG-PDF-CRAWL-2020-03
MAG-PDF-CRAWL-2020-03
collection
489
ITEMS
3.6M
VIEWS
by Internet Archive Web Group
collection

eye 3.6M

DIRECT-OA-CRAWL-2019
DIRECT-OA-CRAWL-2019
collection
2,566
ITEMS
5.1M
VIEWS
by Internet Archive Web Group
collection

eye 5.1M

CORE-UPSTREAM-CRAWL-2018-11
CORE-UPSTREAM-CRAWL-2018-11
collection
741
ITEMS
1.5M
VIEWS
by Internet Archive Web Group
collection

eye 1.5M

Crawl of "upstream" URLs from CORE (core.ac.uk) metadata dump. Only a partial seedlist of files crawled.
JOURNALS-PATCH-CRAWL-2022-01
JOURNALS-PATCH-CRAWL-2022-01
collection
104
ITEMS
590,763
VIEWS
collection

eye 590,763

OA-DOI-CRAWL-2020-02
OA-DOI-CRAWL-2020-02
collection
278
ITEMS
3.2M
VIEWS
by Internet Archive Web Group
collection

eye 3.2M

UNPAYWALL-PDF-CRAWL-2020-03
UNPAYWALL-PDF-CRAWL-2020-03
collection
344
ITEMS
1.7M
VIEWS
by Internet Archive Web Group
collection

eye 1.7M

DATACITE-DOI-CRAWL-2020-01
DATACITE-DOI-CRAWL-2020-01
collection
1,417
ITEMS
3.6M
VIEWS
by Internet Archive Web Group
collection

eye 3.6M

OA-DOI-CRAWL-2020-12
OA-DOI-CRAWL-2020-12
collection
191
ITEMS
1.4M
VIEWS
by Internet Archive Web Group
collection

eye 1.4M

MAG-PDF-CRAWL-2021-08
MAG-PDF-CRAWL-2021-08
collection
189
ITEMS
660,560
VIEWS
collection

eye 660,560

UNPAYWALL-PDF-CRAWL-2021-07
UNPAYWALL-PDF-CRAWL-2021-07
collection
174
ITEMS
908,257
VIEWS
collection

eye 908,257

MAG-PDF-CRAWL-2020-07
MAG-PDF-CRAWL-2020-07
collection
196
ITEMS
1.5M
VIEWS
by Internet Archive Web Group
collection

eye 1.5M

UNPAYWALL-PDF-CRAWL-2020-11
UNPAYWALL-PDF-CRAWL-2020-11
collection
199
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

DOI-LANDING-CRAWL-2018-06
DOI-LANDING-CRAWL-2018-06
collection
279
ITEMS
3.2M
VIEWS
by Internet Archive Web Group
collection

eye 3.2M

TARGETED-ARTICLE-CRAWL-2022-04
TARGETED-ARTICLE-CRAWL-2022-04
collection
219
ITEMS
201,633
VIEWS
collection

eye 201,633

UNPAYWALL-PDF-CRAWL-2020-05
UNPAYWALL-PDF-CRAWL-2020-05
collection
282
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

Wide Web Targeted PDF Crawling (2017)
Wide Web Targeted PDF Crawling (2017)
collection
922
ITEMS
3M
VIEWS
by Internet Archive Web Group
collection

eye 3M

OA-JOURNAL-CRAWL-2019-08
OA-JOURNAL-CRAWL-2019-08
collection
201
ITEMS
2.7M
VIEWS
by Internet Archive Web Group
collection

eye 2.7M

PLATFORM-CRAWL-2020
PLATFORM-CRAWL-2020
collection
649
ITEMS
379,588
VIEWS
by Internet Archive Web Group
collection

eye 379,588

SEMSCHOLAR-DIRECT-PDF-CRAWL-2020-02
SEMSCHOLAR-DIRECT-PDF-CRAWL-2020-02
collection
1,011
ITEMS
1.4M
VIEWS
by Internet Archive Web Group
collection

eye 1.4M

UNPAYWALL-PDF-CRAWL-2021-05
UNPAYWALL-PDF-CRAWL-2021-05
collection
123
ITEMS
841,987
VIEWS
by Internet Archive Web Group
collection

eye 841,987

collection

eye 1.9M

IA crawl of PDF urls provided by Semantic Scholar.
Topic: pdf
OAI-PMH-PATCH-CRAWL-2021-12
OAI-PMH-PATCH-CRAWL-2021-12
collection
75
ITEMS
284,225
VIEWS
collection

eye 284,225

DOAJ-CRAWL-2020-11
DOAJ-CRAWL-2020-11
collection
102
ITEMS
854,580
VIEWS
by Internet Archive Web Group
collection

eye 854,580

CiteSeerX URL Crawl 2017
CiteSeerX URL Crawl 2017
collection
207
ITEMS
1.1M
VIEWS
collection

eye 1.1M

A targeted crawl to fetch research publications from the public web which have been crawled by CiteSeerX but have not previously been crawled by the Internet Archive.
Topics: scholarly, papers, journal
DOI-CRAWL-2022-02
DOI-CRAWL-2022-02
collection
25
ITEMS
166,413
VIEWS
collection

eye 166,413

JOURNAL-HOMEPAGE-CRAWL-2022-03
JOURNAL-HOMEPAGE-CRAWL-2022-03
collection
44
ITEMS
218,024
VIEWS
collection

eye 218,024

PubMed Central Crawl (2019-10)
PubMed Central Crawl (2019-10)
collection
216
ITEMS
400,669
VIEWS
by Internet Archive Web Group
collection

eye 400,669

PUBMEDCENTRAL-CRAWL-2020-02
PUBMEDCENTRAL-CRAWL-2020-02
collection
108
ITEMS
232,578
VIEWS
by Internet Archive Web Group
collection

eye 232,578

ARXIV-PUBMEDCENTRAL-CRAWL-2020-04
ARXIV-PUBMEDCENTRAL-CRAWL-2020-04
collection
60
ITEMS
101,172
VIEWS
by Internet Archive Web Group
collection

eye 101,172

arXiv Content Crawl (2019-10)
arXiv Content Crawl (2019-10)
collection
37
ITEMS
64,150
VIEWS
by Internet Archive Web Group
collection

eye 64,150

TARGETED-ARTICLE-CRAWL-2022-03
TARGETED-ARTICLE-CRAWL-2022-03
collection
9
ITEMS
44,165
VIEWS
collection

eye 44,165

SCIELO-CRAWL-2020-07
SCIELO-CRAWL-2020-07
collection
41
ITEMS
187,797
VIEWS
by Internet Archive Web Group
collection

eye 187,797

UNPAYWALL-PDF-CRAWL-2022-04
UNPAYWALL-PDF-CRAWL-2022-04
collection
38
ITEMS
11,972
VIEWS
collection

eye 11,972

Open Science Framework
Open Science Framework
collection
95,324
ITEMS
104,378
VIEWS
by Center for Open Science
collection

eye 104,378

Top-level collection for content mirrored from Open Science Framework (OSF, https://osf.io) repositories into Internet Archive.
OSF Registrations
OSF Registrations
collection
95,455
ITEMS
104,099
VIEWS
by Center for Open Science
collection

eye 104,099

Top-level collection for archiving Open Science Framework (OSF) Registrations into Internet Archive. Part of a collaboration with Center for Open Science.
arXiv.org Bulk Content
arXiv.org Bulk Content
collection
6,767
ITEMS
171,970
VIEWS
by arxiv.org
collection

eye 171,970

This collection contains PDF and source file (LaTeX) copies of content from the arxiv.org pre-print server, in the bulk-access format they provide via AWS S3. More information available at:  https://arxiv.org/help/bulk_data_s3 Note that direct access to the internal PDF files is possible, eg: https://archive.org/download/arXiv_pdf_0001_001/arXiv_pdf_0001_001.tar/0001%2Fastro-ph0001001.pdf However, we strongly prefer folks access these files via the individual items associated with each...
Tor Project Archives
Tor Project Archives
collection
3,858
ITEMS
27,706
VIEWS
by The Tor Project
collection

eye 27,706

Archived versions of Tor Browser Bundle software and other Tor Project artifacts. This item is maintained by the Tor Project organization for historical interest and research use, not as a primary installation mechanism. Please visit  https://torproject.org/  to download and install Tor software.
Tianchi V700 KTV
Tianchi V700 KTV
collection
3,697
ITEMS
98,488
VIEWS
collection

eye 98,488

Music, Instrumentals and Wistful Backgrounds and Music to Sing Korean Hits To.
Topic: karaoke, North Korea
Movies
by "Paywall The Movie"
movies

eye 6,422

favorite 3

comment 0

"Paywall: The Business of Scholarship" is a documentary film released in 2018 about the scholarly publishing industry and the Open Access movement. More information available from https://paywallthemovie.com/paywall Website blurb: "Paywall: The Business of Scholarship is a documentary which focuses on the need for open access to research and science, questions the rationale behind the $25.2 billion a year that flows into for-profit academic publishers, examines the 35-40% profit...
Topics: Open Access, Copyright, Publishing
CiteSeerX URL Crawl 2017
web

eye 6,007

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 00:40:24 PDT 2017 to Mon Jul 10 16:07:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,246

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 15:04:04 PDT 2017 to Thu Jul 6 17:45:16 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,533

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 11:31:07 PDT 2017 to Thu Jul 6 08:54:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,493

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:04:31 PDT 2017 to Thu Jul 6 00:05:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,578

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 01:37:24 PDT 2017 to Wed Jul 5 19:42:27 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,849

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:36:43 PDT 2017 to Wed Jul 5 14:00:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,408

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 02:26:07 PDT 2017 to Wed Jul 5 20:39:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,904

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 22:56:35 PDT 2017 to Thu Jul 6 17:34:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,050

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:39:30 PDT 2017 to Wed Jul 5 22:34:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,766

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:00:34 PDT 2017 to Thu Jul 6 01:11:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,441

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:09:11 PDT 2017 to Thu Jul 6 04:53:41 PDT 2017.
Topic: crawldata
Dat Early Days Collection
Dat Early Days Collection
collection
4
ITEMS
6,302
VIEWS
collection

eye 6,302

'dat' is a distributed web data archiving and transfer tool, originally developed by Code for Science, a grant-funded US non-profit. This collection preserves a selection of early and experimental dat archives. Note that important dat metadata is contained in a '.dat/' subdirectory, which is not displayed under "download" file listings by defaults, but can be browsed and downloaded from archive.org over HTTP(S) as expected.
Topics: dat, distributed web
DATASET-CRAWL-2022-01
DATASET-CRAWL-2022-01
collection
2
ITEMS
4,363
VIEWS
collection

eye 4,363

CiteSeerX URL Crawl 2017
web

eye 6,487

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:35:29 PDT 2017 to Wed Jul 5 00:48:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,560

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:21:08 PDT 2017 to Wed Jul 5 13:39:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,474

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:18:25 PDT 2017 to Thu Jul 6 01:29:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,595

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:35:43 PDT 2017 to Thu Jul 6 01:46:47 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 9,063

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:08:27 PDT 2017 to Wed Jul 5 05:22:22 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,572

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 23:14:09 PDT 2017 to Wed Jul 5 17:01:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,136

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:41:00 PDT 2017 to Wed Jul 5 18:35:58 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,066

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:25:18 PDT 2017 to Thu Jul 6 06:39:00 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 11,586

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:27:34 PDT 2017 to Wed Jul 5 05:39:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,758

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:01:08 PDT 2017 to Thu Jul 6 02:12:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 11,002

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:06:40 PDT 2017 to Wed Jul 5 06:20:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,471

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:19:51 PDT 2017 to Wed Jul 5 20:32:06 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,742

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 21:50:17 PDT 2017 to Thu Jul 6 16:11:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,839

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 21:11:06 PDT 2017 to Wed Jul 5 14:42:47 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,063

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:28:55 PDT 2017 to Wed Jul 5 12:44:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,273

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:27:22 PDT 2017 to Wed Jul 5 06:40:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,654

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 22:33:37 PDT 2017 to Wed Jul 5 16:22:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,235

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 00:14:00 PDT 2017 to Thu Jul 6 18:31:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,808

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:45:37 PDT 2017 to Wed Jul 5 05:59:07 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,407

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:54:02 PDT 2017 to Wed Jul 5 13:11:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,041

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 15:29:50 PDT 2017 to Wed Jul 5 08:40:44 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,468

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:28:50 PDT 2017 to Wed Jul 5 20:41:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,045

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:08:25 PDT 2017 to Thu Jul 6 01:20:22 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,722

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:38:19 PDT 2017 to Wed Jul 5 20:50:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,060

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:18:30 PDT 2017 to Wed Jul 5 12:32:14 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,531

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:55:20 PDT 2017 to Wed Jul 5 14:21:01 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,733

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:29:23 PDT 2017 to Thu Jul 6 02:41:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,267

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:01:18 PDT 2017 to Wed Jul 5 17:59:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,330

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 21:11:09 PDT 2017 to Thu Jul 6 15:05:55 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,769

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 18:52:51 PDT 2017 to Wed Jul 5 12:06:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 9,914

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 01:25:08 PDT 2017 to Wed Jul 5 18:40:27 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,006

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 14:06:22 PDT 2017 to Wed Jul 5 07:19:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,865

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 18:19:44 PDT 2017 to Thu Jul 6 13:45:57 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,561

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 17:47:56 PDT 2017 to Wed Jul 5 11:02:06 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,331

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:41:55 PDT 2017 to Wed Jul 5 12:59:15 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,777

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 11:59:15 PDT 2017 to Wed Jul 5 05:11:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 10,096

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:45:15 PDT 2017 to Thu Jul 6 01:55:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,033

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 22:05:23 PDT 2017 to Wed Jul 5 15:42:16 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,089

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:11:26 PDT 2017 to Wed Jul 5 20:24:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,420

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 16:02:02 PDT 2017 to Thu Jul 6 09:30:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,691

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 06:57:33 PDT 2017 to Thu Jul 6 02:08:23 PDT 2017.
Topic: crawldata