Skip to main content

313
UPLOADS


More right-solid

Show sorted alphabetically

More right-solid

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Internet Archive Research Publication Crawls
Internet Archive Research Publication Crawls
collection
21,054
ITEMS
101.7M
VIEWS
by Internet Archive Web Group
collection

eye 101.7M

A series of open web crawls targeting journal articles, technical memos, essays, datasets, and other research publications. This collection contains WARC and CDX files that end up in Wayback ( https://web.archive.org ). See also bibliographic metadata corpuses at  https://archive.org/details/ia_biblio_metadata
UNPAYWALL-PDF-CRAWL-2018-07
UNPAYWALL-PDF-CRAWL-2018-07
collection
1,241
ITEMS
14.6M
VIEWS
by Internet Archive Web Group
collection

eye 14.6M

Web archive data from a crawl of open access PDF URLs provided by Unpaywall.
OAI-PMH-CRAWL-2020-06
OAI-PMH-CRAWL-2020-06
collection
2,946
ITEMS
4.9M
VIEWS
by Internet Archive Web Group
collection

eye 4.9M

MSAG-PDF-CRAWL-2017
collection
1,855
ITEMS
11.8M
VIEWS
by Internet Archive Web Group
collection

eye 11.8M

Microsoft Academic Graph public corpus (Feb 2016) PDF URLs, filtered to remove large sites (pubmed, citeseerx, arxiv) and already-crawled URLs.
Topics: papers, journals
OA-JOURNAL-CRAWL-2020-07
OA-JOURNAL-CRAWL-2020-07
collection
1,923
ITEMS
9.7M
VIEWS
by Internet Archive Web Group
collection

eye 9.7M

Open Access Journal Test Crawl (2018)
Open Access Journal Test Crawl (2018)
collection
794
ITEMS
10.8M
VIEWS
by Internet Archive Web Group
collection

eye 10.8M

UNPAYWALL-PDF-CRAWL-2019-04
UNPAYWALL-PDF-CRAWL-2019-04
collection
641
ITEMS
5.3M
VIEWS
by Internet Archive Web Group
collection

eye 5.3M

DATACITE-DOI-CRAWL-2020-01
DATACITE-DOI-CRAWL-2020-01
collection
1,417
ITEMS
3.7M
VIEWS
by Internet Archive Web Group
collection

eye 3.7M

MAG-PDF-CRAWL-2020-03
MAG-PDF-CRAWL-2020-03
collection
489
ITEMS
3.7M
VIEWS
by Internet Archive Web Group
collection

eye 3.7M

DIRECT-OA-CRAWL-2019
DIRECT-OA-CRAWL-2019
collection
2,566
ITEMS
5.2M
VIEWS
by Internet Archive Web Group
collection

eye 5.2M

CORE-UPSTREAM-CRAWL-2018-11
CORE-UPSTREAM-CRAWL-2018-11
collection
741
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

Crawl of "upstream" URLs from CORE (core.ac.uk) metadata dump. Only a partial seedlist of files crawled.
OA-DOI-CRAWL-2020-02
OA-DOI-CRAWL-2020-02
collection
278
ITEMS
3.2M
VIEWS
by Internet Archive Web Group
collection

eye 3.2M

JOURNALS-PATCH-CRAWL-2022-01
JOURNALS-PATCH-CRAWL-2022-01
collection
104
ITEMS
640,417
VIEWS
collection

eye 640,417

UNPAYWALL-PDF-CRAWL-2020-03
UNPAYWALL-PDF-CRAWL-2020-03
collection
344
ITEMS
1.8M
VIEWS
by Internet Archive Web Group
collection

eye 1.8M

MAG-PDF-CRAWL-2020-07
MAG-PDF-CRAWL-2020-07
collection
196
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

UNPAYWALL-PDF-CRAWL-2021-07
UNPAYWALL-PDF-CRAWL-2021-07
collection
174
ITEMS
943,493
VIEWS
collection

eye 943,493

MAG-PDF-CRAWL-2021-08
MAG-PDF-CRAWL-2021-08
collection
189
ITEMS
695,718
VIEWS
collection

eye 695,718

OA-DOI-CRAWL-2020-12
OA-DOI-CRAWL-2020-12
collection
191
ITEMS
1.4M
VIEWS
by Internet Archive Web Group
collection

eye 1.4M

UNPAYWALL-PDF-CRAWL-2020-11
UNPAYWALL-PDF-CRAWL-2020-11
collection
199
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

DOI-LANDING-CRAWL-2018-06
DOI-LANDING-CRAWL-2018-06
collection
279
ITEMS
3.2M
VIEWS
by Internet Archive Web Group
collection

eye 3.2M

UNPAYWALL-PDF-CRAWL-2020-05
UNPAYWALL-PDF-CRAWL-2020-05
collection
282
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

Wide Web Targeted PDF Crawling (2017)
Wide Web Targeted PDF Crawling (2017)
collection
922
ITEMS
3M
VIEWS
by Internet Archive Web Group
collection

eye 3M

OA-JOURNAL-CRAWL-2019-08
OA-JOURNAL-CRAWL-2019-08
collection
201
ITEMS
2.7M
VIEWS
by Internet Archive Web Group
collection

eye 2.7M

TARGETED-ARTICLE-CRAWL-2022-04
TARGETED-ARTICLE-CRAWL-2022-04
collection
219
ITEMS
226,602
VIEWS
collection

eye 226,602

PLATFORM-CRAWL-2020
PLATFORM-CRAWL-2020
collection
649
ITEMS
404,310
VIEWS
by Internet Archive Web Group
collection

eye 404,310

UNPAYWALL-PDF-CRAWL-2021-05
UNPAYWALL-PDF-CRAWL-2021-05
collection
123
ITEMS
863,674
VIEWS
by Internet Archive Web Group
collection

eye 863,674

CiteSeerX URL Crawl 2017
CiteSeerX URL Crawl 2017
collection
207
ITEMS
1.1M
VIEWS
collection

eye 1.1M

A targeted crawl to fetch research publications from the public web which have been crawled by CiteSeerX but have not previously been crawled by the Internet Archive.
Topics: scholarly, papers, journal
collection

eye 1.9M

IA crawl of PDF urls provided by Semantic Scholar.
Topic: pdf
SEMSCHOLAR-DIRECT-PDF-CRAWL-2020-02
SEMSCHOLAR-DIRECT-PDF-CRAWL-2020-02
collection
1,011
ITEMS
1.4M
VIEWS
by Internet Archive Web Group
collection

eye 1.4M

DOAJ-CRAWL-2020-11
DOAJ-CRAWL-2020-11
collection
102
ITEMS
871,619
VIEWS
by Internet Archive Web Group
collection

eye 871,619

OAI-PMH-PATCH-CRAWL-2021-12
OAI-PMH-PATCH-CRAWL-2021-12
collection
75
ITEMS
301,349
VIEWS
collection

eye 301,349

DOI-CRAWL-2022-02
DOI-CRAWL-2022-02
collection
25
ITEMS
180,583
VIEWS
collection

eye 180,583

JOURNAL-HOMEPAGE-CRAWL-2022-03
JOURNAL-HOMEPAGE-CRAWL-2022-03
collection
44
ITEMS
230,418
VIEWS
collection

eye 230,418

PubMed Central Crawl (2019-10)
PubMed Central Crawl (2019-10)
collection
216
ITEMS
409,763
VIEWS
by Internet Archive Web Group
collection

eye 409,763

PUBMEDCENTRAL-CRAWL-2020-02
PUBMEDCENTRAL-CRAWL-2020-02
collection
108
ITEMS
237,656
VIEWS
by Internet Archive Web Group
collection

eye 237,656

TARGETED-ARTICLE-CRAWL-2022-03
TARGETED-ARTICLE-CRAWL-2022-03
collection
9
ITEMS
46,809
VIEWS
collection

eye 46,809

SCIELO-CRAWL-2020-07
SCIELO-CRAWL-2020-07
collection
41
ITEMS
190,307
VIEWS
by Internet Archive Web Group
collection

eye 190,307

ARXIV-PUBMEDCENTRAL-CRAWL-2020-04
ARXIV-PUBMEDCENTRAL-CRAWL-2020-04
collection
60
ITEMS
103,812
VIEWS
by Internet Archive Web Group
collection

eye 103,812

arXiv Content Crawl (2019-10)
arXiv Content Crawl (2019-10)
collection
37
ITEMS
66,742
VIEWS
by Internet Archive Web Group
collection

eye 66,742

UNPAYWALL-PDF-CRAWL-2022-04
UNPAYWALL-PDF-CRAWL-2022-04
collection
38
ITEMS
13,849
VIEWS
collection

eye 13,849

CiteSeerX URL Crawl 2017
web

eye 6,207

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 00:40:24 PDT 2017 to Mon Jul 10 16:07:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,716

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 11:31:07 PDT 2017 to Thu Jul 6 08:54:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,420

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 15:04:04 PDT 2017 to Thu Jul 6 17:45:16 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,014

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:36:43 PDT 2017 to Wed Jul 5 14:00:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,602

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:09:11 PDT 2017 to Thu Jul 6 04:53:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,059

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 22:56:35 PDT 2017 to Thu Jul 6 17:34:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,559

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:54:02 PDT 2017 to Wed Jul 5 13:11:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,716

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:21:08 PDT 2017 to Wed Jul 5 13:39:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,622

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:18:25 PDT 2017 to Thu Jul 6 01:29:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,887

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 21:50:17 PDT 2017 to Thu Jul 6 16:11:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,208

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:28:55 PDT 2017 to Wed Jul 5 12:44:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,912

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 18:52:51 PDT 2017 to Wed Jul 5 12:06:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,681

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:55:20 PDT 2017 to Wed Jul 5 14:21:01 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,634

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:04:31 PDT 2017 to Thu Jul 6 00:05:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,379

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 00:14:00 PDT 2017 to Thu Jul 6 18:31:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,373

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:56:39 PDT 2017 to Wed Jul 5 23:08:54 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,399

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:27:22 PDT 2017 to Wed Jul 5 06:40:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,223

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:47:41 PDT 2017 to Wed Jul 5 20:59:49 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 9,209

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:08:27 PDT 2017 to Wed Jul 5 05:22:22 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,454

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 21:11:09 PDT 2017 to Thu Jul 6 15:05:55 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,397

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:01:18 PDT 2017 to Wed Jul 5 17:59:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 11,127

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:06:40 PDT 2017 to Wed Jul 5 06:20:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,181

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:39:30 PDT 2017 to Wed Jul 5 22:34:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,189

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:18:30 PDT 2017 to Wed Jul 5 12:32:14 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,815

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 06:57:33 PDT 2017 to Thu Jul 6 02:08:23 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,240

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 01:31:00 PDT 2017 to Thu Jul 6 19:44:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,456

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:41:55 PDT 2017 to Wed Jul 5 12:59:15 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,542

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 02:26:07 PDT 2017 to Wed Jul 5 20:39:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,036

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:44:33 PDT 2017 to Wed Jul 5 00:57:33 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,467

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 14:52:50 PDT 2017 to Thu Jul 6 08:15:06 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 10,216

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:45:15 PDT 2017 to Thu Jul 6 01:55:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,395

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 11:48:17 PDT 2017 to Wed Jul 5 05:01:28 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,259

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:41:00 PDT 2017 to Wed Jul 5 18:35:58 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,977

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 15:40:25 PDT 2017 to Wed Jul 5 08:53:07 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,844

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:38:19 PDT 2017 to Wed Jul 5 20:50:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,590

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:19:51 PDT 2017 to Wed Jul 5 20:32:06 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,456

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:08:34 PDT 2017 to Wed Jul 5 13:25:24 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,698

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 23:14:09 PDT 2017 to Wed Jul 5 17:01:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,714

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:35:43 PDT 2017 to Thu Jul 6 01:46:47 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,714

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:37:05 PDT 2017 to Wed Jul 5 22:50:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 11,222

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:57:06 PDT 2017 to Wed Jul 5 06:10:16 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,776

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 22:33:37 PDT 2017 to Wed Jul 5 16:22:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,224

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:17:48 PDT 2017 to Wed Jul 5 05:29:23 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,388

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 11:38:04 PDT 2017 to Wed Jul 5 04:54:20 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,983

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 18:19:44 PDT 2017 to Thu Jul 6 13:45:57 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,419

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 17:35:44 PDT 2017 to Wed Jul 5 10:49:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,757

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:46:51 PDT 2017 to Wed Jul 5 22:58:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,214

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:17:51 PDT 2017 to Wed Jul 5 00:28:29 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,699

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 01:37:24 PDT 2017 to Wed Jul 5 19:42:27 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,528

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 16:02:02 PDT 2017 to Thu Jul 6 09:30:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,604

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:35:29 PDT 2017 to Wed Jul 5 00:48:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,754

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 17:26:32 PDT 2017 to Thu Jul 6 10:55:12 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,447

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 17:23:45 PDT 2017 to Wed Jul 5 10:38:21 PDT 2017.
Topic: crawldata
DATASET-CRAWL-2022-01
DATASET-CRAWL-2022-01
collection
2
ITEMS
4,486
VIEWS
collection

eye 4,486

CiteSeerX URL Crawl 2017
web

eye 8,472

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:54:24 PDT 2017 to Wed Jul 5 01:08:02 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,315

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:05:46 PDT 2017 to Wed Jul 5 12:18:30 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 9,686

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:53:13 PDT 2017 to Thu Jul 6 02:05:47 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,095

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:12:55 PDT 2017 to Wed Jul 5 01:26:34 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,218

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 18:41:03 PDT 2017 to Wed Jul 5 11:56:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,241

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 18:09:15 PDT 2017 to Wed Jul 5 11:23:47 PDT 2017.
Topic: crawldata