Skip to main content

Custom Crawl Services

Internet Archive

Large-scale web harvests and national domain crawls performed for National Libraries, National Archives, preservation partners, research initiatives, and as part of special projects and custom crawling and research services.



rss RSS

166,121
RESULTS


More right-solid

Show sorted alphabetically

More right-solid

Show sorted alphabetically

More right-solid

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Internet Archive Research Publication Crawls
by CNKI
data

eye 0

favorite 0

comment 0

Metadata about COVID-19 papers downloaded from:  http://en.gzbd.cnki.net/GZBT/brief/Default.aspx
Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Wed Jan 7 20:23:10 PST 2015 to Wed Jan 7 12:39:29 PST 2015.
Topic: crawldata
Olympics Crawl 2014
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from the Olympics 2014 crawl, captured by wbgrp-crawl013.us.archive.org:olympics2014 from Sun Jan 12 03:18:51 PST 2014 to Sat Jan 11 19:45:29 PST 2014.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Wed Oct 9 11:34:20 PDT 2019 to Wed Oct 9 05:27:59 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Thu Dec 10 09:18:38 PST 2020 to Thu Dec 10 05:18:24 PST 2020.
Topic: crawldata
NLIL_2013
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from National Library of Israel, captured by wbgrp-crawl008.us.archive.org:NLIL-CRAWL-01 from Mon Oct 28 17:21:10 PDT 2013 to Mon Oct 28 22:23:52 PDT 2013.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Sun Dec 27 04:52:35 PST 2020 to Sat Dec 26 20:52:51 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Mon Jan 12 09:04:00 PST 2015 to Mon Jan 12 01:27:37 PST 2015.
Topic: crawldata
BNL 2020 Winter Domain Crawl
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from the National Library of LUXEMBOURG, captured by wbgrp-crawl005.us.archive.org:LUX-008-PATCH-2020-02-26 from Sat Feb 29 19:34:21 PST 2020 to Mon Mar 2 11:33:19 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Mon Oct 7 22:14:03 PDT 2019 to Mon Oct 7 15:42:29 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Sun Nov 29 05:19:18 PST 2020 to Sun Nov 29 01:18:27 PST 2020.
Topic: crawldata
BNL 2017 Summer Domain Crawl
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of LUXEMBOURG, captured by wbgrp-crawl206.us.archive.org:LUX-003-2017-07-24-PATCH from Tue Aug 1 09:23:35 PDT 2017 to Tue Aug 1 09:52:07 PDT 2017.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Sun Dec 20 12:53:37 PST 2020 to Sun Dec 20 08:47:48 PST 2020.
Topic: crawldata
BNL 2019 Winter Domain Crawl
web

eye 22

favorite 0

comment 0

Internet Archive crawldata from the National Library of LUXEMBOURG, captured by wbgrp-crawl008.us.archive.org:LUX-006-PREF-2019-01-28 from Tue Feb 19 08:41:59 PST 2019 to Tue Feb 19 04:57:13 PST 2019.
Topic: crawldata
National Libary of Ireland 2017 Web Archive
web

eye 13

favorite 0

comment 0

Internet Archive crawldata from the National Library of IRELAND, captured by wbgrp-crawl005.us.archive.org:NLI-2017-PREF-2017-11-13 from Mon Nov 20 13:44:28 PST 2017 to Mon Nov 20 06:14:09 PST 2017.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Wed Dec 30 00:53:58 PST 2020 to Tue Dec 29 20:52:51 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Fri Jan 9 03:59:40 PST 2015 to Thu Jan 8 20:19:49 PST 2015.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Sun Jan 17 09:35:05 PST 2021 to Sun Jan 17 01:37:05 PST 2021.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Sat Dec 26 16:53:29 PST 2020 to Sat Dec 26 12:52:51 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Wed Jan 20 16:12:07 PST 2021 to Wed Jan 20 09:57:53 PST 2021.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Fri Dec 25 01:18:33 PST 2020 to Thu Dec 24 21:18:28 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress115th from Fri Nov 16 16:05:01 PST 2018 to Fri Nov 16 17:07:12 PST 2018.
Topic: crawldata
NLA 2019 Domain Crawl
web

eye 630

favorite 0

comment 0

Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-crawl003.us.archive.org:NLA-AU-CRAWL-2019 from Mon Mar 25 23:49:21 PDT 2019 to Mon Mar 25 20:44:48 PDT 2019.
Topic: crawldata
Wide Web Targeted PDF Crawling (2017)
web

eye 1,011

favorite 0

comment 0

Internet Archive crawldata of web PDF content captured by wbgrp-svc284.us.archive.org:TARGETED-PDF-CRAWL-2017 from Sat Nov 18 06:46:08 PST 2017 to Fri Nov 17 23:41:59 PST 2017.
Topic: crawldata
Internet Archive crawldata of web PDF content captured by wbgrp-svc284.us.archive.org:TARGETED-PDF-CRAWL-2017 from Sun Nov 19 16:16:08 PST 2017 to Sun Nov 19 08:53:13 PST 2017.
Topic: crawldata
Internet Archive crawldata of web PDF content captured by wbgrp-svc284.us.archive.org:TARGETED-PDF-CRAWL-2017 from Tue Nov 28 10:05:58 PST 2017 to Tue Nov 28 03:41:42 PST 2017.
Topic: crawldata
Internet Archive crawldata from the NARA 114th Congressional Crawl, captured by wbgrp-crawl204.us.archive.org:congress114th from Tue Nov 15 22:12:07 PST 2016 to Tue Nov 15 14:32:52 PST 2016.
Topic: crawldata
Internet Archive crawldata from the NARA 114th Congressional Crawl, captured by wbgrp-crawl204.us.archive.org:congress114th from Thu Nov 17 03:41:29 PST 2016 to Wed Nov 16 20:45:33 PST 2016.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Sat Sep 29 07:12:40 PDT 2018 to Sat Sep 29 00:43:58 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Tue Sep 25 20:17:02 PDT 2018 to Sat Sep 29 23:48:08 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Thu Oct 4 17:01:37 PDT 2018 to Thu Oct 4 11:00:00 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Fri Nov 27 01:19:23 PST 2020 to Thu Nov 26 21:18:23 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Wed Jan 20 21:40:57 PST 2021 to Wed Jan 20 13:57:54 PST 2021.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Mon Jan 18 21:58:57 PST 2021 to Mon Jan 18 15:42:29 PST 2021.
Topic: crawldata
Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-svc282.us.archive.org:NLA-AU-CRAWL-2019 from Mon Apr 8 22:37:04 PDT 2019 to Mon Apr 8 19:11:05 PDT 2019.
Topic: crawldata
NLNZ_2020
web

eye 399

favorite 0

comment 0

Internet Archive crawldata from New Zealand Winter 2020 domain crawl, captured by wbgrp-crawl001.us.archive.org:NLNZ-NZ-CRAWL-009 from Sat Feb 22 01:23:26 PST 2020 to Fri Feb 21 22:20:02 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Tue Oct 8 12:37:46 PDT 2019 to Tue Oct 8 06:04:21 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Mon Oct 7 23:46:23 PDT 2019 to Mon Oct 7 17:15:15 PDT 2019.
Topic: crawldata
National Libary of Ireland 2017 Web Archive
by Internet Archive
web

eye 15

favorite 0

comment 0

Internet Archive crawldata from the National Library of IRELAND, captured by wbgrp-crawl009.us.archive.org:NLI-2017-PATCH-2018-01-16 from to Thu Feb 15 14:23:11 PST 2018.
Topic: crawldata
NLA_2015
web

eye 119

favorite 0

comment 0

Internet Archive crawldata from National Library of AUSTRALIA, captured by wbgrp-crawl008.us.archive.org:NLA-AU-CRAWL from Mon Mar 16 07:51:41 PDT 2015 to Mon Mar 16 03:14:02 PDT 2015.
Topic: crawldata
NLIL_2013
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from National Library of Israel, captured by wbgrp-crawl008.us.archive.org:NLIL-CRAWL-01 from Tue Oct 29 19:58:48 PDT 2013 to Wed Oct 30 01:53:56 PDT 2013.
Topic: crawldata
Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-svc231.us.archive.org:NLA-AU-CRAWL-2019-PATCH from Mon Apr 29 16:07:33 PDT 2019 to Sat May 4 12:14:03 PDT 2019.
Topic: crawldata
National Libary of Ireland 2017 Web Archive
web

eye 10

favorite 0

comment 0

Internet Archive crawldata from the National Library of IRELAND, captured by wbgrp-crawl006.us.archive.org:NLI-2017-YOUTUBE-PATCH-2017-12-11 from Wed Dec 13 15:33:53 PST 2017 to Wed Dec 13 07:51:45 PST 2017.
Topic: crawldata
NLA 2019 Domain Crawl
web

eye 963

favorite 0

comment 0

Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-crawl007.us.archive.org:NLA-AU-CRAWL-2019 from Thu Apr 18 13:05:33 PDT 2019 to Thu Apr 18 10:25:21 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Tue Oct 8 19:26:36 PDT 2019 to Tue Oct 8 12:57:55 PDT 2019.
Topic: crawldata
BNL 2017 Winter Domain Crawl
by Internet Archive
web

eye 16

favorite 0

comment 0

Internet Archive crawldata from the National Library of LUXEMBOURG, captured by wbgrp-crawl008.us.archive.org:LUX-004-PATCH-2017-12-22 from to Mon Feb 5 14:59:49 PST 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Thu Dec 3 09:18:42 PST 2020 to Thu Dec 3 05:18:25 PST 2020.
Topic: crawldata
Internet Archive crawldata from NARA End of Term Congressional Harvest of the 112th Congressional Session Test Crawl, captured by wbgrp-crawl025.us.archive.org:congress112th-test from Sat Nov 17 06:36:40 PST 2012 to Sat Nov 17 06:35:24 PST 2012.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress115th from Thu Oct 11 01:43:51 PDT 2018 to Wed Oct 10 22:18:49 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Thu Oct 4 07:12:11 PDT 2018 to Thu Oct 4 00:44:53 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Tue Oct 2 23:29:48 PDT 2018 to Tue Oct 2 16:44:44 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Sat Dec 12 21:18:37 PST 2020 to Sat Dec 12 17:18:28 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Thu Dec 24 08:54:07 PST 2020 to Thu Dec 24 04:52:50 PST 2020.
Topic: crawldata
Olympics Crawl 2014
web

eye 574

favorite 0

comment 0

Internet Archive crawldata from the Olympics 2014 crawl, captured by wbgrp-crawl013.us.archive.org:olympics2014 from Sun Jan 26 03:48:33 PST 2014 to Sat Jan 25 20:20:46 PST 2014.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Tue Oct 8 06:49:23 PDT 2019 to Tue Oct 8 00:19:25 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc278.us.archive.org:congress116th-1 from Sun Dec 27 21:18:35 PST 2020 to Sun Dec 27 17:18:27 PST 2020.
Topic: crawldata
BNL 2017 Summer Domain Crawl
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of LUXEMBOURG, captured by wbgrp-crawl206.us.archive.org:LUX-003-2017-07-24-PATCH from Wed Aug 2 08:53:36 PDT 2017 to Wed Aug 2 09:08:56 PDT 2017.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Tue Jan 13 16:11:57 PST 2015 to Tue Jan 13 08:29:04 PST 2015.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Wed Dec 30 04:54:26 PST 2020 to Wed Dec 30 00:52:51 PST 2020.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Tue Jan 19 17:52:35 PST 2021 to Tue Jan 19 09:57:34 PST 2021.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Sat Dec 19 09:05:15 PST 2020 to Sat Dec 19 04:52:16 PST 2020.
Topic: crawldata
Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-svc231.us.archive.org:NLA-AU-CRAWL-2019-PATCH from Wed May 1 09:53:45 PDT 2019 to Fri May 3 08:18:09 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Tue Dec 30 04:55:27 PST 2014 to Mon Dec 29 21:18:47 PST 2014.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Tue Dec 30 03:50:22 PST 2014 to Mon Dec 29 20:20:46 PST 2014.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Test Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th-test from Wed Sep 24 00:04:03 PDT 2014 to Tue Sep 23 17:28:28 PDT 2014.
Topic: crawldata
Internet Archive crawldata from the NARA 113th Congressional Test Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th-test from Sun Sep 21 19:00:43 PDT 2014 to Sun Sep 21 12:20:44 PDT 2014.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Sun Sep 30 08:47:43 PDT 2018 to Sun Sep 30 02:34:28 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Fri Sep 28 18:58:51 PDT 2018 to Fri Sep 28 12:55:18 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Mon Oct 1 07:03:06 PDT 2018 to Mon Oct 1 00:43:05 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Mon Oct 1 06:28:49 PDT 2018 to Mon Oct 1 00:12:13 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Mon Oct 1 13:58:45 PDT 2018 to Mon Oct 1 07:30:30 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Tue Oct 2 07:47:04 PDT 2018 to Tue Oct 2 01:46:03 PDT 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Tue Jan 1 08:45:48 PST 2019 to Tue Jan 1 02:09:25 PST 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Mon Dec 31 09:40:07 PST 2018 to Mon Dec 31 02:26:45 PST 2018.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-crawl009.us.archive.org:congress116th-1 from Mon Oct 5 16:28:12 PDT 2020 to Mon Oct 5 12:33:01 PDT 2020.
Topic: crawldata