Skip to main content

Custom Crawl Services

Internet Archive

Large-scale web harvests and national domain crawls performed for National Libraries, National Archives, preservation partners, research initiatives, and as part of special projects and custom crawling and research services.



rss RSS

166,816
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Reviewed
Creator
NLIL_2013
May 17, 2020 Internet Archive
web

eye 468,382

favorite 0

comment 1

Internet Archive crawldata from National Library of Israel, captured by wbgrp-crawl010.us.archive.org:NLIL-CRAWL-01 from Mon Nov 4 04:49:09 PST 2013 to Mon Nov 4 06:58:51 PST 2013.
( 1 reviews )
Topic: crawldata
web_locrl
data

eye 124

favorite 0

comment 0

web_locrl
data

eye 110

favorite 0

comment 0

web_locrl
data

eye 48

favorite 0

comment 0

Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Sat Jan 10 19:31:15 PST 2015 to Sat Jan 10 11:43:20 PST 2015.
Topic: crawldata
NARA 116th Congressional Crawl
data

eye 0

favorite 0

comment 0

Configuration, Reports, and Logs for nara congress116th-0 crawl.
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Fri Dec 18 08:14:42 PST 2020 to Fri Dec 18 03:44:21 PST 2020.
Topic: crawldata
National Library of Australia Crawl
data

eye 0

favorite 0

comment 0

Configuration, Reports, and Logs for NLA-AU-CRAWL 2012 crawl.
web_locrl
data

eye 3

favorite 0

comment 0

Topics: crawl, logs
web_locrl
data

eye 69

favorite 0

comment 0

Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Tue Oct 8 22:02:38 PDT 2019 to Tue Oct 8 15:30:39 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Fri Dec 25 00:57:45 PST 2020 to Thu Dec 24 20:52:50 PST 2020.
Topic: crawldata
web_locrl
data

eye 0

favorite 0

comment 0

web_locrl
data

eye 125

favorite 0

comment 0

web_locrl
data

eye 13

favorite 0

comment 0

Internet Archive crawldata from the Olympics 2014 crawl, captured by wbgrp-crawl013.us.archive.org:olympics2014 from Sun Jan 26 03:27:09 PST 2014 to Sat Jan 25 19:42:35 PST 2014.
Topic: crawldata
web_locrl
data

eye 194

favorite 0

comment 0

web_locrl
data

eye 0

favorite 0

comment 0

Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Tue Oct 8 08:15:14 PDT 2019 to Tue Oct 8 01:48:42 PDT 2019.
Topic: crawldata
Internet Archive crawldata from the Olympics 2014 crawl, captured by wbgrp-crawl013.us.archive.org:olympics2014 from Sun Jan 19 08:44:48 PST 2014 to Sun Jan 19 01:01:32 PST 2014.
Topic: crawldata
web_locrl
data

eye 0

favorite 0

comment 0

web_locrl
data

eye 3

favorite 0

comment 0

web_locrl
data

eye 10

favorite 0

comment 0

web_locrl
data

eye 0

favorite 0

comment 0

web_locrl
data

eye 56

favorite 0

comment 0

web_locrl
data

eye 157

favorite 0

comment 0

Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Wed Oct 9 10:07:57 PDT 2019 to Wed Oct 9 04:01:27 PDT 2019.
Topic: crawldata
web_locrl
data

eye 0

favorite 0

comment 0

Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Tue Jan 6 17:25:39 PST 2015 to Tue Jan 6 09:44:30 PST 2015.
Topic: crawldata
web_locrl
data

eye 23

favorite 0

comment 0

Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Tue Oct 8 13:28:52 PDT 2019 to Tue Oct 8 06:54:08 PDT 2019.
Topic: crawldata
web_locrl
data

eye 105

favorite 0

comment 0

web_locrl
data

eye 63

favorite 0

comment 0

Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-crawl002.us.archive.org:NLA-AU-CRAWL-2019 from Mon Apr 8 15:30:45 PDT 2019 to Mon Apr 8 06:10:43 PDT 2019.
Topic: crawldata
Internet Archive crawldata from National Library of Australia 2019 domain crawl, captured by wbgrp-crawl003.us.archive.org:NLA-AU-CRAWL-2019 from Sun Apr 7 01:19:46 PDT 2019 to Sat Apr 6 19:29:37 PDT 2019.
Topic: crawldata
web_locrl
data

eye 45

favorite 0

comment 0

Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-svc276.us.archive.org:congress115th from Fri Oct 12 09:17:20 PDT 2018 to Fri Oct 12 14:16:22 PDT 2018.
Topic: crawldata
web_locrl
data

eye 189

favorite 0

comment 0

Internet Archive crawldata from the National Library of IRELAND, captured by wbgrp-crawl006.us.archive.org:NLI-2017-YOUTUBE-PATCH-2017-12-11 from Wed Dec 13 17:05:22 PST 2017 to Wed Dec 13 09:25:20 PST 2017.
Topic: crawldata
Internet Archive crawldata from the NARA 114th Congressional Crawl, captured by wbgrp-crawl204.us.archive.org:congress114th from Mon Jan 23 00:19:01 PST 2017 to Sun Jan 22 22:25:23 PST 2017.
Topic: crawldata
web_locrl
data

eye 19

favorite 0

comment 0

NLA 2020 Domain Crawl
data

eye 0

favorite 0

comment 0

Configuration, Reports, and Logs for NLA-AU-CRAWL-2020 crawl.
Internet Archive crawldata from the NARA 116th Congressional Crawl, captured by wbgrp-svc277.us.archive.org:congress116th-1 from Sat Dec 19 00:50:16 PST 2020 to Fri Dec 18 20:28:38 PST 2020.
Topic: crawldata
web_locrl
data

eye 0

favorite 0

comment 0

web_locrl
data

eye 141

favorite 0

comment 0

Internet Archive crawldata from the NARA 113th Congressional Crawl, captured by wbgrp-crawl013.us.archive.org:congress113th from Tue Jan 6 10:55:39 PST 2015 to Tue Jan 6 03:15:15 PST 2015.
Topic: crawldata
web_locrl
data

eye 3

favorite 0

comment 0

web_locrl
data

eye 32

favorite 0

comment 0

web_locrl
data

eye 241

favorite 0

comment 0

NLNZ Domain Crawl 2021
data

eye 0

favorite 0

comment 0

"Language analysis for NLNZ-NZ-CRAWL-010 crawl"
web_locrl
data

eye 0

favorite 0

comment 0

Olympics Crawl 2014
web

eye 247

favorite 0

comment 0

Internet Archive crawldata from the Olympics 2014 crawl, captured by wbgrp-crawl013.us.archive.org:olympics2014 from Sun Jan 26 18:34:16 PST 2014 to Sun Jan 26 11:15:35 PST 2014.
Topic: crawldata
web_locrl
data

eye 77

favorite 0

comment 0

NLNZ Domain Crawl 2018
data

eye 1

favorite 0

comment 0

Configuration, Reports, and Logs for NLNZ-NZ-CRAWL-007 crawl.
Internet Archive crawldata from the NARA 115th Congressional Crawl, captured by wbgrp-crawl004.us.archive.org:congress115th from Wed Oct 9 09:31:11 PDT 2019 to Wed Oct 9 03:18:11 PDT 2019.
Topic: crawldata
web_locrl
data

eye 78

favorite 0

comment 0

web_locrl
data

eye 155

favorite 0

comment 0