Skip to main content

Custom Crawl Services

Internet Archive

Large-scale web harvests and national domain crawls performed for National Libraries, National Archives, preservation partners, research initiatives, and as part of special projects and custom crawling and research services.



rss RSS

166,198
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Published
Creator
web_domain_tests
web

eye 22

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl245.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Sat 19 Feb 2022 09:31:14 AM PST to Sat 19 Feb 2022 01:55:07 AM PST.
Topic: crawldata
web_domain_tests
web

eye 21

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl054.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Tue 22 Feb 2022 05:56:16 PM PST to Tue 22 Feb 2022 10:40:50 AM PST.
Topic: crawldata
web_domain_tests
web

eye 16

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl247.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Thu 24 Feb 2022 02:05:30 PM PST to Thu 24 Feb 2022 06:41:33 AM PST.
Topic: crawldata
web_domain_tests
web

eye 27

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl247.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Thu 24 Feb 2022 02:36:07 PM PST to Thu 24 Feb 2022 07:10:34 AM PST.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl041.us.archive.org:LUX-017-TEST-2022-06-20 from Fri 24 Jun 2022 01:11:33 PM PDT to Fri 24 Jun 2022 09:00:29 AM PDT.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Thu Apr 21 06:54:41 PDT 2022 to Thu Apr 21 02:20:32 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-03 from Wed Mar 16 15:05:34 PDT 2022 to Fri Mar 18 10:50:22 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri Apr 22 03:50:09 PDT 2022 to Thu Apr 21 23:34:59 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue Apr 26 00:00:09 PDT 2022 to Mon Apr 25 22:26:26 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Mon Apr 25 00:01:43 PDT 2022 to Sun Apr 24 22:32:05 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri Apr 22 06:37:08 PDT 2022 to Fri Apr 22 01:56:50 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri Apr 22 09:20:59 PDT 2022 to Fri Apr 22 05:13:08 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Sat Apr 23 09:21:00 PDT 2022 to Sat Apr 23 05:35:37 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Mon Apr 25 16:38:52 PDT 2022 to Mon Apr 25 17:00:09 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Sun May 1 13:06:13 PDT 2022 to Sun May 1 10:34:13 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri Apr 29 14:29:54 PDT 2022 to Fri Apr 29 11:58:14 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Mon May 2 13:01:07 PDT 2022 to Mon May 2 09:34:15 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue Apr 26 23:50:49 PDT 2022 to Tue Apr 26 22:22:52 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri Apr 29 21:22:06 PDT 2022 to Fri Apr 29 19:42:12 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue May 3 11:22:19 PDT 2022 to Tue May 3 07:37:13 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue May 3 23:59:43 PDT 2022 to Tue May 3 19:40:37 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri May 6 06:47:04 PDT 2022 to Fri May 6 01:40:52 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue May 3 21:21:46 PDT 2022 to Tue May 3 17:00:11 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Wed May 4 13:59:17 PDT 2022 to Wed May 4 09:42:24 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Wed May 4 03:58:49 PDT 2022 to Tue May 3 23:35:22 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Thu May 5 13:56:44 PDT 2022 to Thu May 5 09:06:56 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Wed May 4 16:56:20 PDT 2022 to Wed May 4 12:58:27 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Wed May 4 01:28:34 PDT 2022 to Tue May 3 21:14:46 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Fri May 6 09:21:27 PDT 2022 to Fri May 6 04:15:25 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Mon May 9 09:05:26 PDT 2022 to Mon May 9 03:40:38 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Sun May 8 23:34:16 PDT 2022 to Sun May 8 19:33:08 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue May 10 13:04:18 PDT 2022 to Tue May 10 09:48:48 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Sat May 7 02:45:28 PDT 2022 to Fri May 6 21:34:17 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Mon May 9 12:26:49 PDT 2022 to Mon May 9 09:10:34 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Wed May 11 09:21:18 PDT 2022 to Wed May 11 05:58:22 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Tue May 10 15:09:09 PDT 2022 to Tue May 10 17:39:34 PDT 2022.
Topic: crawldata
Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc279.us.archive.org:TARGETED-ARTICLE-CRAWL-2022-04 from Wed May 11 11:17:21 PDT 2022 to Wed May 11 10:13:54 PDT 2022.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl044.us.archive.org:WEWA-002-TEST-2022-06-23 from Fri 24 Jun 2022 05:44:18 PM PDT to Fri 24 Jun 2022 01:30:00 PM PDT.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl043.us.archive.org:LUX-017-TEST-2022-06-20 from Fri 24 Jun 2022 01:59:17 PM PDT to Fri 24 Jun 2022 12:29:31 PM PDT.
Topic: crawldata
BNL 2021 Winter Domain Crawl
web

eye 32

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl041.us.archive.org:LUX-015-2021-12-22 from Wed Jan 12 17:58:03 PST 2022 to Wed Jan 12 12:24:47 PST 2022.
Topic: crawldata
web_domain_tests
web

eye 34

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl041.us.archive.org:LUX-016-TEST-2022-03-21 from Fri Mar 25 16:53:13 PDT 2022 to Fri Mar 25 10:23:12 PDT 2022.
Topic: crawldata
BNL 2021 Winter Domain Crawl
web

eye 10

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl043.us.archive.org:LUX-015-2021-12-22 from Sun Jan 16 09:14:19 PST 2022 to Sun Jan 16 03:39:11 PST 2022.
Topic: crawldata
NLNZ Domain Crawl 2022
web

eye 9

favorite 0

comment 0

Internet Archive crawldata from the National Library of New Zealand, captured by wbgrp-crawl044.us.archive.org:NLNZ-NZ-CRAWL-011 from Sat Feb 12 22:01:55 PST 2022 to Sat Feb 12 15:57:29 PST 2022.
Topic: crawldata
NLNZ Domain Crawl 2022
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from the National Library of New Zealand, captured by wbgrp-crawl045.us.archive.org:NLNZ-NZ-CRAWL-011 from Tue Mar 1 07:12:11 PST 2022 to Tue Mar 1 03:01:42 PST 2022.
Topic: crawldata
web_domain_tests
web

eye 34

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl302.us.archive.org:IA-COLLIE-003-2021-11-10 from Fri 28 Jan 2022 12:31:15 AM PST to Thu 27 Jan 2022 05:09:12 PM PST.
Topic: crawldata
Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl052.us.archive.org:IA-UA-2022-04-06 from Tue 03 May 2022 07:31:34 AM PDT to Tue 03 May 2022 07:22:32 AM PDT.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl048.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Tue 22 Feb 2022 03:52:22 AM PST to Mon 21 Feb 2022 08:32:04 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl048.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Tue 22 Feb 2022 06:59:43 AM PST to Mon 21 Feb 2022 11:33:06 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl051.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Tue 22 Feb 2022 09:30:03 PM PST to Tue 22 Feb 2022 03:37:09 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl246.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 12:07:52 AM PST to Tue 22 Feb 2022 07:15:37 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl048.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 08:29:36 AM PST to Wed 23 Feb 2022 12:53:28 AM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl247.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 08:26:34 AM PST to Wed 23 Feb 2022 03:55:17 AM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl052.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 02:14:48 AM PST to Tue 22 Feb 2022 08:49:38 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl053.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 03:34:18 AM PST to Tue 22 Feb 2022 11:02:53 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl049.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 05:21:08 AM PST to Tue 22 Feb 2022 09:46:21 PM PST.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl245.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 12:24:25 PM PST to Wed 23 Feb 2022 06:57:17 AM PST.
Topic: crawldata
Worldwide Government Web
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl054.us.archive.org:IA-WGW-20220613 from Sat 25 Jun 2022 05:03:05 AM PDT to Sat 25 Jun 2022 12:19:46 AM PDT.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl045.us.archive.org:WEWA-002-TEST-2022-06-23 from Sat 25 Jun 2022 05:53:27 AM PDT to Sat 25 Jun 2022 12:41:23 AM PDT.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl043.us.archive.org:LUX-017-TEST-2022-06-20 from Sat 25 Jun 2022 12:58:10 AM PDT to Fri 24 Jun 2022 11:54:32 PM PDT.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl044.us.archive.org:WEWA-002-TEST-2022-06-23 from Sat 25 Jun 2022 11:34:32 AM PDT to Sat 25 Jun 2022 07:21:21 AM PDT.
Topic: crawldata
DOI-CRAWL-2022-02
web

eye 1,121

favorite 0

comment 0

Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc206.us.archive.org:DOI-CRAWL-2022-02 from Sat Apr 2 20:36:27 PDT 2022 to Thu Apr 7 01:35:01 PDT 2022.
Topic: crawldata
DOI-CRAWL-2022-02
web

eye 1,786

favorite 0

comment 0

Internet Archive crawldata of scholarly web landing page content captured by wbgrp-svc206.us.archive.org:DOI-CRAWL-2022-02 from Wed Mar 30 02:31:26 PDT 2022 to Sun Apr 3 01:35:14 PDT 2022.
Topic: crawldata
Worldwide Government Web
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl053.us.archive.org:IA-WGW-20220613 from Sat 25 Jun 2022 02:49:41 PM PDT to Sat 25 Jun 2022 12:03:19 PM PDT.
Topic: crawldata
Worldwide Government Web
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl052.us.archive.org:IA-WGW-20220613 from Sat 25 Jun 2022 02:15:23 PM PDT to Sat 25 Jun 2022 09:41:21 AM PDT.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl046.us.archive.org:WEWA-002-TEST-2022-06-23 from Sat 25 Jun 2022 07:04:00 AM PDT to Sat 25 Jun 2022 04:36:08 AM PDT.
Topic: crawldata
web_domain_tests
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl046.us.archive.org:WEWA-002-TEST-2022-06-23 from Sat 25 Jun 2022 03:00:46 AM PDT to Sat 25 Jun 2022 12:14:58 AM PDT.
Topic: crawldata
Worldwide Government Web
web

eye 0

favorite 0

comment 0

Internet Archive crawldata from the Internet Archive, captured by wbgrp-crawl051.us.archive.org:IA-WGW-20220613 from Fri 24 Jun 2022 07:37:52 PM PDT to Sat 25 Jun 2022 03:00:15 AM PDT.
Topic: crawldata
NLNZ Domain Crawl 2022
web

eye 8

favorite 0

comment 0

Internet Archive crawldata from the National Library of New Zealand, captured by wbgrp-crawl047.us.archive.org:NLNZ-NZ-CRAWL-011 from Fri Feb 25 04:42:11 PST 2022 to Thu Feb 24 22:28:35 PST 2022.
Topic: crawldata
BNL 2021 Winter Domain Crawl
web

eye 149

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl041.us.archive.org:LUX-015-2021-12-22 from Tue Jan 18 08:23:43 PST 2022 to Tue Jan 18 01:47:21 PST 2022.
Topic: crawldata
BNL 2021 Winter Domain Crawl
web

eye 20

favorite 0

comment 0

Internet Archive crawldata from the National Library of Luxembourg, captured by wbgrp-crawl043.us.archive.org:LUX-015-2021-12-22 from Sat Jan 15 17:29:46 PST 2022 to Sat Jan 15 11:03:57 PST 2022.
Topic: crawldata
web_domain_tests
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from the National Library of Australia, captured by wbgrp-crawl246.us.archive.org:NLA-AU-CRAWL-TEST-2022 from Wed 23 Feb 2022 07:15:56 PM PST to Wed 23 Feb 2022 11:44:01 AM PST.
Topic: crawldata