0
0.0
Jul 20, 2020
07/20
by
Internet Archive Web Group
data
See also the crawl logs item for this crawl.
2
2.0
Sep 29, 2020
09/20
by
Internet Archive Web Group
data
1.8M
1.8M
May 4, 2020
05/20
by
Internet Archive Web Group
973,161
973K
Apr 27, 2021
04/21
by
Internet Archive Web Group
1.6M
1.6M
Dec 9, 2020
12/20
by
Internet Archive Web Group
3.2M
3.2M
Sep 21, 2017
09/17
by
Internet Archive Web Group
4.2M
4.2M
Mar 5, 2020
03/20
by
Internet Archive Web Group
0
0.0
May 29, 2018
05/18
by
Internet Archive Web Group
data
This item contains a copy of log files found on the Internet Archive (Web Group) machine `wbgrp-svc263.us.archive.org` on 2018-05-29, under the `/3` directory. These are logs of file transfer status between various crawler machines; they are not known to contain any sensitive metadata (e.g., personal information, IPs, or other security-sensitive information), but are being kept `access-restricted` anyway. This data is almost certainly unimportant and could be deleted; it is being preserved out...
2
2.0
Aug 5, 2020
08/20
by
Internet Archive Web Group
data
Lists of URLs for PDFs on the web (and preserved in the Wayback Machine) that are likely to contain research materials.
32
32
Jun 20, 2019
06/19
by
Internet Archive Web Group
data
17
17
Jul 30, 2019
07/19
by
Internet Archive Web Group
See: https://guide.fatcat.wiki/reference_graph.html
License: CC-0
41
41
Jun 12, 2019
06/19
by
Internet Archive Web Group
data
This item contains bulk metadata exported from https://fatcat.wiki. With the exception of the 'abstracts' file (for which no aggregate license or copyright claims can be made; downstream users are responsible for their use), all metadata here is licensed CC-0 (public domain release) and may be used for any purpose. Downstream users are strongly encouraged to provide attribution and link here to the snapshot, as well as give credit to upstream sources (including Crossref, ORCID, DOAJ, the ISSN...
5.9M
5.9M
Apr 26, 2019
04/19
by
Internet Archive Web Group
2M
2.0M
Mar 5, 2020
03/20
by
Internet Archive Web Group
1.6M
1.6M
Feb 6, 2020
02/20
by
Internet Archive Web Group
79,989
80K
Oct 12, 2019
10/19
by
Internet Archive Web Group
959,382
959K
Nov 24, 2020
11/20
by
Internet Archive Web Group
3.4M
3.4M
Jun 1, 2018
06/18
by
Internet Archive Web Group
1.9M
1.9M
Oct 31, 2018
10/18
by
Internet Archive Web Group
Crawl of "upstream" URLs from the CORE (core.ac.uk) metadata dump. Only part of the seedlist of files was crawled.
0
0.0
Aug 5, 2020
08/20
by
Internet Archive Web Group
data
178
178
Jun 12, 2019
06/19
by
Internet Archive Web Group
5.6M
5.6M
Feb 15, 2019
02/19
by
Internet Archive Web Group
This item contains a complete PostgreSQL SQL database snapshot from https://fatcat.wiki, in binary 'pg_dump tar mode' format. With the exception of the 'abstracts' table (for which no aggregate license or copyright claims can be made; downstream users are responsible for their use), all metadata here is licensed CC-0 (public domain release) and may be used for any purpose. Downstream users are strongly encouraged to provide attribution and link here to the snapshot, as well as give credit to...
This is a mapping between:
- DOIs (Crossref)
- PubMed PMID and PMCID (NIH)
- CORE record identifier (core.ac.uk)
- Wikidata QIDs
See README and scripts for details.
262,794
263K
Feb 5, 2020
02/20
by
Internet Archive Web Group
1,409
1.4K
Sep 9, 2019
09/19
by
Internet Archive Web Group
This collection holds database snapshots (SQL) and bulk metadata exports (JSON and TSV) from https://fatcat.wiki (an Internet Archive service).
This item contains an example corpus of citations between scholarly documents, as extracted from the fatcat (https://fatcat.wiki) corpus as of the 2020-08-05 bulk release export. This corpus itself was generated from a fatcat-scholar "intermediate" fulltext dump which is not public, using software in the fatcat-scholar repository in mid-September 2020. See also the README for some more notes, and the "sample" file.
This item contains a complete PostgreSQL SQL database snapshot from https://fatcat.wiki, in binary 'pg_dump tar mode' format. With the exception of the 'abstracts' table (for which no aggregate license or copyright claims can be made; downstream users are responsible for their use), all metadata here is licensed CC-0 (public domain release) and may be used for any purpose. Downstream users are strongly encouraged to provide attribution and link here to the snapshot, as well as give credit to...