Skip to main content
SHOW DETAILS
eye
Title
Date Archived
Creator
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02095. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02095/7927067/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 2

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19359. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19359/7928666/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19795. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19795/7932278/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA21144. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA21144/7951910/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01256. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01256/7885169/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01670. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01670/7897691/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01747. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01747/7898108/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01797. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01797/7901240/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA18951. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18951/7892684/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03095. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03095/7890959/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01682. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01682/7897922/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 2

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19463. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19463/7929641/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02142. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02142/7927550/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19429. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19429/7929002/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03977. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03977/7929782/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 2

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03926. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03926/7929389/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA20344. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20344/7936628/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01347. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01347/7890095/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03464. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03464/7901504/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA20282. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20282/7934999/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03279. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03279/7897946/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03446. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03446/7901288/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01702. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01702/7898024/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01948. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01948/7910672/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03598. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03598/7908563/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19472. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19472/7929845/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03897. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03897/7928993/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02070. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02070/7925396/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02799. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02799/7940798/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA21086. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA21086/7950086/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA20289. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20289/7935014/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02860. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02860/7884980/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01362. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01362/7890326/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA18907. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18907/7891043/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG00530. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG00530/7879991/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03190. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03190/7895180/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01746. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01746/7898099/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19737. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19737/7931006/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02536. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02536/7931063/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03730. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03730/7923617/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA20884. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20884/7949462/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01171. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01171/7884881/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02006. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02006/7914449/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG00154.
Source: https://figshare.com/articles/dataset/gVCF_HG00154/7872560/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19663. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19663/7930124/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03792. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03792/7927244/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19467. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19467/7929692/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02661. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02661/7934975/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02793. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02793/7940702/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA11931. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA11931/7936667/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01113. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01113/7883471/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01204. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01204/7885034/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG00097.
Source: https://figshare.com/articles/dataset/gVCF_HG00097/7841411/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA18864. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18864/7890737/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01624. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01624/7895774/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03452. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03452/7901339/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG01950. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01950/7911317/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03558. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03558/7907651/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19185. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19185/7927103/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA19475. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19475/7929941/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03812. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03812/7928066/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02286. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02286/7928966/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG03874. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03874/7928795/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02561. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02561/7931462/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for HG02554. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02554/7931255/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA18504. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18504/7944293/1
Academic Data and Datasets
by Jonathan Pevsner
data

eye 1

favorite 0

comment 0

1000 Genomes gVCF mapped to hs37d5 for NA20786. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20786/7944296/1
DOI-LANDING-CRAWL-2018-06
by Internet Archive Web Group
data

eye 9

favorite 0

comment 0

This item contains output files related to the DOI-LANDING-CRAWL-2018-06 crawl of Crossref DOI redirect landing pages: - list of Crossref DOI numbers attempted - an index of DOI, URL, and final HTTP status codes
Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 14

favorite 0

comment 0

This is a derivative of https://archive.org/download/ia_papers_manifest_2018-01-25, which contains JSON objects that can be inserted into a fatcat catalog.
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 13

favorite 0

comment 0

See README.md
Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 28

favorite 0

comment 0

This dump includes all tables (including oauth authentication tables which could be a privacy, but not security, concern). At this time only IA staff have accounts, so the snapshot, which is intended mostly for disaster recovery, is still public.
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 83

favorite 0

comment 0

Open Access Journal Test Crawl (2018)
by Internet Archive Web Group
data

eye 8

favorite 0

comment 0

Web PDF Training Sets
by Internet Archive Web Group
data

eye 15

favorite 0

comment 0

This item contains three .zip archives, each containing a sample corpus of about 10,000 (or more) HTML documents from the IA web archive. For each, there is some form of metadata (CDX or JSON) with information about the original URL and timestamp for each document, and then directories containing HTML, extracted TEI-XML, and extracted TXT for each document. There are some fraction of documents which failed to download or failed to extract, so there are fewer .html (and derivative) files than...
DIRECT-OA-CRAWL-2019
by Internet Archive Web Group
data

eye 4

favorite 0

comment 0

Web PDF Training Sets
by Internet Archive Web Group
data

eye 28

favorite 0

comment 0

CORE-UPSTREAM-CRAWL-2018-11
by Internet Archive Web Group
data

eye 3

favorite 0

comment 0

"Full" crawl logs (for every hit) from CORE-UPSTREAM-CRAWL-2018-11 crawl. See also 'CORE-UPSTREAM-CRAWL-2018-11-CRL' item for reports etc.
Web PDF Training Sets
by Internet Archive Web Group
data

eye 21

favorite 0

comment 0

Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 8

favorite 0

comment 0

Snapshot of Internet Archive (petabox) file-level metadata (eg, PDF hashes) for files under the 'journals' collection as of December 2018. Note: includes a small number of items not actually under the 'journals' collection hierarchy due to how the input item list was generated, and a small fraction (estimate 500?) of items didn't dump successfully. A bit sloppy!
Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 7

favorite 0

comment 0

About 1 million unique PDFs from Global Wayback before year 2000.
Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 6

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2018-07
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

See also the crawl logs item for this crawl.
MAG-PDF-CRAWL-2020-07
by Internet Archive Web Group
data

eye 0

favorite 0

comment 0

OA-JOURNAL-CRAWL-2020-07
by Internet Archive Web Group
data

eye 2

favorite 0

comment 0

OMICS-DOI-LANDING-CRAWL-2019-04
by Internet Archive Web Group
data

eye 4

favorite 0

comment 0

SEMSCHOLAR-DIRECT-PDF-CRAWL-2020-02
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 6

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 32

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 11

favorite 0

comment 0

DIRECT-OA-CRAWL-2019
by Internet Archive Web Group
data

eye 5

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2020-11
by Internet Archive Web Group
data

eye 0

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2018-07
by Internet Archive Web Group
data

eye 12

favorite 0

comment 0

MAG-PDF-CRAWL-2020-07
by Internet Archive Web Group
data

eye 0

favorite 0

comment 0

Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 10

favorite 0

comment 0

This item contains datasets of homepage URLs found by hand using search engines and bibliographic metadata (eg, ISSN and journal title). The "long-tail" batch contains about 4,600 journal lookup results, with about 3,900 successful homepage URLs found. The list of journals was created in May 2020, and the lookup work completed in June 2020. IA staff member Richard Greydanus ran this batch of lookups. All of this metadata can be considered public domain, or CC-0 (Creative Commons Zero)...
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 9

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 27

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 12

favorite 0

comment 0

See README.md
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 12

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 0

favorite 0

comment 0

Bulk Bibliographic Metadata
by Internet Archive Web Group
data

eye 7

favorite 0

comment 0

This item contains hash lists of PDF files crawled from the public web specifically to preserve the scholarly record. It does not contain hashes of *all* PDFs the archive has ever seen, only a subset. Not all of these hashes are necessarily journal articles or other research outputs, but we have reason to believe the large majority are.