1000 Genomes gVCF mapped to hs37d5 for HG02095. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02095/7927067/1
1000 Genomes gVCF mapped to hs37d5 for NA19359. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19359/7928666/1
1000 Genomes gVCF mapped to hs37d5 for NA19795. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19795/7932278/1
1000 Genomes gVCF mapped to hs37d5 for NA21144. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA21144/7951910/1
1000 Genomes gVCF mapped to hs37d5 for HG01256. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01256/7885169/1
1000 Genomes gVCF mapped to hs37d5 for HG01670. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01670/7897691/1
1000 Genomes gVCF mapped to hs37d5 for HG01747. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01747/7898108/1
1000 Genomes gVCF mapped to hs37d5 for HG01797. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01797/7901240/1
1000 Genomes gVCF mapped to hs37d5 for NA18951. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18951/7892684/1
1000 Genomes gVCF mapped to hs37d5 for HG03095. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03095/7890959/1
1000 Genomes gVCF mapped to hs37d5 for HG01682. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01682/7897922/1
1000 Genomes gVCF mapped to hs37d5 for NA19463. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19463/7929641/1
1000 Genomes gVCF mapped to hs37d5 for HG02142. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02142/7927550/1
1000 Genomes gVCF mapped to hs37d5 for NA19429. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19429/7929002/1
1000 Genomes gVCF mapped to hs37d5 for HG03977. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03977/7929782/1
1000 Genomes gVCF mapped to hs37d5 for HG03926. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03926/7929389/1
1000 Genomes gVCF mapped to hs37d5 for NA20344. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20344/7936628/1
1000 Genomes gVCF mapped to hs37d5 for HG01347. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01347/7890095/1
1000 Genomes gVCF mapped to hs37d5 for HG03464. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03464/7901504/1
1000 Genomes gVCF mapped to hs37d5 for NA20282. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20282/7934999/1
1000 Genomes gVCF mapped to hs37d5 for HG03279. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03279/7897946/1
1000 Genomes gVCF mapped to hs37d5 for HG03446. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03446/7901288/1
1000 Genomes gVCF mapped to hs37d5 for HG01702. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01702/7898024/1
1000 Genomes gVCF mapped to hs37d5 for HG01948. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01948/7910672/1
1000 Genomes gVCF mapped to hs37d5 for HG03598. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03598/7908563/1
1000 Genomes gVCF mapped to hs37d5 for NA19472. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19472/7929845/1
1000 Genomes gVCF mapped to hs37d5 for HG03897. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03897/7928993/1
1000 Genomes gVCF mapped to hs37d5 for HG02070. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02070/7925396/1
1000 Genomes gVCF mapped to hs37d5 for HG02799. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02799/7940798/1
1000 Genomes gVCF mapped to hs37d5 for NA21086. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA21086/7950086/1
1000 Genomes gVCF mapped to hs37d5 for NA20289. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20289/7935014/1
1000 Genomes gVCF mapped to hs37d5 for HG02860. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02860/7884980/1
1000 Genomes gVCF mapped to hs37d5 for HG01362. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01362/7890326/1
1000 Genomes gVCF mapped to hs37d5 for NA18907. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18907/7891043/1
1000 Genomes gVCF mapped to hs37d5 for HG00530. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG00530/7879991/1
1000 Genomes gVCF mapped to hs37d5 for HG03190. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03190/7895180/1
1000 Genomes gVCF mapped to hs37d5 for HG01746. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01746/7898099/1
1000 Genomes gVCF mapped to hs37d5 for NA19737. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19737/7931006/1
1000 Genomes gVCF mapped to hs37d5 for HG02536. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02536/7931063/1
1000 Genomes gVCF mapped to hs37d5 for HG03730. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03730/7923617/1
1000 Genomes gVCF mapped to hs37d5 for NA20884. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20884/7949462/1
1000 Genomes gVCF mapped to hs37d5 for HG01171. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01171/7884881/1
1000 Genomes gVCF mapped to hs37d5 for HG02006. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02006/7914449/1
1000 Genomes gVCF mapped to hs37d5 for HG00154.
Source: https://figshare.com/articles/dataset/gVCF_HG00154/7872560/1
1000 Genomes gVCF mapped to hs37d5 for NA19663. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19663/7930124/1
1000 Genomes gVCF mapped to hs37d5 for HG03792. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03792/7927244/1
1000 Genomes gVCF mapped to hs37d5 for NA19467. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19467/7929692/1
1000 Genomes gVCF mapped to hs37d5 for HG02661. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02661/7934975/1
1000 Genomes gVCF mapped to hs37d5 for HG02793. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02793/7940702/1
1000 Genomes gVCF mapped to hs37d5 for NA11931. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA11931/7936667/1
1000 Genomes gVCF mapped to hs37d5 for HG01113. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01113/7883471/1
1000 Genomes gVCF mapped to hs37d5 for HG01204. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01204/7885034/1
1000 Genomes gVCF mapped to hs37d5 for HG00097.
Source: https://figshare.com/articles/dataset/gVCF_HG00097/7841411/1
1000 Genomes gVCF mapped to hs37d5 for NA18864. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18864/7890737/1
1000 Genomes gVCF mapped to hs37d5 for HG01624. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01624/7895774/1
1000 Genomes gVCF mapped to hs37d5 for HG03452. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03452/7901339/1
1000 Genomes gVCF mapped to hs37d5 for HG01950. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG01950/7911317/1
1000 Genomes gVCF mapped to hs37d5 for HG03558. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03558/7907651/1
1000 Genomes gVCF mapped to hs37d5 for NA19185. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19185/7927103/1
1000 Genomes gVCF mapped to hs37d5 for NA19475. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA19475/7929941/1
1000 Genomes gVCF mapped to hs37d5 for HG03812. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03812/7928066/1
1000 Genomes gVCF mapped to hs37d5 for HG02286. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02286/7928966/1
1000 Genomes gVCF mapped to hs37d5 for HG03874. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG03874/7928795/1
1000 Genomes gVCF mapped to hs37d5 for HG02561. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02561/7931462/1
1000 Genomes gVCF mapped to hs37d5 for HG02554. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_HG02554/7931255/1
1000 Genomes gVCF mapped to hs37d5 for NA18504. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA18504/7944293/1
1000 Genomes gVCF mapped to hs37d5 for NA20786. Complete collection: https://doi.org/10.6084/m9.figshare.c.4414307
Source: https://figshare.com/articles/dataset/gVCF_NA20786/7944296/1
This item contains output files from the DOI-LANDING-CRAWL-2018-06 crawl of Crossref DOI redirect landing pages: a list of the Crossref DOIs attempted, and an index of DOI, URL, and final HTTP status code.
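As a rough illustration of how an index of DOI, URL, and final HTTP status code might be summarized, the sketch below tallies status codes. The tab-separated, three-column layout and the function name `count_final_statuses` are assumptions made for this example; the item description does not state the actual file format.

```python
from collections import Counter


def count_final_statuses(lines):
    """Tally final HTTP status codes from index lines.

    ASSUMPTION: each line is "DOI<TAB>URL<TAB>status". The real index
    files may use a different layout; adjust the parsing accordingly.
    """
    counts = Counter()
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            # Skip lines that do not match the assumed three-column layout.
            continue
        status = fields[2].strip()
        counts[status] += 1
    return counts
```

Passing an open file handle (or any iterable of lines) returns a `Counter` keyed by status string, which gives a quick view of how many DOIs resolved versus failed under the assumed layout.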
This is a derivative of https://archive.org/download/ia_papers_manifest_2018-01-25, which contains JSON objects that can be inserted into a fatcat catalog.
This dump includes all tables (including OAuth authentication tables, which could be a privacy concern but not a security concern). At this time only IA staff have accounts, so the snapshot, which is intended mostly for disaster recovery, is still public.
Internet Archive Web Group, Nov 10, 2020 (data)
This item contains three .zip archives, each containing a sample corpus of about 10,000 (or more) HTML documents from the IA web archive. Each archive includes metadata (CDX or JSON) giving the original URL and timestamp of every document, plus directories containing the HTML, extracted TEI-XML, and extracted TXT for each document. Some fraction of documents failed to download or failed to extract, so there are fewer .html (and derivative) files than...
Internet Archive Web Group, Apr 11, 2019 (data)
Internet Archive Web Group, Jun 12, 2019 (data)
"Full" crawl logs (for every hit) from CORE-UPSTREAM-CRAWL-2018-11 crawl. See also 'CORE-UPSTREAM-CRAWL-2018-11-CRL' item for reports etc.
Internet Archive Web Group, Jun 20, 2019 (data)
Snapshot of Internet Archive (petabox) file-level metadata (eg, PDF hashes) for files under the 'journals' collection as of December 2018. Note: includes a small number of items not actually under the 'journals' collection hierarchy due to how the input item list was generated, and a small fraction (estimate 500?) of items didn't dump successfully. A bit sloppy!
About 1 million unique PDFs from Global Wayback before year 2000.
See also the crawl logs item for this crawl.
Internet Archive Web Group, Jul 20, 2020 (data)
Internet Archive Web Group, Sep 29, 2020 (data)
Internet Archive Web Group, Apr 11, 2019 (data)
Internet Archive Web Group, Jul 20, 2020 (data)
This item contains datasets of homepage URLs found by hand using search engines and bibliographic metadata (eg, ISSN and journal title). The "long-tail" batch contains about 4,600 journal lookup results, with about 3,900 successful homepage URLs found. The list of journals was created in May 2020, and the lookup work completed in June 2020. IA staff member Richard Greydanus ran this batch of lookups. All of this metadata can be considered public domain, or CC-0 (Creative Commons Zero)...
This item contains hash lists of PDF files crawled from the public web specifically to preserve the scholarly record. It does not contain hashes of *all* PDFs the archive has ever seen, only a subset. Not all of these hashes are necessarily journal articles or other research outputs, but we have reason to believe the large majority are.
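One way such a hash list could be used is to check whether a local PDF is already covered. The sketch below is only illustrative: it assumes the list is a text file with one hexadecimal SHA-1 digest per line, which the description above does not confirm, and the helper names (`sha1_of_file`, `load_hash_list`, `is_in_hash_list`) are hypothetical.

```python
import hashlib


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-1 digest of a file, reading it in chunks."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def load_hash_list(path):
    """Load a hash list into a set.

    ASSUMPTION: one hex digest per line; blank lines are ignored and
    digests are lowercased so comparison is case-insensitive.
    """
    with open(path, "r", encoding="ascii") as handle:
        return {line.strip().lower() for line in handle if line.strip()}


def is_in_hash_list(pdf_path, known_hashes):
    """Check whether the file's SHA-1 digest appears in the loaded set."""
    return sha1_of_file(pdf_path) in known_hashes
```

Loading the list into a set keeps each lookup constant-time, which matters if many files are checked against one list. If the lists turn out to use a different digest (for example MD5 or SHA-256), only `sha1_of_file` would need to change.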