Title
Date Archived
Creator
Fatcat Database Snapshots and Bulk Metadata Exports
Nov 26, 2022 Internet Archive Web Group
data

eye 0

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Nov 18, 2022 Internet Archive Web Group
data

eye 0

favorite 0

comment 0

Bulk Bibliographic Metadata
Oct 24, 2022
data

eye 3

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Aug 1, 2022 Internet Archive Web Group
data

eye 33

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 30, 2022
data

eye 7

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 19, 2022
data

eye 8

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Jul 15, 2022 Internet Archive Web Group
data

eye 12

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 13, 2022 dblp
data

eye 5

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 13, 2022 dblp
data

eye 7

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 13, 2022 Japan Link Center
data

eye 14

favorite 0

comment 0

Downloaded from http://japanlinkcenter.org/top/material/material_metadata.html
Bulk Bibliographic Metadata
Jul 13, 2022 ORCID, Inc.
data

eye 15

favorite 0

comment 0

This item contains an annual copy of the ORCID public data file, as originally downloaded from: https://orcid.figshare.com/articles/dataset/ORCID_Public_Data_File_2021/16750535 See also: https://info.orcid.org/orcids-2021-public-data-file-is-now-available More details about this content and its use are available at: https://orcid.org/content/orcid-public-data-file This dataset is available under the public domain (CC-0).
Bulk Bibliographic Metadata
Jul 6, 2022
data

eye 2

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 6, 2022
data

eye 7

favorite 0

comment 0

Bulk Bibliographic Metadata
Apr 28, 2022 Internet Archive Web Group
data

eye 19

favorite 1

comment 0

URL lists to PDFs on the web (and preserved in the wayback machine) which are likely to contain research materials.
Fatcat Database Snapshots and Bulk Metadata Exports
Apr 27, 2022 Internet Archive Web Group
data

eye 5

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Apr 22, 2022 Internet Archive Web Group
data

eye 40

favorite 1

comment 0

Bulk Bibliographic Metadata
Apr 18, 2022 creator
data

eye 33

favorite 0

comment 0

Bulk Bibliographic Metadata
Mar 9, 2022
data

eye 14

favorite 0

comment 0

Bulk Bibliographic Metadata
Mar 8, 2022
data

eye 21

favorite 1

comment 0

Bulk Bibliographic Metadata
data

eye 11

favorite 1

comment 0

Bulk Bibliographic Metadata
data

eye 7

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Feb 23, 2022 Internet Archive Web Group
data

eye 28

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Feb 22, 2022 Internet Archive Web Group
data

eye 74

favorite 0

comment 0

Bulk Bibliographic Metadata
Feb 8, 2022 Internet Archive
data

eye 8

favorite 0

comment 0

This item contains KBART files of Internet Archive "serials" (aka, journals, magazines, conference proceedings, other periodicals) preservation holdings. They include both digitized content in archive.org, and web archived content ("fatcat").
Bulk Bibliographic Metadata
Jan 6, 2022
data

eye 167

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 20

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Dec 4, 2021 Internet Archive Web Group
data

eye 46

favorite 0

comment 0

Bulk Bibliographic Metadata
Dec 1, 2021
data

eye 10

favorite 0

comment 0

Bulk Bibliographic Metadata
Dec 1, 2021
data

eye 8

favorite 0

comment 0

Bulk Bibliographic Metadata
Nov 24, 2021 OurResearch
data

eye 147

favorite 0

comment 1

This is an archive of the "beta" pre-release of the OpenAlex bibliographic metadata corpus. It was downloaded from AWS S3 "requester pays" bucket, then the individual files were compressed with gzip (pigz command), which reduced on-disk size significantly. Downloads of some files needed to be restarted, which seems to have worked ok, but potentially could have introduced corruption. This initial snapshot is dated in file names as "2021-10-11", and that date is used...
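Since the description above notes that restarted downloads could have introduced corruption, a reader may want to check each gzip file before using it. The following is a minimal sketch, not the archivists' own procedure: it assumes the files are ordinary single-stream gzip files (as pigz produces by default), and the function name `gzip_is_intact` is hypothetical.

```python
import gzip
import zlib


def gzip_is_intact(path, chunk_size=1 << 20):
    """Return True if the gzip file at `path` decompresses fully without error.

    Reading to the end of the stream makes the gzip module verify the
    CRC32 and length trailer, so truncated or damaged files are detected.
    """
    try:
        with gzip.open(path, "rb") as handle:
            # Discard the data; only the success of decompression matters here.
            while handle.read(chunk_size):
                pass
    except (OSError, EOFError, zlib.error):
        # OSError covers bad headers and CRC mismatches (gzip.BadGzipFile),
        # EOFError covers truncated files, zlib.error covers damaged data.
        return False
    return True
```

Calling `gzip_is_intact("some_file.gz")` on each downloaded file (the filename here is only a placeholder) gives a quick pass/fail check; it does not confirm that the content matches the upstream original, only that the compressed stream is internally consistent.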
(1 review)
Fatcat Database Snapshots and Bulk Metadata Exports
Nov 9, 2021
data

eye 27

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Oct 11, 2021
data

eye 31

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Oct 11, 2021
data

eye 40

favorite 0

comment 0

Bulk Bibliographic Metadata
Sep 9, 2021 Internet Archive Web Group
data

eye 12

favorite 1

comment 0

URL lists to PDFs on the web (and preserved in the wayback machine) which are likely to contain research materials.
Bulk Bibliographic Metadata
Aug 20, 2021 Wikipedia Editors
data

eye 29

favorite 0

comment 0

This is a corpus of millions of citations from Wikipedia articles, for a subset of language wikis, created using the wikiciteparser Python library. 
Fatcat Database Snapshots and Bulk Metadata Exports
Aug 7, 2021 Internet Archive Web Group
data

eye 201

favorite 3

comment 0

See: https://guide.fatcat.wiki/reference_graph.html License: CC-0
Bulk Bibliographic Metadata
Jul 27, 2021 Microsoft Academic
data

eye 1,014

favorite 2

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Bulk Bibliographic Metadata
Jul 7, 2021 Impactstory
data

eye 27

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Fatcat Database Snapshots and Bulk Metadata Exports
Jun 7, 2021 Internet Archive Web Group
data

eye 50

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Jun 6, 2021 Internet Archive Web Group
data

eye 80

favorite 0

comment 0

Bulk Bibliographic Metadata
May 29, 2021
data

eye 10

favorite 0

comment 0

Bulk Bibliographic Metadata
May 29, 2021
data

eye 21

favorite 0

comment 0

Bulk Bibliographic Metadata
May 29, 2021 DOAJ
data

eye 9

favorite 0

comment 0

Bulk Bibliographic Metadata
May 29, 2021 dblp
data

eye 23

favorite 0

comment 0

Bulk Bibliographic Metadata
May 27, 2021 Crossref
data

eye 2,899

favorite 1

comment 0

Mirrored via torrent from academic torrents: https://academictorrents.com/details/0c6c3fbfdc13f0169b561d29354ea8b188eb9d63
Bulk Bibliographic Metadata
May 27, 2021 Microsoft Academic
data

eye 1,021

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Bulk Bibliographic Metadata
May 24, 2021 CORE.ac.uk
data

eye 76

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset CORE Dataset to Microsoft Academic Graph (MAG) mapping (80MB compressed, 173 MB in total) - 8.9M items License: Open Data Commons Attribution (ODC-By) license.
Bulk Bibliographic Metadata
May 24, 2021 CORE.ac.uk
data

eye 20

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset Dataset created for Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings (LREC 2020) (62 MB compressed, 204 MB in total) License: Open Data Commons Attribution (ODC-By) license.
Bulk Bibliographic Metadata
Apr 23, 2021
data

eye 19

favorite 0

comment 0

Bulk Bibliographic Metadata
Apr 23, 2021
data

eye 7

favorite 0

comment 0

Bulk Bibliographic Metadata
Apr 22, 2021 Japan Link Center
data

eye 28

favorite 0

comment 0

Downloaded from http://japanlinkcenter.org/top/material/material_metadata.html
Fatcat Database Snapshots and Bulk Metadata Exports
Mar 11, 2021 Internet Archive Web Group
data

eye 56

favorite 1

comment 0

Bulk Bibliographic Metadata
Mar 2, 2021 Harshdeep Singh, Robert West, & Giovanni Colavizza
data

eye 12

favorite 0

comment 0

Mirrored from: https://zenodo.org/record/3940692 Harshdeep Singh, Robert West, & Giovanni Colavizza. (2020). Wikipedia Citations: A comprehensive dataset of citations with identifiers extracted from English Wikipedia (Version 0.2) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3940692
Bulk Bibliographic Metadata
Feb 23, 2021 Impactstory
data

eye 52

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Fatcat Database Snapshots and Bulk Metadata Exports
Feb 11, 2021 Bryan Newbold
data

eye 29

favorite 0

comment 0

This item contains compiled binaries and packages (for apt and homebrew) for the fatcat-cli utility. Source code available at: https://gitlab.com/bnewbold/fatcat-cli
Fatcat Database Snapshots and Bulk Metadata Exports
Dec 30, 2020 Internet Archive Web Group
data

eye 29

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Dec 30, 2020 Internet Archive Web Group
data

eye 17

favorite 0

comment 0

Bulk Bibliographic Metadata
Dec 18, 2020 dblp
data

eye 22

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Dec 8, 2020 Internet Archive Web Group
data

eye 17

favorite 0

comment 0

Bulk Bibliographic Metadata
Dec 7, 2020
data

eye 11

favorite 0

comment 0

Bulk Bibliographic Metadata
Dec 7, 2020
data

eye 40

favorite 0

comment 0

Bulk Bibliographic Metadata
Dec 1, 2020 ORCID, Inc.
data

eye 168

favorite 0

comment 0

This item contains an annual copy of the ORCID public data file, as originally downloaded from: https://orcid.figshare.com/articles/dataset/ORCID_Public_Data_File_2020/13066970 More details about this content and its use are available at: https://orcid.org/content/orcid-public-data-file This dataset is available under the public domain (CC-0).
Bulk Bibliographic Metadata
Nov 17, 2020 DOAJ
data

eye 41

favorite 1

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Oct 11, 2020 Internet Archive Web Group
data

eye 12

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Oct 10, 2020 Internet Archive Web Group
data

eye 28

favorite 0

comment 0

Bulk Bibliographic Metadata
Oct 9, 2020 Impactstory
data

eye 7

favorite 0

comment 0

Bulk Bibliographic Metadata
Oct 9, 2020
data

eye 7

favorite 0

comment 0

Bulk Bibliographic Metadata
Oct 8, 2020 Cariniana
data

eye 21

favorite 0

comment 0

Downloaded from, e.g.: https://cariniana.ibict.br/index.php/preservacao-de-publicacoes-digitais/periodicos-eletronicos
Bulk Bibliographic Metadata
Oct 6, 2020 JURN
data

eye 13

favorite 0

comment 0

JURN is a scholarly web search engine implemented as a custom Google search index. A subset of its resources is included in a directory at: http://www.jurn.org/directory/ This item contains snapshots of the directory in the form of TSV files. At least to start, these contain only title and URL, but we hope to reconcile the entries with, or look up, ISSNs.
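For readers who want to load such a snapshot, here is a minimal sketch of reading a title-and-URL TSV file. It assumes two tab-separated columns (title, then URL) with no header row and no quoting; that layout is an assumption based on the description above, not a documented format, and the function name `read_directory_tsv` is hypothetical.

```python
import csv
import io


def read_directory_tsv(stream):
    """Yield (title, url) pairs from a tab-separated text stream.

    Rows with fewer than two columns, or with an empty title or URL,
    are skipped rather than raising an error.
    """
    reader = csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 2:
            continue
        title, url = row[0].strip(), row[1].strip()
        if title and url:
            yield title, url


# Illustrative in-memory input; a real snapshot would be opened with
# open(path, newline="", encoding="utf-8") instead.
sample = io.StringIO("Example Journal\thttp://example.org/journal\nbroken line\n")
for title, url in read_directory_tsv(sample):
    print(title, url)
```

Skipping malformed rows keeps the reader tolerant of hand-maintained data; a stricter variant could collect the rejected rows for later review instead.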
Fatcat Database Snapshots and Bulk Metadata Exports
Sep 29, 2020 Internet Archive Web Group
data

eye 58

favorite 0

comment 0

This item contains an example corpus of citations between scholarly documents, as extracted from the fatcat (https://fatcat.wiki) corpus as of the 2020-08-05 bulk release export. This corpus itself was generated from a fatcat-scholar "intermediate" fulltext dump which is not public, using software in the fatcat-scholar repository in mid-September 2020. See also the README for some more notes, and the "sample" file.
Bulk Bibliographic Metadata
Sep 4, 2020 Allen Institute for Artificial Intelligence
data

eye 185

favorite 0

comment 0

Semantic Scholar Open Research Corpus is licensed under ODC-BY. When using the Semantic Scholar Open Research Corpus (“S2 ORC”) in a product or service, or including data in a redistribution, please cite the following paper: Waleed Ammar et al. 2018. Construction of the Literature Graph in Semantic Scholar. NAACL https://www.semanticscholar.org/paper/09e3cf5704bcb16e6657f6ceed70e93373a54618 This site is provided by The Allen Institute for Artificial Intelligence (“AI2”) as a service...
Bulk Bibliographic Metadata
Sep 3, 2020
data

eye 17

favorite 0

comment 0

Bulk Bibliographic Metadata
Sep 2, 2020
data

eye 9

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 8

favorite 0

comment 0

Bulk Bibliographic Metadata
Aug 8, 2020 dblp
data

eye 27

favorite 0

comment 0

Bulk Bibliographic Metadata
Aug 8, 2020 DOAJ
data

eye 11

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Aug 7, 2020 Internet Archive Web Group
data

eye 34

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Aug 7, 2020 Internet Archive Web Group
data

eye 26

favorite 0

comment 0

Bulk Bibliographic Metadata
Aug 4, 2020
data

eye 30

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 31, 2020
data

eye 8

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Jul 23, 2020 Internet Archive Web Group
data

eye 10

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 9, 2020
data

eye 13

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 9, 2020
data

eye 11

favorite 0

comment 0

Bulk Bibliographic Metadata
Jul 6, 2020 Microsoft Academic
data

eye 64

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Bulk Bibliographic Metadata
Jun 24, 2020
data

eye 22

favorite 0

comment 0

Bulk Bibliographic Metadata
Jun 24, 2020
data

eye 6

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 6

favorite 0

comment 0

Mirrored from:  https://www.arc.gov.au/excellence-research-australia/era-2018-journal-list
Bulk Bibliographic Metadata
Jun 23, 2020
data

eye 6

favorite 0

comment 0

Mirrored from: https://isaw.nyu.edu/publications/awol-index/ Note creator request: The content of The AWOL Index is derived from: Charles E. Jones, AWOL - The Ancient World Online (ISSN 2156-2253), 2009-. That content is re-used and re-mixed here under the terms of AWOL's Creative Commons Attribution Share-Alike 3.0 Unported license. The production and publication of The AWOL Index contributes significant additional value both to the content itself and to its presentation...
Bulk Bibliographic Metadata
Jun 23, 2020
data

eye 8

favorite 0

comment 0

Mirrored from:  https://github.com/njahn82/vanished_journals/tree/master/data
Bulk Bibliographic Metadata
Jun 23, 2020 Internet Archive Web Group
data

eye 11

favorite 0

comment 0

This item contains datasets of homepage URLs found by hand using search engines and bibliographic metadata (eg, ISSN and journal title). The "long-tail" batch contains about 4,600 journal lookup results, with about 3,900 successful homepage URLs found. The list of journals was created in May 2020, and the lookup work completed in June 2020. IA staff member Richard Greydanus ran this batch of lookups. All of this metadata can be considered public domain, or CC-0 (Creative Commons Zero)...
Bulk Bibliographic Metadata
Jun 23, 2020 SciELO
data

eye 11

favorite 0

comment 0

Bulk Bibliographic Metadata
Jun 12, 2020
data

eye 27

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
May 27, 2020 Internet Archive Web Group
data

eye 19

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
May 27, 2020 Internet Archive Web Group
data

eye 8

favorite 0

comment 0

Bulk Bibliographic Metadata
May 5, 2020 SciELO
data

eye 11

favorite 0

comment 0

Bulk Bibliographic Metadata
May 4, 2020 Impactstory
data

eye 117

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Bulk Bibliographic Metadata
Apr 9, 2020 Crossref
data

eye 91

favorite 0

comment 0

Mirrored via torrent from academic torrents: https://academictorrents.com/details/0c6c3fbfdc13f0169b561d29354ea8b188eb9d63 https://www.crossref.org/blog/free-public-data-file-of-112-million-crossref-records/
Bulk Bibliographic Metadata
Apr 8, 2020 Internet Archive Web Group
data

eye 9

favorite 0

comment 0

About 1 million unique PDFs from Global Wayback before year 2000.
Bulk Bibliographic Metadata
Apr 7, 2020 Internet Archive Web Group
data

eye 33

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Fatcat Database Snapshots and Bulk Metadata Exports
Apr 6, 2020 Internet Archive Web Group
data

eye 33

favorite 0

comment 0