Skip to main content
SHOW DETAILS
up-solid down-solid
eye
Title
Date Reviewed
Creator
Bulk Bibliographic Metadata
data

eye 28

favorite 1

comment 0

Data mirrored from https://europepmc.org/downloads Contains a mapping between PubMed IDs (PMID), PubMedCentral IDs (PMCID), and DOI numbers, for over 29 million works.
Bulk Bibliographic Metadata
- Allen Institute for Artificial Intelligence
data

eye 33

favorite 0

comment 0

This is a backup of the "Open Academic Search" corpus, published by Semantic Scholar / Allen Institute for AI. For more info see http://labs.semanticscholar.org/corpus/. In particular, note the terms and conditions, and the request: We request that any published research that makes use of this data cites the following paper: Waleed Ammar et al. 2018. Construction of the Literature Graph in Semantic Scholar. NAACL. ...
Bulk Bibliographic Metadata
data

eye 37

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Bulk Bibliographic Metadata
- Allen Institute for Artificial Intelligence
data

eye 24

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus". These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-*-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Bulk Bibliographic Metadata
data

eye 81

favorite 0

comment 0

Snapshot as of 2019-04-15, contains SQL dumps for multiple databases: Complete Library Genesis Comic book database Fiction database 'Compact' Library Genesis database Scientific magazines SQL dumps generated by MySQL/MariaDB database. *** THIS ITEM DOES NOT CONTAIN ANY BOOKS *** Upstream does not provide checksums and all checksums should be taken with some doubt. Databases were archived by the upstream with RAR archiver, file names has been changed to include creation date.
Bulk Bibliographic Metadata
- Japan Link Center
data

eye 29

favorite 0

comment 0

Downloaded from http://japanlinkcenter.org/top/material/material_metadata.html
Bulk Bibliographic Metadata
- ISSN
data

eye 416

favorite 1

comment 0

Unlike most ISSN metadata, this mapping file is publicly available.
Bulk Bibliographic Metadata
data

eye 15

favorite 0

comment 0

This is the 2020 "baseline" PubMed/MEDLINE bibliographic metadata corpus, originally published in December 2019. Downloaded from https://www.nlm.nih.gov/databases/download/pubmed_medline.html
Bulk Bibliographic Metadata
- Internet Archive Web Group
data

eye 29

favorite 0

comment 0

This dump includes all tables (including oauth authentication tables which could be a privacy, but not security, concern). At this time only IA staff have accounts, so the snapshot, which is intended mostly for disaster recovery, is still public.
Bulk Bibliographic Metadata
data

eye 17

favorite 0

comment 0

See README for details. Scraped from: http://ezb.uni-regensburg.de/ezeit/services/collections.phtml?bibid=AAAAA&colors=1〈=en http://ezb.uni-regensburg.de/ezeit/services/xmloutput.phtml?bibid=AAAAA&colors=1〈=de#6.2
Bulk Bibliographic Metadata
data

eye 24

favorite 0

comment 0

OAI-PMH metadata collected from the arxiv.org endpoint, using the arXivRaw schema. Collected in two batches: up through ~2017, then up through May 22nd, 2019.
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 17

favorite 0

comment 0

Bulk Bibliographic Metadata
- Bruns A, Lenke C, Schmidt C, Taubert NC
data

eye 20

favorite 0

comment 0

ISSN-GOLD-OA provides a matching list of ISSN for Gold Open Access (OA) journals. The intention was to compile a matching table that is as complete as possible by using different publicly available sources. The data set offers a basis for various journal-related issues in bibliometric studies on Gold OA. The list is an updated version of ISSN-GOLD-OA . For a detailed description of the method, data sources used and the definition of the table fields, please refer to the original...
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 19

favorite 0

comment 0

This item contains bulk metadata exported from https://fatcat.wiki. With the exception of the 'abstracts' file (for which no aggregate license or copyright claims can be made; downstream users are responsible for their use), all metadata here is licensed CC-0 (public domain release) and may be used for any purpose. Downstream users are strongly encouraged to provide attribution and link here to the snapshot, as well as give credit to upstream sources (including Crossref, ORCID, DOAJ, the ISSN...
Bulk Bibliographic Metadata
data

eye 36

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Bulk Bibliographic Metadata
data

eye 10

favorite 0

comment 0

This item contains snapshots of the PubMed Central OA subset file manifests, linked from https://www.ncbi.nlm.nih.gov/pmc/tools/openftlist
Bulk Bibliographic Metadata
data

eye 245

favorite 0

comment 0

A copy of the "Open Academic Graph" corpus published by aminer.org and Microsoft Academic Graph in Summer 2017. Contains almost 120 GB (compressed) of bibliographic metadata for hundreds of millions of publications. Related publications include: Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. ArnetMiner: Extraction and Mining of Academic Social Networks. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining...
Bulk Bibliographic Metadata
- Internet Archive Web Group
data

eye 24

favorite 0

comment 0

This item contains some bulk research affiliation datasets from Internet Archive cataloging efforts. These are mostly strings included in research papers that indicate the institutional affiliations of specific authors (eg, with a home department, university, or company) at the time of publication. These might be useful datasets for efforts to build complete indices of research organizations, or to test normalization code that maps raw strings to organization identifiers. Attribution and links...
Bulk Bibliographic Metadata
- DIrectory of Open Access Journals
data

eye 50

favorite 0

comment 0

From: https://doaj.org/public-data-dump
Bulk Bibliographic Metadata
data

eye 20

favorite 0

comment 0

This is the 2019 "baseline" PubMed/MEDLINE bibliographic metadata corpus, originally published in December 2018. Downloaded from https://www.nlm.nih.gov/databases/download/pubmed_medline.html
Bulk Bibliographic Metadata
- Allen Institute for Artificial Intelligence
data

eye 15

favorite 0

comment 0

This is a backup of the "Open Academic Search" corpus, published by Semantic Scholar / Allen Institute for AI. For more info see http://labs.semanticscholar.org/corpus/. In particular, note the terms and conditions, and the request: We request that any published research that makes use of this data cites the following paper: Waleed Ammar et al. 2018. Construction of the Literature Graph in Semantic Scholar. NAACL. ...
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 24

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 12

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 67

favorite 0

comment 0

This item contains snapshots of the Datacite OAI-PHM metadata feed, as captured with the tool 'metha'.
Bulk Bibliographic Metadata
- Sci-Hub
data

eye 264

favorite 0

comment 0

On 2017-03-19, The Twitter user @Sci_Hub posted a list of 62,835,101 DOIs contained in Sci-Hub: https://twitter.com/Sci_Hub/status/843546352219017218 This item contains a copy of the list. This item contains no PDFs, papers, fulltext, or other copyrighted content. Important note: not all DOIs in this list are valid (aka, do not resolve via doi.org).
Bulk Bibliographic Metadata
data

eye 76

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset CORE Dataset to Microsoft Academic Graph (MAG) mapping (80MB compressed, 173 MB in total) - 8.9M items License: Open Data Commons Attribution (ODC-By) license.
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 17

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 18

favorite 0

comment 0

This item contains mappings between CORE (https://core.ac.uk/) internal identifiers (simple integer numbers) and DOIs. This listing (a simple two-column TSV file) is derived from their publicly available metadata corpus.
Bulk Bibliographic Metadata
- aiminer.org
data

eye 696

favorite 0

comment 0

A copy of the "Open Academic Graph v2" (OAGv2) corpus published by aminer.org and Microsoft Academic Graph in early 2019. Contains roughly 90 GB (compressed) of bibliographic metadata for hundreds of millions of publications. Related publications include: Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. ArnetMiner: Extraction and Mining of Academic Social Networks. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data...
Bulk Bibliographic Metadata
- Internet Archive Web Group
data

eye 39

favorite 0

comment 0

This item contains a complete PostgreSQL SQL database snapshot from https://fatcat.wiki, in binary 'pg_dump tar mode' format. With the exception of the 'abstracts' table (for which no aggregate license or copyright claims can be made; downstream users are responsible for their use), all metadata here is licensed CC-0 (public domain release) and may be used for any purpose. Downstream users are strongly encouraged to provide attribution and link here to the snapshot, as well as give credit to...
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 28

favorite 1

comment 0

This item contains a complete PostgreSQL SQL database snapshot from https://fatcat.wiki, in binary 'pg_dump tar mode' format. With the exception of the 'abstracts' table (for which no aggregate license or copyright claims can be made; downstream users are responsible for their use), all metadata here is licensed CC-0 (public domain release) and may be used for any purpose. Downstream users are strongly encouraged to provide attribution and link here to the snapshot, as well as give credit to...
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 29

favorite 0

comment 0

Bulk Bibliographic Metadata
- DIrectory of Open Access Journals
data

eye 50

favorite 0

comment 0

Downloaded from https://doaj.org/csv and the OAI-PMH interface. File names encode the date when data was downloaded.
Downloaded from https://core.ac.uk/services "The data aggregated from repositories by the CORE system can be accessed in two ways, through the CORE API or by downloading the data to your computer. The former option is practical if you want to build a service on top of CORE while the latter is something we recommend to those who would like to analyse the CORE dataset and/or apply some computationally intensive batch processes. If you use CORE in your work, we kindly request you to cite one...
Bulk Bibliographic Metadata
data

eye 170

favorite 0

comment 0

This item contains an annual copy of the ORCID public data file, as originally downloaded from: https://orcid.org/content/download-file More details about this content and it's use available at: https://orcid.org/content/orcid-public-data-file This dataset is available under the public domain (CC-0). The DOI of this dataset is: https://doi.org/10.6084/m9.figshare.5479792
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 11

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 35

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 9

favorite 0

comment 0

Downloaded from: ftp://ftp.ncbi.nlm.nih.gov/pubmed/J_Entrez.txt
Bulk Bibliographic Metadata
- Internet Archive Web Group
data

eye 5

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 21

favorite 0

comment 0

Downloaded from, eg:  https://cariniana.ibict.br/index.php/preservacao-de-publicacoes-digitais/periodicos-eletronicos
Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 10

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 22

favorite 1

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
- Internet Archive Web Group
data

eye 20

favorite 0

comment 0

See README.md
Bulk Bibliographic Metadata
data

eye 55

favorite 0

comment 0

This item contains a set of "Keeper's Reports" summarizing journal content preservation coverage from major archival services and networks (Portico, LOCKSS, CLOCKSS).
Bulk Bibliographic Metadata
data

eye 343

favorite 1

comment 0

This file is a snapshot dump of the Crossref DOI metadata API, containing entries for over 107 million DOIs. This was generated by running the scripts at: https://github.com/greenelab/crossref (git commit: 768a49ba1d8ba1971f00471950514716a9f699c8) The script started on 2019-09-09 and completed on 2019-10-06. Format is xz-compressed JSON (one JSON object per line).