Title
Date Added
Creator
OAI-PMH-CRAWL-2022-10
Nov 16, 2022
data

eye 0

favorite 0

comment 0

OAI-PMH-CRAWL-2022-10
Nov 16, 2022
data

eye 0

favorite 0

comment 0

Community Texts
Oct 28, 2022 Bryan Newbold
software

eye 0

favorite 0

comment 0

Source code: https://gitlab.com/bnewbold/adenosine (software license: AGPLv3+)
OAI-PMH-CRAWL-2022-10
OAI-PMH-CRAWL-2022-10
collection
72
ITEMS
62,745
VIEWS
Oct 10, 2022
collection

eye 62,745

TARGETED-ARTICLE-CRAWL-2022-07
Aug 30, 2022
data

eye 0

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-07
Aug 30, 2022
data

eye 0

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-07
TARGETED-ARTICLE-CRAWL-2022-07
collection
43
ITEMS
217,124
VIEWS
Aug 1, 2022
collection

eye 217,124

Community Texts
Jul 25, 2022
data

eye 255

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2022-04
Jul 6, 2022
data

eye 3

favorite 0

comment 0

DATASET-CRAWL-2022-01
May 17, 2022
data

eye 6

favorite 0

comment 0

DATASET-CRAWL-2022-01
May 17, 2022
data

eye 2

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-04
May 14, 2022
data

eye 0

favorite 0

comment 0

JOURNAL-HOMEPAGE-CRAWL-2022-03
May 11, 2022
data

eye 3

favorite 0

comment 0

JOURNAL-HOMEPAGE-CRAWL-2022-03
May 11, 2022
data

eye 1

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-04
May 11, 2022
data

eye 3

favorite 0

comment 0

Community Texts
May 10, 2022
data

eye 7

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2022-04
UNPAYWALL-PDF-CRAWL-2022-04
collection
41
ITEMS
189,812
VIEWS
Apr 20, 2022
collection

eye 189,812

TARGETED-ARTICLE-CRAWL-2022-04
TARGETED-ARTICLE-CRAWL-2022-04
collection
220
ITEMS
650,705
VIEWS
Apr 20, 2022
collection

eye 650,705

DOI-CRAWL-2022-02
Apr 8, 2022
data

eye 0

favorite 0

comment 0

DOI-CRAWL-2022-02
Apr 8, 2022
data

eye 0

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-03
Mar 18, 2022
data

eye 0

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-03
Mar 18, 2022
data

eye 1

favorite 0

comment 0

TARGETED-ARTICLE-CRAWL-2022-03
TARGETED-ARTICLE-CRAWL-2022-03
collection
9
ITEMS
83,499
VIEWS
Mar 8, 2022
collection

eye 83,499

JOURNAL-HOMEPAGE-CRAWL-2022-03
JOURNAL-HOMEPAGE-CRAWL-2022-03
collection
47
ITEMS
443,681
VIEWS
Mar 8, 2022
collection

eye 443,681

Bulk Bibliographic Metadata
data

eye 11

favorite 1

comment 0

Bulk Bibliographic Metadata
data

eye 7

favorite 0

comment 0

DOI-CRAWL-2022-02
DOI-CRAWL-2022-02
collection
25
ITEMS
403,743
VIEWS
Feb 23, 2022
collection

eye 403,743

JOURNALS-PATCH-CRAWL-2022-01
JOURNALS-PATCH-CRAWL-2022-01
collection
104
ITEMS
1.4M
VIEWS
Jan 13, 2022
collection

eye 1.4M

OAI-PMH-PATCH-CRAWL-2021-12
Jan 11, 2022
data

eye 0

favorite 0

comment 0

OAI-PMH-PATCH-CRAWL-2021-12
Jan 11, 2022
data

eye 1

favorite 0

comment 0

DATASET-CRAWL-2022-01
DATASET-CRAWL-2022-01
collection
5
ITEMS
6,069
VIEWS
Jan 5, 2022
collection

eye 6,069

Community Texts
Dec 29, 2021
data

eye 0

favorite 0

comment 0

OAI-PMH-PATCH-CRAWL-2021-12
OAI-PMH-PATCH-CRAWL-2021-12
collection
75
ITEMS
535,208
VIEWS
Dec 2, 2021
collection

eye 535,208

Community Texts
Nov 23, 2021
data

eye 9

favorite 0

comment 0

Community Texts
Oct 6, 2021
data

eye 2

favorite 0

comment 0

dataverse.harvard.edu Dataset doi:10.7910/DVN/CLSFKX
MAG-PDF-CRAWL-2021-08
MAG-PDF-CRAWL-2021-08
collection
189
ITEMS
1.2M
VIEWS
Aug 11, 2021
collection

eye 1.2M

UNPAYWALL-PDF-CRAWL-2021-07
UNPAYWALL-PDF-CRAWL-2021-07
collection
174
ITEMS
1.3M
VIEWS
Jul 14, 2021
collection

eye 1.3M

Scholarly TDM Corpora
Scholarly TDM Corpora
collection
44
ITEMS
31
VIEWS
Jul 14, 2021 Internet Archive Web Group
collection

eye 31

Access-restricted text and data-mining corpora. If you are interested in getting access to work with this content, contact info@archive.org
Bulk Bibliographic Metadata
Jul 7, 2021 Impactstory
data

eye 27

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Fatcat Database Snapshots and Bulk Metadata Exports
Jun 7, 2021 Internet Archive Web Group
data

eye 50

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Jun 6, 2021 Internet Archive Web Group
data

eye 80

favorite 0

comment 0

Bulk Bibliographic Metadata
May 29, 2021 DOAJ
data

eye 9

favorite 0

comment 0

Bulk Bibliographic Metadata
May 29, 2021 dblp
data

eye 23

favorite 0

comment 0

Bulk Bibliographic Metadata
May 27, 2021 Crossref
data

eye 2,900

favorite 1

comment 0

Mirrored via torrent from Academic Torrents: https://academictorrents.com/details/0c6c3fbfdc13f0169b561d29354ea8b188eb9d63
Bulk Bibliographic Metadata
May 27, 2021 Microsoft Academic
data

eye 1,021

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Bulk Bibliographic Metadata
May 24, 2021 CORE.ac.uk
data

eye 76

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset. CORE Dataset to Microsoft Academic Graph (MAG) mapping (80 MB compressed, 173 MB in total), 8.9M items. License: Open Data Commons Attribution (ODC-By).
Bulk Bibliographic Metadata
May 24, 2021 CORE.ac.uk
data

eye 20

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset. Dataset created for "Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings" (LREC 2020) (62 MB compressed, 204 MB in total). License: Open Data Commons Attribution (ODC-By).
UNPAYWALL-PDF-CRAWL-2021-05
May 4, 2021
data

eye 10

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2021-05
UNPAYWALL-PDF-CRAWL-2021-05
collection
123
ITEMS
1.2M
VIEWS
Apr 27, 2021 Internet Archive Web Group
collection

eye 1.2M

Bulk Bibliographic Metadata
Apr 22, 2021 Japan Link Center
data

eye 28

favorite 0

comment 0

Downloaded from http://japanlinkcenter.org/top/material/material_metadata.html
Community Data
Apr 7, 2021
data

eye 4

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
Mar 11, 2021 Internet Archive Web Group
data

eye 56

favorite 1

comment 0

Bulk Bibliographic Metadata
Mar 2, 2021 Harshdeep Singh, Robert West, & Giovanni Colavizza
data

eye 12

favorite 0

comment 0

Mirrored from: https://zenodo.org/record/3940692. Citation: Harshdeep Singh, Robert West, & Giovanni Colavizza. (2020). Wikipedia Citations: A comprehensive dataset of citations with identifiers extracted from English Wikipedia (Version 0.2) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3940692
Bulk Bibliographic Metadata
Feb 23, 2021 Impactstory
data

eye 52

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Fatcat Database Snapshots and Bulk Metadata Exports
Feb 11, 2021 Bryan Newbold
data

eye 29

favorite 0

comment 0

This item contains compiled binaries and packages (for apt and homebrew) for the fatcat-cli utility. Source code available at: https://gitlab.com/bnewbold/fatcat-cli
Community Data
Jan 22, 2021
data

eye 5

favorite 0

comment 0

Community Data
Jan 13, 2021
data

eye 6

favorite 0

comment 0

Community Data
Dec 30, 2020 Internet Archive
data

eye 188

favorite 0

comment 0

Debian package files for extra software used in our cluster. Files from this item may be downloaded and installed automatically by Ansible scripts, etc. Notes and packaging templates at: https://git.archive.org/bnewbold/ia-deb-pkgs. IA collections folks: feel free to move this from 'opensource' to somewhere more appropriate.
OA-DOI-CRAWL-2020-12
Dec 30, 2020
data

eye 0

favorite 0

comment 0

Community Data
Dec 23, 2020
data

eye 3

favorite 0

comment 0

OA-DOI-CRAWL-2020-12
OA-DOI-CRAWL-2020-12
collection
191
ITEMS
1.9M
VIEWS
Dec 9, 2020 Internet Archive Web Group
collection

eye 1.9M

DOAJ-CRAWL-2020-11
Dec 2, 2020
data

eye 3

favorite 0

comment 0

Community Texts
Dec 1, 2020
data

eye 9

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 8

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 9

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 4

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 8

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 7

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 5

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0

Community Data
Nov 25, 2020
data

eye 6

favorite 0

comment 0