Skip to main content
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Bulk Bibliographic Metadata
Dec 14, 2017 Allen Institute for Artificial Intelligence
data

eye 23

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus". These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-*-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Wide Web Targeted PDF Crawling (2017)
Wide Web Targeted PDF Crawling (2017)
collection
922
ITEMS
3.4M
VIEWS
Sep 21, 2017 Internet Archive Web Group
collection

eye 3.4M

MSAG-PDF-CRAWL-2017
collection
1,855
ITEMS
13.9M
VIEWS
Aug 4, 2017 Internet Archive Web Group
collection

eye 13.9M

Microsoft Academic Graph public corpus (Feb 2016) PDF URLs, filtered to remove large sites (pubmed, citeseerx, arxiv) and already-crawled URLs.
Topics: papers, journals
Bulk Bibliographic Metadata
Jun 26, 2017 Allen Institute for Artificial Intelligence
data

eye 51

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus", as downloaded June 26th, 2017. These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-2017-02-21-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.