MSAG-PDF-CRAWL-2017
Collection: 1,855 items, 13.9M views
Date archived: Aug 4, 2017
Creator: Internet Archive Web Group
PDF URLs from the Microsoft Academic Graph public corpus (Feb 2016), filtered to remove large sites (PubMed, CiteSeerX, arXiv) and already-crawled URLs.
Topics: papers, journals
CiteSeerX URL Crawl 2017
Collection: 207 items, 1.3M views
Date archived: Jun 20, 2017
A targeted crawl to fetch research publications from the public web that CiteSeerX has crawled but the Internet Archive has not previously crawled.
Topics: scholarly, papers, journal