Skip to main content
SHOW DETAILS
up-solid down-solid
eye
Title
Date Published
Creator
MSAG-PDF-CRAWL-2017
collection
1,855
ITEMS
12.2M
VIEWS
Aug 1, 2017 Internet Archive Web Group
collection

eye 12.2M

Microsoft Academic Graph public corpus (Feb 2016) PDF URLs, filtered to remove large sites (pubmed, citeseerx, arxiv) and already-crawled URLs.
Topics: papers, journals
CiteSeerX URL Crawl 2017
CiteSeerX URL Crawl 2017
collection
207
ITEMS
1.2M
VIEWS
Jun 1, 2017
collection

eye 1.2M

A targeted crawl to fetch research publications from the public web which have been crawled by CiteSeerX but have not previously been crawled by the Internet Archive.
Topics: scholarly, papers, journal