Skip to main content
SHOW DETAILS
eye
Title
Date Archived
Creator
UNPAYWALL-PDF-CRAWL-2018-07
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

See also the crawl logs item for this crawl.
Community Video
by Benjamin "Mako" Hill
movies

eye 55

favorite 1

comment 0

See also: https://mako.cc/copyrighteous/libreplanet-2018-keynote
Bulk Bibliographic Metadata
data

eye 17

favorite 0

comment 0

This item contains a set of "Keeper's Reports" summarizing journal content preservation coverage from major archival services and networks (Portico, LOCKSS, CLOCKSS). See README for links to where these files were downloaded from.
Topics: Keeper's Reports, Metadata, Preservation
CiteSeerX URL Crawl 2017
web

eye 4,576

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 09:23:13 PDT 2017 to Wed Jul 5 02:37:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,990

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 22:33:37 PDT 2017 to Wed Jul 5 16:22:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,925

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 23:14:09 PDT 2017 to Wed Jul 5 17:01:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,005

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 16:02:30 PDT 2017 to Wed Jul 5 09:16:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,533

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:05:46 PDT 2017 to Wed Jul 5 12:18:30 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,830

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 14:59:25 PDT 2017 to Wed Jul 5 08:13:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,381

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 15:29:50 PDT 2017 to Wed Jul 5 08:40:44 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,714

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 17:01:13 PDT 2017 to Wed Jul 5 10:15:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,961

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 16:13:58 PDT 2017 to Wed Jul 5 09:29:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,043

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 15:19:45 PDT 2017 to Wed Jul 5 08:32:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,112

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 01:56:59 PDT 2017 to Wed Jul 5 19:14:04 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,779

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 06:15:53 PDT 2017 to Wed Jul 5 23:27:06 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,898

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:37:05 PDT 2017 to Wed Jul 5 22:50:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,159

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:09:06 PDT 2017 to Thu Jul 6 02:22:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,853

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:28:50 PDT 2017 to Wed Jul 5 20:41:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,015

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 06:04:47 PDT 2017 to Wed Jul 5 23:17:34 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,737

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 02:26:07 PDT 2017 to Wed Jul 5 20:39:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,700

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 07:14:32 PDT 2017 to Thu Jul 6 00:28:30 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,209

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 02:41:55 PDT 2017 to Wed Jul 5 19:54:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,399

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 11:29:12 PDT 2017 to Thu Jul 6 04:42:47 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,111

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 16:43:44 PDT 2017 to Thu Jul 6 10:08:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,060

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 17:09:19 PDT 2017 to Thu Jul 6 10:32:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,538

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 11:49:51 PDT 2017 to Thu Jul 6 05:02:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,564

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 00:14:00 PDT 2017 to Thu Jul 6 18:31:17 PDT 2017.
Topic: crawldata
Wide Web Targeted PDF Crawling (2017)
Wide Web Targeted PDF Crawling (2017)
collection
922
ITEMS
3.1M
VIEWS
by Internet Archive Web Group
collection

eye 3.1M

Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 23

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus". These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-*-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Bulk Bibliographic Metadata
by DOAJ
data

eye 8

favorite 0

comment 0

UNPAYWALL-PDF-CRAWL-2021-05
UNPAYWALL-PDF-CRAWL-2021-05
collection
123
ITEMS
906,382
VIEWS
by Internet Archive Web Group
collection

eye 906,382

Bulk Bibliographic Metadata
by Microsoft Academic
data

eye 999

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Bulk Bibliographic Metadata
by ORCID, Inc.
data

eye 4

favorite 0

comment 0

This item contains an annual copy of the ORCID public data file, as originally downloaded from: https://orcid.figshare.com/articles/dataset/ORCID_Public_Data_File_2021/16750535 See also: https://info.orcid.org/orcids-2021-public-data-file-is-now-available More details about this content and it's use available at: https://orcid.org/content/orcid-public-data-file This dataset is available under the public domain (CC-0).
Bulk Bibliographic Metadata
by Japan Link Center
data

eye 20

favorite 0

comment 0

Downloaded from http://japanlinkcenter.org/top/material/material_metadata.html
Github Mirror by Narabot
software

eye 32

favorite 0

comment 0

Scoop by Rusty Foster and the CMF running Kuro5hin and other websites scoop Scoop 1.27 by Rusty Foster and the CMF running Kuro5hin and other websites Also found here:http://archive.debian.net/sarge/web/scoop Scoop is an ealry clone of the Slashdot system with user diaries, story sumission queues, and comments ratings. Saved by Blastar of India and China http://blastar.in/ Deleted from Wikipedia and deemed non-notable:http://en.wikipedia.org/wiki/Wikipedia:Articles for deletion/Scoop (software)...
Topics: GitHub, code, software, git
Academic Data and Datasets
by Yechao Yan; Yangyang Xu; Shuping Yue
data

eye 1

favorite 0

comment 0

Daily files of human thermal-stress indices for March 1988 over South and East Asia
Source: https://springernature.figshare.com/articles/dataset/HiTiSEA_1988-03/14560443/1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

CHARMM36 DOPS simulations (303 K, starting structure from the CHARMM-GUI) performed with a 1.0 nm point at which to switch off the van der Waals interactions. Two different simulations generated with different starting velocities are provided (the files are named v1 and v2 for these different simulations). The trajectories contain only the data from 400-500 ns of the simulations (as per the analysis provided on the nmrlipids blog) and additionally they have been processed with trjconv -skip...
Source: https://zenodo.org/record/1129411
Academic Data and Datasets
by Raul F. Pérez; J. Ramón Tejedor; Gustavo F. Bayón; Agustín F. Fernandez; Mario F. Fraga
data

eye 1

favorite 0

comment 0

Cancer is an aging-associated disease but the underlying molecular links between these processes are still largely unknown. Gene promoters that become hypermethylated in aging and cancer share a common chromatin signature in ES cells. In addition, there is also global DNA hypomethylation in both processes. However, any similarities of the regions where this loss of DNA methylation occurs is currently not well characterized, nor is it known whether such regions also share a common chromatin...
Source: https://zenodo.org/record/1086491
Academic Data and Datasets
by Family name, given names
data

eye 3

favorite 0

comment 0

opendata.dwd.de - OpenData by Deutscher Wetter Dienst Conditions: https://www.dwd.de/EN/service/copyright/copyright_node.html dates and times are UTC.
Source: https://zenodo.org/record/1404410
Academic Data and Datasets
by SXS Collaboration
data

eye 1

favorite 0

comment 0

Simulation of a black-hole binary system evolved by the SpEC code .
Source: https://zenodo.org/record/2639283
There are two files needed to replicate the analyses described and depicted in “The Effects of Militarized Interstate Disputes on Incumbent Voting Across Genders,” by Shane P. Singh and Jaroslav Tir: (1) The data, “Singh_and_Tir_PB_Replication”; (2) The Stata code, in a do-file, included as “Singh and Tir Replication, Political Behavior.” To proceed with the replication, open the data in Stata. Then, open the do-file. The code can be run directly from that file. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/O9UVFU&version=1.0
GeoTIFF layer (8 x 8 m) containing a model of potential floodplains of watercourses in the landscape of the Czech Republic. Based on watercourse network model (see http://doi.org/10.13140/RG.2.2.19409.48489). For a detailed description of layers, see http://doi.org/10.5281/zenodo.3367296
Source: https://zenodo.org/record/3367357
Academic Data and Datasets
by Yahai Zhang; Aizhong Ye
data

eye 1

favorite 0

comment 0

A new global gross primary production dataset covering 1980–2018 (under review on Scientific Data)
Source: https://figshare.com/articles/dataset/BTCH_zip/14332580/3
Accompanying data to "The complexity of high-frequency electric fields impairs jamming avoidance: a potential trade-off in electric sensing" (submitted)
Source: https://figshare.com/articles/dataset/The_complexity_of_high-frequency_electric_fields_impairs_jamming_avoidance_a_potential_trade-off_in_electric_sensing/5361007/1
Academic Data and Datasets
by marta severo
data

eye 1

favorite 0

comment 0

RSS feeds of 36 daily newspapers (in french, english, spanish) of 23 countries, RSS feeds international, 1 January 2014-30 June 2015 – UPD (collected during French research project ANR Geomedia : free access for scientific use only)
Source: https://figshare.com/articles/dataset/Geomedia_extract_AGENDA_titre_desc_zip/5873649/2
Academic Data and Datasets
by Young-Gun Kim
data

eye 1

favorite 0

comment 0

ECG-ViEW II sample dataset
Source: https://figshare.com/articles/dataset/person_csv/4584772/2
High-throughput sequencing raw data for IPEC_B2_B_ samples, IPEC_B2_B_ are the samples from treatment group
Source: https://figshare.com/articles/dataset/High-throughput_sequencing_raw_data_for_1_samples_porcine_epithelial_cell_line_IPEC-J2_/7440755/1
Academic Data and Datasets
by Nicholas E. Protonotarios; Athanassios S. Fokas; Kostas Kostarelos; George A. Kastis
data

eye 1

favorite 0

comment 0

All code, data and reconstructed images used in all studies involved (simulations, real phantom and clinical), as part of the Electronic Supplementary Material.
Source: https://rs.figshare.com/articles/dataset/Code_Data_and_Reconstructed_images_from_The_attenuated_spline_reconstruction_technique_for_single_photon_emission_computed_tomography/7346204/1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

Source code and datasets for "FastNet: Fast and Accurate Inference of Phylogenetic Networks Using Large-Scale Genomic Sequence Data".
Source: https://figshare.com/articles/dataset/Source_code_and_datasets_for_FastNet_Fast_and_Accurate_Inference_of_Phylogenetic_Networks_Using_Large-Scale_Genomic_Sequence_Data_/5785479/2
Academic Data and Datasets
by Bjorn Herrmann
data

eye 2

favorite 0

comment 0

sub-47
Source: https://figshare.com/articles/dataset/sub-47_rar/16566861/1
Academic Data and Datasets
by Daniel Whitt
data

eye 1

favorite 0

comment 0

see Table 1 in Whitt et al. (2017) , JGR
Source: https://figshare.com/articles/dataset/WTL17_out_XW4d_02/4959236/1
Academic Data and Datasets
by Daniel Whitt
data

eye 1

favorite 0

comment 0

Whitt et al. 2017 Table 1
Source: https://figshare.com/articles/dataset/WTL17_drifters_RW/4958939/1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

Replication Files (datasets and codes in Stata format): - comparative survey data with individual-level analyses - second-level data of estimates from individual-level analyses - TESS experimental study - MTurk experimental study (pilot) CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/E852VT&version=1.0
This is an agreement (“Agreement”) between you the downloader (“Downloader”) and the owner of the materials (“User”) governing the use of the materials (“Materials”) to be downloaded. I. Acceptance of this Agreement By downloading or otherwise accessing the Materials, Downloader represents his/her acceptance of the terms of this Agreement.   II. Modification of this Agreement Users may modify the terms of this Agreement at any time. However, any modifications to this Agreement...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IAH6Z6&version=6.1
Academic Data and Datasets
by Delehanty, Casey; Welch, Ryan; Mewhirter, Jack; Wilks, Jason
data

eye 7

favorite 1

comment 0

Description: Does increased militarization of law enforcement agencies (LEAs) lead to an increase in violent behavior among officers? We theorize that the receipt of military equipment increases multiple dimensions of LEA militarization (material, cultural, organizational, and operational) and that such increases lead to more violent behavior. The U.S. Department of Defense 1033 program makes excess military equipment, including weapons and vehicles, available to local LEAs. The variation in...
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=tWo-B0_Pha0 strain : RB2294 timestamp : 2010-02-24T10:25:25+00:00 gene : acr-6 chromosome : I allele : ok3117 strain_description : acr-6(ok3117)I sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1005747
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=QMJA5q749RU strain : QL22 timestamp : 2012-10-31T12:35:16+00:00 gene : ins-10 chromosome : -N/A- allele : tm3498 strain_description : ins-10(tm3498) sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation :...
Source: https://zenodo.org/record/1022992
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=hdGuEbCxL2o strain : N2 timestamp : 2011-06-01T11:38:11+01:00 gene : -N/A- chromosome : -N/A- allele : -N/A- strain_description : Schafer Lab N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1018191
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=BsMCkwqjKl4 strain : NL795 timestamp : 2010-03-05T10:35:38+00:00 gene : gpa-7 chromosome : IV allele : pk610 strain_description : gpa-7(pk610)IV sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation :...
Source: https://zenodo.org/record/1014191
C.elegans behavioural database
by Martineau, Celine N.; Nollen, Ellen A. A.
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=U7c_x71WXu4 strain : OW939 timestamp : 2014-04-20T16:19:06+02:00 strain_description : zgIs113[P(dat-1)::alpha-Synuclein::YFP] sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who :...
Source: https://zenodo.org/record/1191594
C.elegans behavioural database
by Martineau, Celine N.; Nollen, Ellen A. A.
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=1dNYcJcxrwo strain : OW949 timestamp : 2014-03-03T13:11:21+01:00 strain_description : zgIs125[P(dat-1)::alpha-Synuclein::YFP] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who :...
Source: https://zenodo.org/record/1203472
DOI-LANDING-CRAWL-2018-06
by Internet Archive Web Group
data

eye 9

favorite 0

comment 0

This item contains output files related to the DOI-LANDING-CRAWL-2018-06 crawl of Crossref DOI redirect landing pages: - list of Crossref DOI numbers attempted - an index of DOI, URL, and final HTTP status codes
CiteSeerX URL Crawl 2017
web

eye 11,442

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:06:40 PDT 2017 to Wed Jul 5 06:20:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,380

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 11:27:04 PDT 2017 to Wed Jul 5 04:42:02 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,447

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:17:51 PDT 2017 to Wed Jul 5 00:28:29 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,167

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:45:37 PDT 2017 to Wed Jul 5 05:59:07 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,023

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:43:10 PDT 2017 to Wed Jul 5 01:56:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,600

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:23:44 PDT 2017 to Wed Jul 5 01:37:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,479

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 05:56:50 PDT 2017 to Tue Jul 4 23:09:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,450

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:28:55 PDT 2017 to Wed Jul 5 12:44:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,466

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:41:00 PDT 2017 to Wed Jul 5 18:35:58 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,288

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:57:08 PDT 2017 to Wed Jul 5 07:09:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,417

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:46:59 PDT 2017 to Wed Jul 5 06:59:12 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,004

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:21:08 PDT 2017 to Wed Jul 5 13:39:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,337

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:36:43 PDT 2017 to Wed Jul 5 14:00:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 10,485

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:45:15 PDT 2017 to Thu Jul 6 01:55:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,388

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:39:30 PDT 2017 to Wed Jul 5 22:34:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,076

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:00:34 PDT 2017 to Thu Jul 6 01:11:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,489

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:11:26 PDT 2017 to Wed Jul 5 20:24:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,872

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:18:25 PDT 2017 to Thu Jul 6 01:29:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,267

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:06:06 PDT 2017 to Wed Jul 5 22:19:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,480

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:19:33 PDT 2017 to Thu Jul 6 02:30:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,340

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:25:18 PDT 2017 to Thu Jul 6 06:39:00 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,870

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 10:59:03 PDT 2017 to Thu Jul 6 04:11:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,347

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:04:32 PDT 2017 to Thu Jul 6 06:17:57 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,053

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 02:29:51 PDT 2017 to Thu Jul 6 20:40:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,435

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 01:31:00 PDT 2017 to Thu Jul 6 19:44:19 PDT 2017.
Topic: crawldata
Community Video
by Caveh Zahedi
movies

eye 466

favorite 1

comment 0

COS Sandbox Collection
by Stephen Politzer-Ahles, Edward Matthew Husband
data

eye 37

favorite 0

comment 0

This is an OSF registration, part of a Center for Open Science (COS) / Internet Archive (IA) partnership. For more read the blog post . You can browse the full collection at https://archive.org/details/cos-dev-sandbox . See the original registration at: osf.io That's all, folks! This was a dry run.
Bulk Bibliographic Metadata
by Crossref
data

eye 644

favorite 2

comment 0

This file is a snapshot dump of the Crossref DOI metadata API, containing entries for over 94 million DOIs. Compared to the previous 2017-03 version (see archive.org item "crossref_doi_dump_201703"), this snapshot has a few million more works, but the corpus size is much larger (29 GB compressed vs. 7 GB compressed) as it now contains significantly more citation data, due to the efforts of the Initiative for Open Citations (I4OC) project. This was generated by running the scripts...
CiteSeerX URL Crawl 2017
CiteSeerX URL Crawl 2017
collection
207
ITEMS
1.2M
VIEWS
collection

eye 1.2M

A targeted crawl to fetch research publications from the public web which have been crawled by CiteSeerX but have not previously been crawled by the Internet Archive.
Topics: scholarly, papers, journal
Bulk Bibliographic Metadata
by dblp
data

eye 20

favorite 0

comment 0

Bulk Bibliographic Metadata
by ISSN
data

eye 331

favorite 1

comment 0

Unlike most ISSN metadata, this mapping file is publicly available.
Bulk Bibliographic Metadata
by Microsoft Academic
data

eye 936

favorite 1

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Bulk Bibliographic Metadata
by ROAD: Directory of Open Access Scholarly Resources
data

eye 139

favorite 0

comment 0

This is a backup of ROAD/ISSN metadata from http://road.issn.org/en/contenu/download-road-records Dumps in both MARC XML and RDF format are included; see sub-directory for date of download. See also earlier July 2017 dump at: https://archive.org/download/road-issn-2017 These files are under the Creative Commons Attribution-NonCommercial 4.0 International Public License (aka, CC-BY-NC).
Topic: metadata
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

This data set contains the stellar velocities used in the paper. It also includes profiles and parameter distributions derived using CJAM. It does not include the raw MUSE data which is available in the ESO archive. CC0 Waiver
Source: https://data.goettingen-research-online.de/dataset.xhtml?persistentId=doi:10.25625/VCNHOR&version=1.0
Academic Data and Datasets
data

eye 2

favorite 0

comment 0

ENDOR measurement raw data (263 GHz), summarized DFT results, simulation results and processing notebooks for re-creating all figures in the manuscript and supplementary information in the paper: "Distribution of H$^\beta$ Hyperfine Couplings in a Tyrosyl Radical Revealed by 263 GHz ENDOR Spectroscopy" This dataset is published used under the CC BY-NC-ND 4.0 license (Attribution-NonCommercial-NoDerivatives 4.0 International).
Source: https://data.goettingen-research-online.de/dataset.xhtml?persistentId=doi:10.25625/AAFR6T&version=1.0
The data and programs replicate tables and figures from "Human Capital and Development Accounting: New Evidence from Wage Gains at Migration", by Hendricks and Schoellman. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IPIBQP&version=1.1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

This is a gzipped CSV file containing the 13 million Duolingo student learning traces used in experiments by Settles & Meeder (2016). For more details and replication source code, visit: https://github.com/duolingo/halflife-regression This work is released under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/N8XJME&version=1.0
Academic Data and Datasets
by Ginsburg, Adam
data

eye 1

favorite 0

comment 0

APEX map observations of the W51 Main/IRS2 region in the 217-221 GHz band and the 289-293 GHz band as part of project E-098.C-0421A-2016 CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/RYEANM&version=1.0