Skip to main content
SHOW DETAILS
eye
Title
Date Archived
Creator
DOI-LANDING-CRAWL-2018-06
by Internet Archive Web Group
data

eye 9

favorite 0

comment 0

This item contains output files related to the DOI-LANDING-CRAWL-2018-06 crawl of Crossref DOI redirect landing pages: - list of Crossref DOI numbers attempted - an index of DOI, URL, and final HTTP status codes
Bulk Bibliographic Metadata
by ISSN
data

eye 331

favorite 1

comment 0

Unlike most ISSN metadata, this mapping file is publicly available.
Bulk Bibliographic Metadata
by ROAD: Directory of Open Access Scholarly Resources
data

eye 139

favorite 0

comment 0

This is a backup of ROAD/ISSN metadata from http://road.issn.org/en/contenu/download-road-records Dumps in both MARC XML and RDF format are included; see sub-directory for date of download. See also earlier July 2017 dump at: https://archive.org/download/road-issn-2017 These files are under the Creative Commons Attribution-NonCommercial 4.0 International Public License (aka, CC-BY-NC).
Topic: metadata
Community Video
by Caveh Zahedi
movies

eye 466

favorite 1

comment 0

CiteSeerX URL Crawl 2017
CiteSeerX URL Crawl 2017
collection
207
ITEMS
1.2M
VIEWS
collection

eye 1.2M

A targeted crawl to fetch research publications from the public web which have been crawled by CiteSeerX but have not previously been crawled by the Internet Archive.
Topics: scholarly, papers, journal
COS Sandbox Collection
by Stephen Politzer-Ahles, Edward Matthew Husband
data

eye 37

favorite 0

comment 0

This is an OSF registration, part of a Center for Open Science (COS) / Internet Archive (IA) partnership. For more read the blog post . You can browse the full collection at https://archive.org/details/cos-dev-sandbox . See the original registration at: osf.io That's all, folks! This was a dry run.
Bulk Bibliographic Metadata
by Crossref
data

eye 644

favorite 2

comment 0

This file is a snapshot dump of the Crossref DOI metadata API, containing entries for over 94 million DOIs. Compared to the previous 2017-03 version (see archive.org item "crossref_doi_dump_201703"), this snapshot has a few million more works, but the corpus size is much larger (29 GB compressed vs. 7 GB compressed) as it now contains significantly more citation data, due to the efforts of the Initiative for Open Citations (I4OC) project. This was generated by running the scripts...
CiteSeerX URL Crawl 2017
web

eye 6,447

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:17:51 PDT 2017 to Wed Jul 5 00:28:29 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,479

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 05:56:50 PDT 2017 to Tue Jul 4 23:09:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,600

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:23:44 PDT 2017 to Wed Jul 5 01:37:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,023

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:43:10 PDT 2017 to Wed Jul 5 01:56:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,288

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:57:08 PDT 2017 to Wed Jul 5 07:09:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,417

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:46:59 PDT 2017 to Wed Jul 5 06:59:12 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,380

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 11:27:04 PDT 2017 to Wed Jul 5 04:42:02 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,167

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:45:37 PDT 2017 to Wed Jul 5 05:59:07 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 11,442

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:06:40 PDT 2017 to Wed Jul 5 06:20:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,450

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:28:55 PDT 2017 to Wed Jul 5 12:44:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,466

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:41:00 PDT 2017 to Wed Jul 5 18:35:58 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,004

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:21:08 PDT 2017 to Wed Jul 5 13:39:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,337

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:36:43 PDT 2017 to Wed Jul 5 14:00:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,388

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:39:30 PDT 2017 to Wed Jul 5 22:34:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,489

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:11:26 PDT 2017 to Wed Jul 5 20:24:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,267

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:06:06 PDT 2017 to Wed Jul 5 22:19:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,076

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:00:34 PDT 2017 to Thu Jul 6 01:11:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,870

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 10:59:03 PDT 2017 to Thu Jul 6 04:11:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,347

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:04:32 PDT 2017 to Thu Jul 6 06:17:57 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,480

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:19:33 PDT 2017 to Thu Jul 6 02:30:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 10,485

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:45:15 PDT 2017 to Thu Jul 6 01:55:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,872

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:18:25 PDT 2017 to Thu Jul 6 01:29:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,053

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 02:29:51 PDT 2017 to Thu Jul 6 20:40:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,340

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:25:18 PDT 2017 to Thu Jul 6 06:39:00 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,435

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 01:31:00 PDT 2017 to Thu Jul 6 19:44:19 PDT 2017.
Topic: crawldata
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=fnw3aj0tG-U strain : N2 timestamp : 2012-04-03T12:13:56+01:00 gene : -N/A- chromosome : -N/A- allele : -N/A- strain_description : Schafer Lab N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1011287
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=BDXGxnoR11M strain : RB1226 timestamp : 2010-02-24T12:04:14+00:00 gene : acr-18 chromosome : V allele : ok1285 strain_description : acr-18(ok1285)V sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1014737
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=oeHe7uJNoJQ strain : CB402 timestamp : 2011-11-04T10:10:54+00:00 gene : unc-55 chromosome : I allele : e402 strain_description : unc-55(e402)I sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1022984
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=nC7ZCfRX77s strain : VC1340 timestamp : 2011-09-15T11:39:00+01:00 gene : C52B9.11 chromosome : X allele : gk596 strain_description : C52B9.11(gk596)X sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation...
Source: https://zenodo.org/record/1022173
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=IsOlHPX3E-Q strain : AQ908 timestamp : 2010-11-11T11:05:00+00:00 gene : mec-4 chromosome : X allele : u253 strain_description : mec-4(u253)X; bzIs17[pmec-4::YC2.12; lin-15(+)] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri...
Source: https://zenodo.org/record/1015920
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=wRa_0BMNsfE strain : RB1316 timestamp : 2010-06-15T12:08:36+01:00 gene : unc-105 chromosome : II allele : ok1432 strain_description : unc-105(ok1432)II sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1027630
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=w5EpBg8pYsU strain : N2 timestamp : 2010-07-13T10:04:15+01:00 gene : -N/A- chromosome : -N/A- allele : -N/A- strain_description : Schafer Lab N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1029473
C.elegans behavioural database
by Martineau, Celine N.; Nollen, Ellen A. A.
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=_4xGXdIUSgs strain : CB1112 timestamp : 2014-04-27T15:28:17+02:00 strain_description : cat-2(e1112)II sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1189908
Academic Data and Datasets
by Martin Bencsik
data

eye 3

favorite 0

comment 0

Honeybee accelerometer data
Source: https://figshare.com/articles/dataset/folder_10/4758511/1
Academic Data and Datasets
by Chuan Li; Zhuofan Zhao; Yongming Liu; Bing Liang; Shuxian Guan; Hai Lan; Jing Wang; Yanli Lu; Moju Cao
data

eye 1

favorite 0

comment 0

File 1 of RNA-seq raw data of N48-2 biological replicate 3 spikelets at mononuclear stage. Paired-end sequencing strategy was applied on the Illumina HiSeq 4000 sequencing platform.
Source: https://figshare.com/articles/dataset/RNA-seq_raw_data_of_MS-N_3_1/4906994/1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

Dodders ( Cuscuta spp., Convolvulaceae) are root- and leafless parasitic plants that parasitize a wide range of hosts. C. australis genome harbors only 19671 protein-coding genes, and 11.7% of the conserved orthologs in autotrophic plants are lost in C. australis . Many of these gene loss events likely result from its parasitic lifestyle and large body plan changes. This dataset is the phylogenetic analysis data for gene losses detection and gene alignments used for selection analysis.
Source: https://figshare.com/articles/dataset/Extend_Data_for_cuscuta_australis_genome_analysis/6072131/1
Academic Data and Datasets
by Xiaolu Yu
data

eye 1

favorite 0

comment 0

sequencing data of microRNA released from mPFC in normal mice.
Source: https://figshare.com/articles/dataset/sequencing_data_NPFC_/5808885/1
Academic Data and Datasets
by D. Louis Collins; Gabriel Allan Devenyi; Raihaan Patel; Stephanie Tullo; Min Tae M Park; M. Mallar Chakravarty
data

eye 1

favorite 0

comment 0

Resulting atlas from the atlas-to-template warping technique performed registering the histologically-derived atlas to each of the 5 MRI templates using the ANIMAL_script code, delineating 108 subcortical structures.
Source: https://springernature.figshare.com/articles/dataset/Labels_of_108_subcortical_structures_for_brain3_NIfTI_format_/6068123/1
Academic Data and Datasets
by Salamon, Justin; Bittner, Rachel; Bonada, Jordi; Bosch, Juan Jose; Gómez, Emilia; Bello, Juan Pablo
data

eye 1

favorite 0

comment 0

MDB-mf0-synth ============= MDB-mf0-synth (c) by Justin Salamon, Rachel Bittner, Jordi Bonada, Juan Jose Bosch, Emilia Gómez and Juan Pablo Bello. MDB-mf0-synth is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).  You should have received a copy of the license along with this work. If not, see http://creativecommons.org/licenses/by-nc/4.0/ Created By ---------- Justin Salamon*, Rachel Bittner*, Jordi Bonada^, Juan Jose Bosch^, Emilia...
Source: https://zenodo.org/record/1481170
Academic Data and Datasets
by Spencer, Alan R.T.; Garwood, Russell J.; Rees, Andrew R.; Raine, Robert J.; Rothwell, Gar W.; Hollingworth, Neville T. J.; Hilton, Jason
data

eye 1

favorite 0

comment 0

Dataset S4. SRXMT 8-bit BMP tomographic dataset of BU5265.2. The dataset consists of 1200 8-bit bitmap images compressed as a ZIP archive. Image brightness/contrast optimized and despeckling applied. Note that images from the tomographic stack beginning and end, without specimen data present, have not been included. [ZIP/BMP format 10.6 GB]
Source: https://zenodo.org/record/824047
The data and programs replicate tables and figures from "Human Capital and Development Accounting: New Evidence from Wage Gains at Migration", by Hendricks and Schoellman. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IPIBQP&version=1.1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

This is a gzipped CSV file containing the 13 million Duolingo student learning traces used in experiments by Settles & Meeder (2016). For more details and replication source code, visit: https://github.com/duolingo/halflife-regression This work is released under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/N8XJME&version=1.0
Academic Data and Datasets
by Ginsburg, Adam
data

eye 1

favorite 0

comment 0

APEX map observations of the W51 Main/IRS2 region in the 217-221 GHz band and the 289-293 GHz band as part of project E-098.C-0421A-2016 CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/RYEANM&version=1.0
The isotopic composition of water vapour provides integrated perspectives on the hydrological histories of air masses and has been widely used for tracing physical processes in hydrological and climatic studies. Over the last two decades, the infrared laser spectroscopy technique has been used to measure the isotopic composition of water vapour near the Earth’s surface. Here, we have assembled a global database of high temporal resolution stable water vapour isotope ratios (𝛿18O and 𝛿D)...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/UTE3QI&version=5.0
UNPAYWALL-PDF-CRAWL-2018-07
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

See also the crawl logs item for this crawl.
Community Video
by Benjamin "Mako" Hill
movies

eye 55

favorite 1

comment 0

See also: https://mako.cc/copyrighteous/libreplanet-2018-keynote
Bulk Bibliographic Metadata
data

eye 17

favorite 0

comment 0

This item contains a set of "Keeper's Reports" summarizing journal content preservation coverage from major archival services and networks (Portico, LOCKSS, CLOCKSS). See README for links to where these files were downloaded from.
Topics: Keeper's Reports, Metadata, Preservation
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 23

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus". These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-*-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
CiteSeerX URL Crawl 2017
web

eye 4,005

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 16:02:30 PDT 2017 to Wed Jul 5 09:16:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,381

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 15:29:50 PDT 2017 to Wed Jul 5 08:40:44 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,714

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 17:01:13 PDT 2017 to Wed Jul 5 10:15:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,961

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 16:13:58 PDT 2017 to Wed Jul 5 09:29:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,576

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 09:23:13 PDT 2017 to Wed Jul 5 02:37:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,830

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 14:59:25 PDT 2017 to Wed Jul 5 08:13:10 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,043

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 15:19:45 PDT 2017 to Wed Jul 5 08:32:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,990

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 22:33:37 PDT 2017 to Wed Jul 5 16:22:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,533

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:05:46 PDT 2017 to Wed Jul 5 12:18:30 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,925

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 23:14:09 PDT 2017 to Wed Jul 5 17:01:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,209

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 02:41:55 PDT 2017 to Wed Jul 5 19:54:59 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,853

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:28:50 PDT 2017 to Wed Jul 5 20:41:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,737

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 02:26:07 PDT 2017 to Wed Jul 5 20:39:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,112

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 01:56:59 PDT 2017 to Wed Jul 5 19:14:04 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,779

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 06:15:53 PDT 2017 to Wed Jul 5 23:27:06 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,898

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:37:05 PDT 2017 to Wed Jul 5 22:50:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,399

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 11:29:12 PDT 2017 to Thu Jul 6 04:42:47 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,159

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:09:06 PDT 2017 to Thu Jul 6 02:22:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,015

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 06:04:47 PDT 2017 to Wed Jul 5 23:17:34 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,538

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 11:49:51 PDT 2017 to Thu Jul 6 05:02:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,700

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 07:14:32 PDT 2017 to Thu Jul 6 00:28:30 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,564

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 00:14:00 PDT 2017 to Thu Jul 6 18:31:17 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,111

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 16:43:44 PDT 2017 to Thu Jul 6 10:08:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,060

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 17:09:19 PDT 2017 to Thu Jul 6 10:32:42 PDT 2017.
Topic: crawldata
Wide Web Targeted PDF Crawling (2017)
Wide Web Targeted PDF Crawling (2017)
collection
922
ITEMS
3.1M
VIEWS
by Internet Archive Web Group
collection

eye 3.1M

Github Mirror by Narabot
software

eye 32

favorite 0

comment 0

Scoop by Rusty Foster and the CMF running Kuro5hin and other websites scoop Scoop 1.27 by Rusty Foster and the CMF running Kuro5hin and other websites Also found here:http://archive.debian.net/sarge/web/scoop Scoop is an ealry clone of the Slashdot system with user diaries, story sumission queues, and comments ratings. Saved by Blastar of India and China http://blastar.in/ Deleted from Wikipedia and deemed non-notable:http://en.wikipedia.org/wiki/Wikipedia:Articles for deletion/Scoop (software)...
Topics: GitHub, code, software, git
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

CHARMM36 DOPS simulations (303 K, starting structure from the CHARMM-GUI) performed with a 1.0 nm point at which to switch off the van der Waals interactions. Two different simulations generated with different starting velocities are provided (the files are named v1 and v2 for these different simulations). The trajectories contain only the data from 400-500 ns of the simulations (as per the analysis provided on the nmrlipids blog) and additionally they have been processed with trjconv -skip...
Source: https://zenodo.org/record/1129411
Academic Data and Datasets
by Family name, given names
data

eye 3

favorite 0

comment 0

opendata.dwd.de - OpenData by Deutscher Wetter Dienst Conditions: https://www.dwd.de/EN/service/copyright/copyright_node.html dates and times are UTC.
Source: https://zenodo.org/record/1404410
Academic Data and Datasets
by SXS Collaboration
data

eye 1

favorite 0

comment 0

Simulation of a black-hole binary system evolved by the SpEC code .
Source: https://zenodo.org/record/2639283
GeoTIFF layer (8 x 8 m) containing a model of potential floodplains of watercourses in the landscape of the Czech Republic. Based on watercourse network model (see http://doi.org/10.13140/RG.2.2.19409.48489). For a detailed description of layers, see http://doi.org/10.5281/zenodo.3367296
Source: https://zenodo.org/record/3367357
Academic Data and Datasets
by Delehanty, Casey; Welch, Ryan; Mewhirter, Jack; Wilks, Jason
data

eye 7

favorite 1

comment 0

Description: Does increased militarization of law enforcement agencies (LEAs) lead to an increase in violent behavior among officers? We theorize that the receipt of military equipment increases multiple dimensions of LEA militarization (material, cultural, organizational, and operational) and that such increases lead to more violent behavior. The U.S. Department of Defense 1033 program makes excess military equipment, including weapons and vehicles, available to local LEAs. The variation in...
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

Replication Files (datasets and codes in Stata format): - comparative survey data with individual-level analyses - second-level data of estimates from individual-level analyses - TESS experimental study - MTurk experimental study (pilot) CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/E852VT&version=1.0
This is an agreement (“Agreement”) between you the downloader (“Downloader”) and the owner of the materials (“User”) governing the use of the materials (“Materials”) to be downloaded. I. Acceptance of this Agreement By downloading or otherwise accessing the Materials, Downloader represents his/her acceptance of the terms of this Agreement.   II. Modification of this Agreement Users may modify the terms of this Agreement at any time. However, any modifications to this Agreement...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IAH6Z6&version=6.1
There are two files needed to replicate the analyses described and depicted in “The Effects of Militarized Interstate Disputes on Incumbent Voting Across Genders,” by Shane P. Singh and Jaroslav Tir: (1) The data, “Singh_and_Tir_PB_Replication”; (2) The Stata code, in a do-file, included as “Singh and Tir Replication, Political Behavior.” To proceed with the replication, open the data in Stata. Then, open the do-file. The code can be run directly from that file. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/O9UVFU&version=1.0
Academic Data and Datasets
by Daniel Whitt
data

eye 1

favorite 0

comment 0

see Table 1 in Whitt et al. (2017) , JGR
Source: https://figshare.com/articles/dataset/WTL17_out_XW4d_02/4959236/1
Academic Data and Datasets
by Daniel Whitt
data

eye 1

favorite 0

comment 0

Whitt et al. 2017 Table 1
Source: https://figshare.com/articles/dataset/WTL17_drifters_RW/4958939/1
Accompanying data to "The complexity of high-frequency electric fields impairs jamming avoidance: a potential trade-off in electric sensing" (submitted)
Source: https://figshare.com/articles/dataset/The_complexity_of_high-frequency_electric_fields_impairs_jamming_avoidance_a_potential_trade-off_in_electric_sensing/5361007/1
Academic Data and Datasets
by marta severo
data

eye 1

favorite 0

comment 0

RSS feeds of 36 daily newspapers (in french, english, spanish) of 23 countries, RSS feeds international, 1 January 2014-30 June 2015 – UPD (collected during French research project ANR Geomedia : free access for scientific use only)
Source: https://figshare.com/articles/dataset/Geomedia_extract_AGENDA_titre_desc_zip/5873649/2
Academic Data and Datasets
by Young-Gun Kim
data

eye 1

favorite 0

comment 0

ECG-ViEW II sample dataset
Source: https://figshare.com/articles/dataset/person_csv/4584772/2
High-throughput sequencing raw data for IPEC_B2_B_ samples, IPEC_B2_B_ are the samples from treatment group
Source: https://figshare.com/articles/dataset/High-throughput_sequencing_raw_data_for_1_samples_porcine_epithelial_cell_line_IPEC-J2_/7440755/1
Academic Data and Datasets
by Nicholas E. Protonotarios; Athanassios S. Fokas; Kostas Kostarelos; George A. Kastis
data

eye 1

favorite 0

comment 0

All code, data and reconstructed images used in all studies involved (simulations, real phantom and clinical), as part of the Electronic Supplementary Material.
Source: https://rs.figshare.com/articles/dataset/Code_Data_and_Reconstructed_images_from_The_attenuated_spline_reconstruction_technique_for_single_photon_emission_computed_tomography/7346204/1
Academic Data and Datasets
data

eye 1

favorite 0

comment 0

Source code and datasets for "FastNet: Fast and Accurate Inference of Phylogenetic Networks Using Large-Scale Genomic Sequence Data".
Source: https://figshare.com/articles/dataset/Source_code_and_datasets_for_FastNet_Fast_and_Accurate_Inference_of_Phylogenetic_Networks_Using_Large-Scale_Genomic_Sequence_Data_/5785479/2
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=tWo-B0_Pha0 strain : RB2294 timestamp : 2010-02-24T10:25:25+00:00 gene : acr-6 chromosome : I allele : ok3117 strain_description : acr-6(ok3117)I sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1005747
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=QMJA5q749RU strain : QL22 timestamp : 2012-10-31T12:35:16+00:00 gene : ins-10 chromosome : -N/A- allele : tm3498 strain_description : ins-10(tm3498) sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation :...
Source: https://zenodo.org/record/1022992
C.elegans behavioural database
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=hdGuEbCxL2o strain : N2 timestamp : 2011-06-01T11:38:11+01:00 gene : -N/A- chromosome : -N/A- allele : -N/A- strain_description : Schafer Lab N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1018191