600
UPLOADS


Internet Archive Research Publication Crawls
by CNKI
data

eye 0

favorite 0

comment 0

Metadata about COVID-19 papers downloaded from: http://en.gzbd.cnki.net/GZBT/brief/Default.aspx
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=IsOlHPX3E-Q strain : AQ908 timestamp : 2010-11-11T11:05:00+00:00 gene : mec-4 chromosome : X allele : u253 strain_description : mec-4(u253)X; bzIs17[pmec-4::YC2.12; lin-15(+)] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri...
Source: https://zenodo.org/record/1015920
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=oeHe7uJNoJQ strain : CB402 timestamp : 2011-11-04T10:10:54+00:00 gene : unc-55 chromosome : I allele : e402 strain_description : unc-55(e402)I sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1022984
Web PDF Training Sets
by Internet Archive Web Group
data

eye 15

favorite 0

comment 0

This item contains three .zip archives, each containing a sample corpus of about 10,000 (or more) HTML documents from the IA web archive. For each, there is some form of metadata (CDX or JSON) with information about the original URL and timestamp for each document, and then directories containing HTML, extracted TEI-XML, and extracted TXT for each document. Some fraction of documents failed to download or failed to extract, so there are fewer .html (and derivative) files than...
The Dataset Collection
by John Hildyard
data

eye 1

favorite 0

comment 0

This file contains supplementary figures 1-5 from the extended data of the manuscript Single-transcript multiplex in situ hybridisation reveals unique patterns of dystrophin isoform expression in the developing mammalian embryo John C.W. Hildyard, Abbe H. Crawford, Faye Rawson, Dominique O. Riddell, Rachel C.M. Harron, Richard J. Piercy
Source: https://figshare.com/articles/dataset/Dystrophin_multiplex_ISH_Extended_data/12040746/1
The Dataset Collection
by Broad DepMap
data

eye 1

favorite 0

comment 0

This dataset contains the results of Avana library CRISPR-Cas9 genome-scale knockout (prefixed with Achilles) as well as mutation, copy number and gene expression data (prefixed with CCLE) for cancer cell lines as part of the Broad Institute’s Cancer Dependency Map project. We have repackaged our fileset to include all quarterly-updating datasets produced by DepMap. The Avana CRISPR-Cas9 genome-scale knockout data has expanded to include 739 cell lines, the RNAseq data includes 1270 cell...
Source: https://figshare.com/articles/dataset/DepMap_20Q1_Public/11791698/2
These are solar wind in situ data arrays in Python pickle format suitable for machine learning, i.e. the arrays consist only of numbers, with no strings and no datetime objects. See AAREADME_insitu_ML.txt for more explanation. If you use these data for peer-reviewed scientific publications, please get in touch concerning usage and possible co-authorship by the authors (C. Möstl, A. J. Weiss, R. L. Bailey, A. Isavnin): christian.moestl@oeaw.ac.at or twitter @chrisoutofspace Made with...
Source: https://figshare.com/articles/dataset/Solar_wind_in_situ_data_suitable_for_machine_learning_python_numpy_arrays_STEREO-A_B_Wind_Parker_Solar_Probe_Ulysses_Venus_Express_MESSENGER/12058065/2
The Dataset Collection
by Hao Luo
data

eye 1

favorite 0

comment 0

Annotation of alternative splicing events in GENCODE
Source: https://figshare.com/articles/dataset/GENCODE/12524393/7
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0197, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0197_20171205a_videos/12947543/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories data. Trajectories of WW domain
Source: https://figshare.com/articles/dataset/GTT-1-protein-120_dcd/12162789/1
The Dataset Collection
by Tim Fischer
data

eye 1

favorite 0

comment 0

All recordings and source files for the measurements with this participant. For details, please see the "Methods" section of the publication "Multichannel acoustic source and image dataset for the cocktail party effect in hearing aid and implant users".
Source: https://figshare.com/articles/dataset/Human_Subjects_Audio_ID_09_zip/12771479/1
The Dataset Collection
by Simon Rasmussen
data

eye 1

favorite 0

comment 0

Near-complete bacterial genomes produced by VAMB from the Almeida et al. (Nature, 2019) benchmark dataset (1,000 human gut microbiome samples). This is part 5 of 5.
Source: https://figshare.com/articles/dataset/Near_Complete_Bins_Almeida_dataset_part_E/13221743/1
The Dataset Collection
by Joan Pulupa
data

eye 1

favorite 0

comment 0

The p:s ratios of Nup54-mEGFP 494 fusion proteins with a flexible linker do not shift upon amino acid additions.
Source: https://figshare.com/articles/dataset/Figure1G_Nup54-mEGFP494_flex0_/13333757/1
The Dataset Collection
by Joel Sharbrough; Justin L. Conover; Corrinne Grover; Matheus Fernandes Gyorfy; Emma R. Miller; Jonathan F. Wendel; Daniel Sloan
data

eye 1

favorite 0

comment 0

Whole-genome duplications (WGDs), in which the number of nuclear genome copies is elevated as a result of autopolyploidy or allopolyploidy, are a prominent process of diversification in eukaryotes. The genetic and evolutionary forces that WGD imposes upon cytoplasmic genomes are not well understood, despite the central role that cytonuclear interactions play in eukaryotic function and fitness. Cellular respiration and photosynthesis depend upon successful interaction between the 3000+...
Source: https://figshare.com/articles/dataset/Global_patterns_of_subgenome_evolution_in_organelle-targeted_genes_of_six_allotetraploid_angiosperms/13473207/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0198, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0198_20171205a_videos/13174448/1
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=BDXGxnoR11M strain : RB1226 timestamp : 2010-02-24T12:04:14+00:00 gene : acr-18 chromosome : V allele : ok1285 strain_description : acr-18(ok1285)V sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1014737
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=wRa_0BMNsfE strain : RB1316 timestamp : 2010-06-15T12:08:36+01:00 gene : unc-105 chromosome : II allele : ok1432 strain_description : unc-105(ok1432)II sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1027630
Bulk Bibliographic Metadata
data

eye 7

favorite 0

comment 0

Mirrored from:  https://github.com/njahn82/vanished_journals/tree/master/data
Community Video
by Caveh Zahedi
movies

eye 461

favorite 1

comment 0

Bulk Bibliographic Metadata
by ISSN
data

eye 318

favorite 1

comment 0

Unlike most ISSN metadata, this mapping file is publicly available.
Internet Archive Research Publication Crawls
by Wanfang Data
data

eye 4

favorite 0

comment 0

Metadata and some fulltext PDFs from Wanfang Data, downloaded 2020-03-29 from http://subject.med.wanfangdata.com.cn/Channel/7
The Dataset Collection
by Honorata Kraskiewicz; Maria Paprocka; Aleksandra Bielawska-Pohl; Agnieszka Krawczenko; Kinga Panek; Judyta Kaczyńska; Agnieszka Szyposzyńska; Mateusz Psurski; Piotr Kuropka; Aleksandra Klimczak
data

eye 1

favorite 0

comment 0

Additional file 4. Migration activity of native HATMSC supernatants. MSU-1.1 cell migration activity was investigated at 37 °C in an incubation chamber (PeCon GmbH, Erbach, Germany) with 1% O2, 5% CO2 mounted on an Axio Observer inverted microscope equipped with a dry 5x objective (Zeiss, Gottingen, Germany). The movement of the cells was time-lapse recorded for 44 h at intervals of 2 h using Zen 2.6 Blue Edition Software (Zeiss, Gottingen, Germany) as 6 separate movies (one for each...
Source: https://springernature.figshare.com/articles/dataset/MOESM4_of_Can_supernatant_from_immortalized_adipose_tissue_MSC_replace_cell_therapy_An_in_vitro_study_in_chronic_wounds_model/11686431/1
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=w5EpBg8pYsU strain : N2 timestamp : 2010-07-13T10:04:15+01:00 gene : -N/A- chromosome : -N/A- allele : -N/A- strain_description : Schafer Lab N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1029473
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=fnw3aj0tG-U strain : N2 timestamp : 2012-04-03T12:13:56+01:00 gene : -N/A- chromosome : -N/A- allele : -N/A- strain_description : Schafer Lab N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35...
Source: https://zenodo.org/record/1011287
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=nC7ZCfRX77s strain : VC1340 timestamp : 2011-09-15T11:39:00+01:00 gene : C52B9.11 chromosome : X allele : gk596 strain_description : C52B9.11(gk596)X sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation...
Source: https://zenodo.org/record/1022173
The Dataset Collection
data

eye 1

favorite 0

comment 0

KG-COVID-19 graph in KGX TSV format, built on Sep 1, 2020, with no CORD-19 data
Source: https://zenodo.org/record/4012578
The Dataset Collection
by Spencer, Alan R.T.; Garwood, Russell J.; Rees, Andrew R.; Raine, Robert J.; Rothwell, Gar W.; Hollingworth, Neville T. J.; Hilton, Jason
data

eye 1

favorite 0

comment 0

Dataset S4. SRXMT 8-bit BMP tomographic dataset of BU5265.2. The dataset consists of 1200 8-bit bitmap images compressed as a ZIP archive. Image brightness/contrast optimized and despeckling applied. Note that images from the tomographic stack beginning and end, without specimen data present, have not been included. [ZIP/BMP format 10.6 GB]
Source: https://zenodo.org/record/824047
The Dataset Collection
by Ginsburg, Adam
data

eye 1

favorite 0

comment 0

APEX map observations of the W51 Main/IRS2 region in the 217-221 GHz band and the 289-293 GHz band as part of project E-098.C-0421A-2016 CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/RYEANM&version=1.0
The Dataset Collection
by Chuan Li; Zhuofan Zhao; Yongming Liu; Bing Liang; Shuxian Guan; Hai Lan; Jing Wang; Yanli Lu; Moju Cao
data

eye 1

favorite 0

comment 0

File 1 of RNA-seq raw data of N48-2 biological replicate 3 spikelets at mononuclear stage. Paired-end sequencing strategy was applied on the Illumina HiSeq 4000 sequencing platform.
Source: https://figshare.com/articles/dataset/RNA-seq_raw_data_of_MS-N_3_1/4906994/1
The Dataset Collection
by Darcy Jones
data

eye 5

favorite 0

comment 0

All supplementary material and full resolution figures for the Predector pipeline manuscript. Figure 1: UpSet plot showing predictions of signal peptides, transmembrane domains, and effector-like properties for all known effectors in the training dataset (N=125). Rows indicate sets of proteins predicted to have a property related to effector prediction (e.g. a signal peptide), with the horizontal bar chart indicating set size. Columns indicate where the horizontal sets intersect with each...
Source: https://figshare.com/articles/dataset/Predector_-_supplementary_material/13325213/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 3

favorite 0

comment 0

Rat 0229, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0229_20181102a_videos/13010276/1
The Dataset Collection
by Martin Bencsik
data

eye 3

favorite 0

comment 0

Honeybee accelerometer data
Source: https://figshare.com/articles/dataset/folder_10/4758511/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0309, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0309_20191119a_videos/13110581/1
The Dataset Collection
by Vitalij Novickij; Irute Girkontaite
data

eye 2

favorite 0

comment 0

Raw data from experiments
Source: https://figshare.com/articles/dataset/Raw_dataset/13507140/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-155_dcdWW_domain_trajectories/12162894/2
The Dataset Collection
by Raymond Haggerty
data

eye 1

favorite 0

comment 0

Contains all the input and output files used to generate figure 4 Revision_IN_[motif].mat are the input files for each biologically relevant motif. OUT_[motif].mat are the output files run through MISC corresponding to each of the input files.
Source: https://figshare.com/articles/dataset/Figure_4_Reproduction_Files/12648905/1
Simulation data used in the paper CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/9JI57U&version=2.0
These include the relevant files needed for replicating George A. Krause and Matthew Zarit's "The Retraction of Policy Benefits Across U.S. Federal Agencies: Programmatic Cutbacks and Executive Control of U.S. Federal Grant Retrenchments." Forthcoming, Public Administration Review. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/AIICA1&version=1.0
The Dataset Collection
data

eye 1

favorite 0

comment 0

This is a gzipped CSV file containing the 13 million Duolingo student learning traces used in experiments by Settles & Meeder (2016). For more details and replication source code, visit: https://github.com/duolingo/halflife-regression This work is released under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/N8XJME&version=1.0
Bulk Bibliographic Metadata
by Microsoft Academic
data

eye 58

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" (https://opendatacommons.org/licenses/by/1.0/). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc., at the following URLs and in the following...
CiteSeerX URL Crawl 2017
web

eye 5,284

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 05:56:50 PDT 2017 to Tue Jul 4 23:09:37 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,387

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:23:44 PDT 2017 to Wed Jul 5 01:37:05 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,795

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 08:43:10 PDT 2017 to Wed Jul 5 01:56:51 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,214

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 07:17:51 PDT 2017 to Wed Jul 5 00:28:29 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,104

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:57:08 PDT 2017 to Wed Jul 5 07:09:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,259

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 00:41:00 PDT 2017 to Wed Jul 5 18:35:58 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,210

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:46:59 PDT 2017 to Wed Jul 5 06:59:12 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,014

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:36:43 PDT 2017 to Wed Jul 5 14:00:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,908

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 12:45:37 PDT 2017 to Wed Jul 5 05:59:07 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 7,198

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:11:26 PDT 2017 to Wed Jul 5 20:24:18 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,091

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 05:06:06 PDT 2017 to Wed Jul 5 22:19:48 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,848

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:00:34 PDT 2017 to Thu Jul 6 01:11:41 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,689

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 10:59:03 PDT 2017 to Thu Jul 6 04:11:38 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,314

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 09:19:33 PDT 2017 to Thu Jul 6 02:30:40 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 10,216

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:45:15 PDT 2017 to Thu Jul 6 01:55:13 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 8,622

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 08:18:25 PDT 2017 to Thu Jul 6 01:29:26 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,177

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:04:32 PDT 2017 to Thu Jul 6 06:17:57 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,908

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 02:29:51 PDT 2017 to Thu Jul 6 20:40:52 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,157

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 13:25:18 PDT 2017 to Thu Jul 6 06:39:00 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,240

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc285.us.archive.org:CITESEERX-CRAWL-2017 from Fri Jul 7 01:31:00 PDT 2017 to Thu Jul 6 19:44:19 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 4,208

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 19:28:55 PDT 2017 to Wed Jul 5 12:44:56 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 3,716

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 20:21:08 PDT 2017 to Wed Jul 5 13:39:32 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 6,172

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 11:27:04 PDT 2017 to Wed Jul 5 04:42:02 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 5,181

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Thu Jul 6 03:39:30 PDT 2017 to Wed Jul 5 22:34:42 PDT 2017.
Topic: crawldata
CiteSeerX URL Crawl 2017
web

eye 11,127

favorite 0

comment 0

Internet Archive crawldata of uncrawled CiteseerX PDF URLs captured by wbgrp-svc284.us.archive.org:CITESEERX-CRAWL-2017 from Wed Jul 5 13:06:40 PDT 2017 to Wed Jul 5 06:20:59 PDT 2017.
Topic: crawldata
OA-JOURNAL-CRAWL-2020-07
by Internet Archive Web Group
collection

1,923 items

eye 9.7M

PLATFORM-CRAWL-2020
by Internet Archive Web Group
collection

649 items

eye 404,310

COS Sandbox Collection
by Stephen Politzer-Ahles, Edward Matthew Husband
data

eye 37

favorite 0

comment 0

This is an OSF registration, part of a Center for Open Science (COS) / Internet Archive (IA) partnership. For more, read the blog post. You can browse the full collection at https://archive.org/details/cos-dev-sandbox. See the original registration at: osf.io. That's all, folks! This was a dry run.
CiteSeerX URL Crawl 2017
collection

207 items

eye 1.1M

A targeted crawl to fetch research publications from the public web which have been crawled by CiteSeerX but have not previously been crawled by the Internet Archive.
Topics: scholarly, papers, journal
OA-JOURNAL-CRAWL-2020-07
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

The Dataset Collection
by Akshay Yadav; David Fernández-Baca; Steven Cannon, scannon@iastate.edu
data

eye 1

favorite 0

comment 0

Yeast (YGOB) and legume gene families used for testing methods for detecting and correcting under-clustered and over-clustered gene families ygob_proteomes.tar.gz : Complete yeast proteomes from YGOB database ygob_family_fasta.tar.gz : Complete yeast families from the YGOB database ygob_family_fasta_delete.tar.gz : Intentionally under-clustered yeast families with missing 20% sequences ygob_family_fasta_insert_delete.tar.gz : Intentionally under-clustered yeast families with missing 20%...
Source: https://figshare.com/articles/dataset/Methods_for_analyzing_comparing_and_correcting_gene_families/12115305/1
The Dataset Collection
by Marieke Jepma; Tor Wager
data

eye 1

favorite 0

comment 0

Contrast maps of painful heat vs. baseline. Heat stimuli preceded by heat-predictive cue, with cue and heat level fully crossed. Data formatted as fmri_data object as implemented in github.com/canlab/canlabCore
Source: https://figshare.com/articles/dataset/ie2/12797297/1
The Dataset Collection
by Thomas Cauchy
data

eye 1

favorite 0

comment 0

435 032 nwchem logfiles calculated thanks to the quchempedia BOINC project. DFT calculations (B3LYP / 321G). The first 184 158 molecules correspond to the recalculation of QM9 and PC9. The others are newly generated molecules with the EvoMol software. Beware: the concatenated tar file is 141GB of compressed files.
Source: https://figshare.com/articles/dataset/OD9_dataset_part_15_28/13102532/1
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 14

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 5

favorite 0

comment 0

Github Mirror by Narabot
software

eye 35

favorite 0

comment 0

Sci-Hub's coverage of the scientific literature. DOI Coverage of Sci-Hub: This project is investigating the coverage of Sci-Hub/LibGen for academic articles. It's based on using DOIs to uniquely identify articles. The repository hosting the manuscript for this study is greenelab/scihub-manuscript. The latest manuscript version is available at https://greenelab.github.io/scihub-manuscript/. Environment: This repository uses conda to manage its environment as specified in environment.yml. Install...
Topics: GitHub, code, software, git
The Dataset Collection
by Kucuk, Ahmet
data

eye 0

favorite 0

comment 0

Images of LSDO in wavelength 94 CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/OROOPU&version=1.1
WARCZone: Outsider WARCs
by Internet Archive
web

eye 70

favorite 0

comment 0

The Dataset Collection
by Annette Menzel; Tongli Wang; Andreas Hamann; Maurizio Marchi; Dante Castellanos-Acuña; Duncan Ray
data

eye 1

favorite 0

comment 0

Gridded data at 1km resolution, MPI AOGCM, RCP 8.5, 2080s projections, monthly variables Tmin01-12 Tmax01-12 Tave01-12 Prec01-12. Unzip the archive with 7-zip.org.
Source: https://springernature.figshare.com/articles/dataset/Grids_1km_MPI_AOGCM_RCP_8_5_2080s_Monthly/11827695/1
The Dataset Collection
by Javer, Avelino; Currie, Michael; Hokanson, Jim; Lee, Chee Wai; Li, Kezhi; Yemini, Eviatar; Grundy, Laura J; Li, Chris; Ch'ng, Quee-Lim; Schafer, William R; Kerr, Rex; Brown, André EX
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=ht3JumrBqws strain : CB1376 timestamp : 2010-07-22T11:47:31+01:00 gene : daf-3 chromosome : X allele : e1376 strain_description : daf-3(e1376)X sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away...
Source: https://zenodo.org/record/1016243
The Dataset Collection
by Kristen Naegle
data

eye 0

favorite 0

comment 0

This is the expanded set of all predictions for GPS, run on the entire reference proteome, including sites not known to be phosphorylated. This dataset is used to perform a fast update when new phosphosites are discovered. The uncompressed folder will yield a large CSV file with predictions in list format (i.e. one line per kinase-substrate prediction). Columns, in this order: substrate_id - unique substrate (accession_site) ID; substrate_acc - UniProt accession of substrate protein; substrate_name...
Source: https://figshare.com/articles/dataset/Whole_proteome-level_GPS_predictions_part_2_/13228379/1
The Dataset Collection
by Soeren Lukassen; Robert Lorenz Chua; Timo Trefzer; Nicolas C. Kahn; Marc A. Schneider; Thomas Muley; Hauke Winter; Michael Meister; Carmen Veith; Agnes W. Boots; Bianca P. Hennig; Michael Kreuter; Christian Conrad; Roland Eils
data

eye 1

favorite 0

comment 0

This dataset contains count matrices and per-cells metadata tables for RNA sequencing of 39778 single nuclei from healthy primary lung samples of 12 lung adenocarcinoma patients as well as 17451 single human bronchiole epithelial cells from 4 donors. All samples were processed using the 10X Genomics Chromium platform with v2 chemistry and sequenced with one sample per lane on an Illumina HiSeq4000. Reads were aligned to the hg19 reference genome version 1.2.0 obtained from 10X Genomics. Data...
Source: https://figshare.com/articles/dataset/Single-cell_RNA-Seq_of_human_primary_lung_and_bronchial_epithelium_cells/11981034/2
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0225, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0225_20180528a_videos/12978344/1
The Dataset Collection
by Vladimir Platonov; Mikhail Varentsov
data

eye 1

favorite 0

comment 0

COSMO-CLM Russian Arctic climate dataset
Source: https://figshare.com/articles/dataset/Arctic_Reanalysis_COSMO-CLM_1989/13108157/1
The Dataset Collection
by Robert Sinkovits
data

eye 1

favorite 0

comment 0

Synthetic T cell receptor beta chain repertoire generated using IGoR software. Record format: V-gene D-gene J-gene amino-acid-CDR3, where CDR3 contains anchor residues (V-gene Cys, J-gene Phe/Val).
Source: https://figshare.com/articles/dataset/tcr_beta_synrep_set1_06_txt_bz2/12413465/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 3

favorite 0

comment 0

Rat 0183, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0183_20170811a_videos/13070594/1
The Dataset Collection
by Pierre-Paul De Breuck
data

eye 1

favorite 0

comment 0

Unzip and import this file with the MODNet package as a MODData. Contains all inorganic compounds from the Materials Project (MP) as of June 2018. Useful for making predictions on the MP from a trained MODNet. https://github.com/ppdebreuck/modnet
Source: https://figshare.com/articles/dataset/Materials_Project_MP_2018_6_MODData/12834275/1
The Dataset Collection
by Hylke E. Beck; Seth Westra; Jackson Tan; Florian Pappenberger; George J. Huffman; Tim R. McVicar; Gaby J. Grundemann; Noemi Vergopolan; Hayley J. Fowler; Elizabeth Lewis; Koen Verbist; Eric F. Wood
data

eye 1

favorite 0

comment 0

The Precipitation Probability DISTribution (PPDIST) dataset represents a collection of global high-resolution (0.1°) observation-based climatologies (1979–2018) of the occurrence and peak intensity of precipitation at daily and 3-hourly time scales. For more details, please see the following open-access paper: Beck, H. E., Westra, S., Tan, J., Pappenberger, F., Huffman, G. J., McVicar, T. R., Gründemann, G. J., Vergopolan, N., Fowler, H. J., Lewis, E., Verbist, K., and Wood, E. F.: PPDIST:...
Source: https://figshare.com/articles/dataset/PPDIST_Global_observation-based_precipitation_probability_distribution_climatologies/12317219/4
The Dataset Collection
by hansen zhao
data

eye 1

favorite 0

comment 0

raw data
Source: https://figshare.com/articles/dataset/raw-image-data/12792554/1
The Dataset Collection
by Joan Pulupa
data

eye 2

favorite 0

comment 0

Karyopherin content at the nuclear periphery induces conformational changes in Nup54-mEGFP 494 but not Nup133-mEGFP.
Source: https://figshare.com/articles/dataset/Figure5F_Nup54_0_plusRan_plusKaps_t90/13339100/1
The Dataset Collection
by Joan Pulupa
data

eye 1

favorite 0

comment 0

Conformational changes of the Inner Ring of the NPC revealed by perturbations of cargo state in CRISPR cell lines.
Source: https://figshare.com/articles/dataset/Figure4K_Nup133_mEGFP_-9_noRanQ69L/13338779/1
The Dataset Collection
by Xiaowen Nie; Weiyang Mo
data

eye 1

favorite 0

comment 0

The Peacock Chinese Twitter Corpus (PCTC) contains 4911813 tweets (including original tweets and replies, excluding retweets) made in simplified Chinese from 2007 to 2020. The documents are stored in MongoDB in JSON format. User Interface: www.peacockpus.com
Source: https://figshare.com/articles/dataset/Peacock_Chinese_Twitter_Corpus_PCTC_/13489239/1
The Dataset Collection
by Vladimir Platonov; Mikhail Varentsov
data

eye 1

favorite 0

comment 0

COSMO-CLM Russian Arctic climate dataset
Source: https://figshare.com/articles/dataset/Arctic_COSMO-CLM_Reanalysis_2016/13266656/1
The Dataset Collection
by Rika Anderson
data

eye 1

favorite 0

comment 0

anvi'o contigs database and profile database for sample FS872 from the Mid-Cayman Rise. See project description for instructions.
Source: https://figshare.com/articles/dataset/FS872_contigs_and_profile_databases/5603233/1
The Dataset Collection
by Multicellgenome Lab
data

eye 1

favorite 0

comment 0

This data has been produced using BGI assembly + Transdecoder. Guifré Torruella, Romain Derelle, Jordi Paps, B. Franz Lang, Andrew J Roger, Kamran Shalchian-Tabrizi & Iñaki Ruiz-Trillo (2012). Phylogenetic relationships within the Opisthokonta based on phylogenomic analyses of conserved single copy protein domains. Molecular Biology and Evolution 29(2): 531-544. Access matrix www.multicellgenome.com
Source: https://figshare.com/articles/dataset/Trancriptome_-_Amoebidium_parasiticum/4714243/1
The Dataset Collection
by Li Ma; Quan Liu; Ren-Chun Chiu; Shou-Zen Fan; Maysam F. Abbod; Jiann-Shing Shieh
data

eye 1

favorite 0

comment 0

This dataset includes ECG waveform data from 110 patients under anesthesia. The corresponding anesthetic depth curves, as evaluated by five doctors, are uploaded together. These raw data are for the PeerJ journal manuscript "HRV derived data similarity and distribution index to measure anesthetic depth based on ensemble neural network".
Source: https://figshare.com/articles/dataset/Raw_Data_rar/5254426/1
The Dataset Collection
by Youcong Chao; Xiaoqun Liu; Shijun Guo
data

eye 1

favorite 0

comment 0

File 1 includes seven subsets named by year, “2007” to “2013”. Specifically, in file “2007”, there are subsets of 25 texts and 50 files. From our samples, we constructed 25 portfolios using all of the individual stocks, named “s1b1”, “s1b2”, “s1b3”, “s1b4”, “s1b5”, …, “s5b1”, “s5b2”, “s5b3”, “s5b4”, “s5b5”. The 25 subset texts are the constituent stocks for each portfolio, named...
Source: https://figshare.com/articles/dataset/File1_The_original_data_and_the_data_of_constructed_25_portfolios/4658794/1
The Dataset Collection
by Sean Hardison
data

eye 1

favorite 0

comment 0

Acoustic data recorded in an intertidal zone in Wilmington, North Carolina over summer 2016. See attached R script for calibration instructions.
Source: https://figshare.com/articles/dataset/5_21_2016_IL_50_/4531361/1
The Dataset Collection
by Liu, Yupeng
data

eye 1

favorite 0

comment 0

Startup Cartography Project (http://www.startupcartography.com) CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/BMRPVH&version=8.0
The Dataset Collection
data

eye 1

favorite 0

comment 0

This code replicates the models in the paper "Compulsory Voting and Dissatisfaction with Democracy", by Shane Singh, which appears in the British Journal of Political Science. All models estimated in Stata 13. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/2WTACJ&version=1.0