368
UPLOADS


DOI-LANDING-CRAWL-2018-06
by Internet Archive Web Group
data

eye 9

favorite 0

comment 0

This item contains output files related to the DOI-LANDING-CRAWL-2018-06 crawl of Crossref DOI redirect landing pages:
- list of Crossref DOI numbers attempted
- an index of DOI, URL, and final HTTP status codes
Internet Archive Research Publication Crawls
by CNKI
data

eye 0

favorite 0

comment 0

Metadata about COVID-19 papers downloaded from:  http://en.gzbd.cnki.net/GZBT/brief/Default.aspx
Web PDF Training Sets
by Internet Archive Web Group
data

eye 15

favorite 0

comment 0

This item contains three .zip archives, each containing a sample corpus of about 10,000 (or more) HTML documents from the IA web archive. For each, there is some form of metadata (CDX or JSON) with information about the original URL and timestamp for each document, and then directories containing HTML, extracted TEI-XML, and extracted TXT for each document. Some fraction of documents failed to download or failed to extract, so there are fewer .html (and derivative) files than...
The Dataset Collection
by John Hildyard
data

eye 1

favorite 0

comment 0

This file contains supplementary figures 1-5 from the extended data of the manuscript "Single-transcript multiplex in situ hybridisation reveals unique patterns of dystrophin isoform expression in the developing mammalian embryo" by John C.W. Hildyard, Abbe H. Crawford, Faye Rawson, Dominique O. Riddell, Rachel C.M. Harron, and Richard J. Piercy.
Source: https://figshare.com/articles/dataset/Dystrophin_multiplex_ISH_Extended_data/12040746/1
The Dataset Collection
by Broad DepMap
data

eye 1

favorite 0

comment 0

This dataset contains the results of Avana library CRISPR-Cas9 genome-scale knockout (prefixed with Achilles) as well as mutation, copy number and gene expression data (prefixed with CCLE) for cancer cell lines as part of the Broad Institute’s Cancer Dependency Map project. We have repackaged our fileset to include all quarterly-updating datasets produced by DepMap. The Avana CRISPR-Cas9 genome-scale knockout data has expanded to include 739 cell lines, the RNAseq data includes 1270 cell...
Source: https://figshare.com/articles/dataset/DepMap_20Q1_Public/11791698/2
These are solar wind in situ data arrays in python pickle format suitable for machine learning, i.e. the arrays consist only of numbers, no strings and no datetime objects. See AAREADME_insitu_ML.txt for more explanation. If you use these data for peer reviewed scientific publications, please get in touch concerning usage and possible co-authorship by the authors (C. Möstl, A. J. Weiss, R. L. Bailey, A. Isavnin): christian.moestl@oeaw.ac.at or twitter @chrisoutofspace Made with...
Source: https://figshare.com/articles/dataset/Solar_wind_in_situ_data_suitable_for_machine_learning_python_numpy_arrays_STEREO-A_B_Wind_Parker_Solar_Probe_Ulysses_Venus_Express_MESSENGER/12058065/2
The Dataset Collection
by Hao Luo
data

eye 1

favorite 0

comment 0

Annotation of alternative splicing events in GENCODE
Source: https://figshare.com/articles/dataset/GENCODE/12524393/7
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0197, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0197_20171205a_videos/12947543/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories data. Trajectories of WW domain.
Source: https://figshare.com/articles/dataset/GTT-1-protein-120_dcd/12162789/1
The Dataset Collection
by Tim Fischer
data

eye 1

favorite 0

comment 0

All recordings and source files for the measurements with this participant. For details, please see the "Methods" section of the publication "Multichannel acoustic source and image dataset for the cocktail party effect in hearing aid and implant users".
Source: https://figshare.com/articles/dataset/Human_Subjects_Audio_ID_09_zip/12771479/1
The Dataset Collection
by Simon Rasmussen
data

eye 1

favorite 0

comment 0

Near-complete bacterial genomes produced by VAMB from the Almeida et al. (Nature, 2019) benchmark dataset (1,000 human gut microbiome samples). This is part 5 of 5.
Source: https://figshare.com/articles/dataset/Near_Complete_Bins_Almeida_dataset_part_E/13221743/1
The Dataset Collection
by Joan Pulupa
data

eye 1

favorite 0

comment 0

The p:s ratios of Nup54-mEGFP 494 fusion proteins with a flexible linker do not shift upon amino acid additions.
Source: https://figshare.com/articles/dataset/Figure1G_Nup54-mEGFP494_flex0_/13333757/1
The Dataset Collection
by Joel Sharbrough; Justin L. Conover; Corrinne Grover; Matheus Fernandes Gyorfy; Emma R. Miller; Jonathan F. Wendel; Daniel Sloan
data

eye 1

favorite 0

comment 0

Whole-genome duplications (WGDs), in which the number of nuclear genome copies is elevated as a result of autopolyploidy or allopolyploidy, are a prominent process of diversification in eukaryotes. The genetic and evolutionary forces that WGD imposes upon cytoplasmic genomes are not well understood, despite the central role that cytonuclear interactions play in eukaryotic function and fitness. Cellular respiration and photosynthesis depend upon successful interaction between the 3000+...
Source: https://figshare.com/articles/dataset/Global_patterns_of_subgenome_evolution_in_organelle-targeted_genes_of_six_allotetraploid_angiosperms/13473207/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0198, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0198_20171205a_videos/13174448/1
The Dataset Collection
data

eye 1

favorite 0

comment 0

Dodders (Cuscuta spp., Convolvulaceae) are root- and leafless parasitic plants that parasitize a wide range of hosts. The C. australis genome harbors only 19671 protein-coding genes, and 11.7% of the conserved orthologs in autotrophic plants are lost in C. australis. Many of these gene loss events likely result from its parasitic lifestyle and large body plan changes. This dataset contains the phylogenetic analysis data for gene loss detection and the gene alignments used for selection analysis.
Source: https://figshare.com/articles/dataset/Extend_Data_for_cuscuta_australis_genome_analysis/6072131/1
The Dataset Collection
by Xiaolu Yu
data

eye 1

favorite 0

comment 0

Sequencing data of microRNA released from mPFC in normal mice.
Source: https://figshare.com/articles/dataset/sequencing_data_NPFC_/5808885/1
Bulk Bibliographic Metadata
data

eye 7

favorite 0

comment 0

Mirrored from:  https://github.com/njahn82/vanished_journals/tree/master/data
Internet Archive Research Publication Crawls
by Wanfang Data
data

eye 4

favorite 0

comment 0

Metadata and some fulltext PDFs from Wanfang Data, downloaded 2020-03-29 from http://subject.med.wanfangdata.com.cn/Channel/7
The Dataset Collection
by Honorata Kraskiewicz; Maria Paprocka; Aleksandra Bielawska-Pohl; Agnieszka Krawczenko; Kinga Panek; Judyta Kaczyńska; Agnieszka Szyposzyńska; Mateusz Psurski; Piotr Kuropka; Aleksandra Klimczak
data

eye 1

favorite 0

comment 0

Additional file 4. Migration activity of native HATMSC supernatants. MSU-1.1 cell migration activity was investigated at 37 °C in an incubation chamber (PeCon GmbH, Erbach, Germany) with 1% O2, 5% CO2 mounted on an Axio Observer inverted microscope equipped with a dry 5x objective (Zeiss, Gottingen, Germany). The movement of the cells was time-lapse recorded for 44 h at intervals of 2 h using Zen 2.6 Blue Edition Software (Zeiss, Gottingen, Germany) as 6 separate movies (one for each...
Source: https://springernature.figshare.com/articles/dataset/MOESM4_of_Can_supernatant_from_immortalized_adipose_tissue_MSC_replace_cell_therapy_An_in_vitro_study_in_chronic_wounds_model/11686431/1
The Dataset Collection
by D. Louis Collins; Gabriel Allan Devenyi; Raihaan Patel; Stephanie Tullo; Min Tae M Park; M. Mallar Chakravarty
data

eye 1

favorite 0

comment 0

Resulting atlas from the atlas-to-template warping technique performed registering the histologically-derived atlas to each of the 5 MRI templates using the ANIMAL_script code, delineating 108 subcortical structures.
Source: https://springernature.figshare.com/articles/dataset/Labels_of_108_subcortical_structures_for_brain3_NIfTI_format_/6068123/1
The Dataset Collection
by Martineau, Celine N.; Nollen, Ellen A. A.
data

eye 1

favorite 0

comment 0

This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=_4xGXdIUSgs strain : CB1112 timestamp : 2014-04-27T15:28:17+02:00 strain_description : cat-2(e1112)II sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1189908
The Dataset Collection
by Salamon, Justin; Bittner, Rachel; Bonada, Jordi; Bosch, Juan Jose; Gómez, Emilia; Bello, Juan Pablo
data

eye 1

favorite 0

comment 0

MDB-mf0-synth
=============
MDB-mf0-synth (c) by Justin Salamon, Rachel Bittner, Jordi Bonada, Juan Jose Bosch, Emilia Gómez and Juan Pablo Bello.
MDB-mf0-synth is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). You should have received a copy of the license along with this work. If not, see http://creativecommons.org/licenses/by-nc/4.0/
Created By
----------
Justin Salamon*, Rachel Bittner*, Jordi Bonada^, Juan Jose Bosch^, Emilia...
Source: https://zenodo.org/record/1481170
The Dataset Collection
data

eye 1

favorite 0

comment 0

KG-COVID-19 graph in KGX TSV format, built on Sep 1, 2020, with no CORD-19 data
Source: https://zenodo.org/record/4012578
Bulk Bibliographic Metadata
by Microsoft Academic
data

eye 58

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
The isotopic composition of water vapour provides integrated perspectives on the hydrological histories of air masses and has been widely used for tracing physical processes in hydrological and climatic studies. Over the last two decades, the infrared laser spectroscopy technique has been used to measure the isotopic composition of water vapour near the Earth’s surface. Here, we have assembled a global database of high temporal resolution stable water vapour isotope ratios (δ18O and δD)...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/UTE3QI&version=5.0
The Dataset Collection
by Darcy Jones
data

eye 5

favorite 0

comment 0

All supplementary material and full resolution figures for the Predector pipeline manuscript. Figure 1: UpSet plot showing predictions of signal peptides, transmembrane domains, and effector-like properties for all known effectors in the training dataset (N=125). Rows indicate sets of proteins predicted to have a property related to effector prediction (e.g. a signal peptide), with the horizontal bar chart indicating set size. Columns indicate where the horizontal sets intersect with each...
Source: https://figshare.com/articles/dataset/Predector_-_supplementary_material/13325213/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 3

favorite 0

comment 0

Rat 0229, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0229_20181102a_videos/13010276/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0309, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0309_20191119a_videos/13110581/1
The Dataset Collection
by Vitalij Novickij; Irute Girkontaite
data

eye 2

favorite 0

comment 0

Raw data from experiments
Source: https://figshare.com/articles/dataset/Raw_dataset/13507140/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-155_dcdWW_domain_trajectories/12162894/2
The Dataset Collection
by Raymond Haggerty
data

eye 1

favorite 0

comment 0

Contains all the input and output files used to generate figure 4. Revision_IN_[motif].mat are the input files for each biologically relevant motif. OUT_[motif].mat are the output files run through MISC corresponding to each of the input files.
Source: https://figshare.com/articles/dataset/Figure_4_Reproduction_Files/12648905/1
The data and programs replicate tables and figures from "Human Capital and Development Accounting: New Evidence from Wage Gains at Migration", by Hendricks and Schoellman. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IPIBQP&version=1.1
Simulation data used in the paper CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/9JI57U&version=2.0
These include the relevant files needed for replicating George A. Krause and Matthew Zarit's "The Retraction of Policy Benefits Across U.S. Federal Agencies: Programmatic Cutbacks and Executive Control of U.S. Federal Grant Retrenchments." Forthcoming, Public Administration Review. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/AIICA1&version=1.0
Bulk Bibliographic Metadata
by Crossref
data

eye 644

favorite 2

comment 0

This file is a snapshot dump of the Crossref DOI metadata API, containing entries for over 94 million DOIs. Compared to the previous 2017-03 version (see archive.org item "crossref_doi_dump_201703"), this snapshot has a few million more works, but the corpus size is much larger (29 GB compressed vs. 7 GB compressed) as it now contains significantly more citation data, due to the efforts of the Initiative for Open Citations (I4OC) project. This was generated by running the scripts...
OA-JOURNAL-CRAWL-2020-07
collection
1,923
ITEMS
9.7M
VIEWS
by Internet Archive Web Group
collection

eye 9.7M

PLATFORM-CRAWL-2020
collection
649
ITEMS
404,310
VIEWS
by Internet Archive Web Group
collection

eye 404,310

Bulk Bibliographic Metadata
by ROAD: Directory of Open Access Scholarly Resources
data

eye 139

favorite 0

comment 0

This is a backup of ROAD/ISSN metadata from: http://road.issn.org/en/contenu/download-road-records
Dumps in both MARC XML and RDF format are included; see sub-directory for date of download. See also the earlier July 2017 dump at: https://archive.org/download/road-issn-2017
These files are under the Creative Commons Attribution-NonCommercial 4.0 International Public License (aka CC-BY-NC).
Topic: metadata
OA-JOURNAL-CRAWL-2020-07
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

Bulk Bibliographic Metadata
data

eye 23

favorite 0

comment 0

Downloaded from: https://zenodo.org/record/1438356
The Dataset Collection
by Akshay Yadav; David Fernández-Baca; Steven Cannon, scannon@iastate.edu
data

eye 1

favorite 0

comment 0

Yeast (YGOB) and legume gene families used for testing methods for detecting and correcting under-clustered and over-clustered gene families ygob_proteomes.tar.gz : Complete yeast proteomes from YGOB database ygob_family_fasta.tar.gz : Complete yeast families from the YGOB database ygob_family_fasta_delete.tar.gz : Intentionally under-clustered yeast families with missing 20% sequences ygob_family_fasta_insert_delete.tar.gz : Intentionally under-clustered yeast families with missing 20%...
Source: https://figshare.com/articles/dataset/Methods_for_analyzing_comparing_and_correcting_gene_families/12115305/1
The Dataset Collection
by Marieke Jepma; Tor Wager
data

eye 1

favorite 0

comment 0

Contrast maps of painful heat vs. baseline. Heat stimuli preceded by heat predictive cue, with cue and heat level fully crossed. Data formatted as fmri_data object as implemented in github.com/canlab/canlabCore
Source: https://figshare.com/articles/dataset/ie2/12797297/1
The Dataset Collection
by Thomas Cauchy
data

eye 1

favorite 0

comment 0

435 032 NWChem logfiles calculated thanks to the quchempedia BOINC project. DFT calculations (B3LYP / 321G). The first 184 158 molecules correspond to the recalculation of QM9 and PC9. The others are newly generated molecules with the EvoMol software. Beware: the concatenated tar file is 141 GB of compressed files.
Source: https://figshare.com/articles/dataset/OD9_dataset_part_15_28/13102532/1
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 14

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 5

favorite 0

comment 0

DOI-LANDING-CRAWL-2018-06
by Internet Archive Web Group
data

eye 6

favorite 0

comment 0

WARCZone: Outsider WARCs
by Internet Archive
web

eye 70

favorite 0

comment 0

The Dataset Collection
by Annette Menzel; Tongli Wang; Andreas Hamann; Maurizio Marchi; Dante Castellanos-Acuña; Duncan Ray
data

eye 1

favorite 0

comment 0

Gridded data at 1km resolution, MPI AOGCM, RCP 8.5, 2080s projections, monthly variables Tmin01-12 Tmax01-12 Tave01-12 Prec01-12. Unzip the archive with 7-zip.org.
Source: https://springernature.figshare.com/articles/dataset/Grids_1km_MPI_AOGCM_RCP_8_5_2080s_Monthly/11827695/1
The Dataset Collection
by Martineau, Celine N.; Nollen, Ellen A. A.
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=lFyjR-9fQ4k strain : OW940 timestamp : 2014-02-23T15:59:41+01:00 strain_description : zgIs128[P(dat-1)::alpha-Synuclein::YFP] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who :...
Source: https://zenodo.org/record/1200642
The Dataset Collection
by Martineau, Celine N.; Nollen, Ellen A. A.
data

eye 1

favorite 0

comment 0

This experiment is part of the  C.elegans behavioural database . For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=zCDBQ6mCbow strain : OW953 timestamp : 2014-03-12T10:24:09+01:00 strain_description : zgIs138[P(dat-1)::YFP] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1190867
The Dataset Collection
data

eye 1

favorite 0

comment 0

This is an archive of the raw data and analysis source code for the paper "A crop yield change emulator for use in GCAM and similar models: Persephone v1.0". The archive contains:
- data.zip: All source code for analysis, input data for analysis, and results of analysis
- persephone.proj: R project for ease of reproducing analysis
Source: https://zenodo.org/record/1414423
The Dataset Collection
by Kristen Naegle
data

eye 0

favorite 0

comment 0

This is the expanded set of all predictions for GPS, run on the entire reference proteome, including sites not known to be phosphorylated. This dataset is used to perform a fast update when new phosphosites are discovered. The uncompressed folder will yield a large CSV file with predictions in list format (i.e. one line per kinase-substrate prediction). Columns in this order:
- substrate_id - unique substrate (accession_site) ID
- substrate_acc - Uniprot accession of substrate protein
- substrate_name...
Source: https://figshare.com/articles/dataset/Whole_proteome-level_GPS_predictions_part_2_/13228379/1
The Dataset Collection
by Soeren Lukassen; Robert Lorenz Chua; Timo Trefzer; Nicolas C. Kahn; Marc A. Schneider; Thomas Muley; Hauke Winter; Michael Meister; Carmen Veith; Agnes W. Boots; Bianca P. Hennig; Michael Kreuter; Christian Conrad; Roland Eils
data

eye 1

favorite 0

comment 0

This dataset contains count matrices and per-cell metadata tables for RNA sequencing of 39778 single nuclei from healthy primary lung samples of 12 lung adenocarcinoma patients as well as 17451 single human bronchiole epithelial cells from 4 donors. All samples were processed using the 10X Genomics Chromium platform with v2 chemistry and sequenced with one sample per lane on an Illumina HiSeq4000. Reads were aligned to the hg19 reference genome version 1.2.0 obtained from 10X Genomics. Data...
Source: https://figshare.com/articles/dataset/Single-cell_RNA-Seq_of_human_primary_lung_and_bronchial_epithelium_cells/11981034/2
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0225, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0225_20180528a_videos/12978344/1
The Dataset Collection
by Vladimir Platonov; Mikhail Varentsov
data

eye 1

favorite 0

comment 0

COSMO-CLM Russian Arctic climate dataset
Source: https://figshare.com/articles/dataset/Arctic_Reanalysis_COSMO-CLM_1989/13108157/1
The Dataset Collection
by Robert Sinkovits
data

eye 1

favorite 0

comment 0

Synthetic T cell receptor beta chain repertoire generated using IGoR software. Record format: V-gene D-gene J-gene amino-acid-CDR3 Where CDR3 contains anchor residues (V-gene Cys, J-gene Phe/Val)
Source: https://figshare.com/articles/dataset/tcr_beta_synrep_set1_06_txt_bz2/12413465/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 3

favorite 0

comment 0

Rat 0183, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0183_20170811a_videos/13070594/1
The Dataset Collection
by Pierre-Paul De Breuck
data

eye 1

favorite 0

comment 0

Unzip and import this file with the MODNet package as a MODData. Contains all inorganic compounds from the Materials Project (MP) as of June 2018. Useful for making predictions on the MP from a trained MODNet. https://github.com/ppdebreuck/modnet
Source: https://figshare.com/articles/dataset/Materials_Project_MP_2018_6_MODData/12834275/1
The Dataset Collection
by Hylke E. Beck; Seth Westra; Jackson Tan; Florian Pappenberger; George J. Huffman; Tim R. McVicar; Gaby J. Grundemann; Noemi Vergopolan; Hayley J. Fowler; Elizabeth Lewis; Koen Verbist; Eric F. Wood
data

eye 1

favorite 0

comment 0

The Precipitation Probability DISTribution (PPDIST) dataset represents a collection of global high-resolution (0.1°) observation-based climatologies (1979–2018) of the occurrence and peak intensity of precipitation at daily and 3-hourly time scales. For more details, please see the following open-access paper: Beck, H. E., Westra, S., Tan, J., Pappenberger, F., Huffman, G. J., McVicar, T. R., Gründemann, G. J., Vergopolan, N., Fowler, H. J., Lewis, E., Verbist, K., and Wood, E. F.: PPDIST:...
Source: https://figshare.com/articles/dataset/PPDIST_Global_observation-based_precipitation_probability_distribution_climatologies/12317219/4
The Dataset Collection
by hansen zhao
data

eye 1

favorite 0

comment 0

raw data
Source: https://figshare.com/articles/dataset/raw-image-data/12792554/1
The Dataset Collection
by Joan Pulupa
data

eye 2

favorite 0

comment 0

Karyopherin content at the nuclear periphery induces conformational changes in Nup54-mEGFP 494 but not Nup133-mEGFP.
Source: https://figshare.com/articles/dataset/Figure5F_Nup54_0_plusRan_plusKaps_t90/13339100/1
The Dataset Collection
by Joan Pulupa
data

eye 1

favorite 0

comment 0

Conformational changes of the Inner Ring of the NPC revealed by perturbations of cargo state in CRISPR cell lines.
Source: https://figshare.com/articles/dataset/Figure4K_Nup133_mEGFP_-9_noRanQ69L/13338779/1
The Dataset Collection
by Xiaowen Nie; Weiyang Mo
data

eye 1

favorite 0

comment 0

The Peacock Chinese Twitter Corpus (PCTC) contains 4911813 tweets (including original tweets and replies, excluding retweets) made in simplified Chinese from 2007 to 2020. The documents are stored in MongoDB in JSON format. User Interface: www.peacockpus.com
Source: https://figshare.com/articles/dataset/Peacock_Chinese_Twitter_Corpus_PCTC_/13489239/1
The Dataset Collection
by Vladimir Platonov; Mikhail Varentsov
data

eye 1

favorite 0

comment 0

COSMO-CLM Russian Arctic climate dataset
Source: https://figshare.com/articles/dataset/Arctic_COSMO-CLM_Reanalysis_2016/13266656/1
The Dataset Collection
by Gregory Duveiller; Josh Hooker; Alessandro Cescatti
data

eye 1

favorite 0

comment 0

Intermediate monthly air temperature product for year 2012 calculated using a geographically weighted regression (GWR).
Source: https://springernature.figshare.com/articles/dataset/GWR_air_temperature_for_year_2012/7059653/1
The Dataset Collection
by Stefan Zdraljevic
data

eye 1

favorite 0

comment 0

GWA mapping results for all traits measured in 86 wild C. elegans isolates. Files generated using AndersenLab/cegwas2-nf on GitHub.
Source: https://figshare.com/articles/dataset/GWA_results_for_all_traits_mapped_in_manuscript/7458932/1
The Dataset Collection
by Morteza M. Saber
data

eye 1

favorite 0

comment 0

3000 bcf samples from bacteria
Source: https://figshare.com/articles/dataset/BCF_file_segfault/7412864/1
The Dataset Collection
by Liu, Yupeng
data

eye 1

favorite 0

comment 0

Startup Cartography Project (http://www.startupcartography.com) CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/BMRPVH&version=8.0
The Dataset Collection
data

eye 1

favorite 0

comment 0

Dataset to accompany Mayo, Cohen, and Maunsell's PLOS ONE article entitled "A refined neuronal population measure of visual attention" CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/3TDCHI&version=1.1
The Dataset Collection
by Coulombel, Nicolas
data

eye 1

favorite 0

comment 0

This dataset provides travel time and distances by car (off peak hour = OPH and morning peak hour = MPH, year 2008) and by public transit (year 2009) between municipalities of Ile-de-France, identified by their INSEE code. The data was generated using an adaptation of the MODUS 4-step model from the DRIEA Ile-de-France (originally implemented in VISUM) to the TransAD software. The DRIEA Ile-de-France cannot be held responsible for any mistake in the data, which remain the sole responsibility of...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/E85DBD&version=1.1
Contact tracing can play a vital role in controlling human-to-human transmission of a highly contagious disease such as COVID-19. To investigate the benefits and costs of contact tracing in the COVID-19 transmission, we develop an individual-based contact-network model and a susceptible-exposed-infected-confirmed (SEIC) epidemic model. We estimate the unknown parameters (reproductive ratio $R_0$ and confirmed rate $\delta_2$) by using confirmed case data. We model contact tracing in a two-layer...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/3IM82E&version=3.0
The Dataset Collection
by Salzman, Shayla
data

eye 1

favorite 0

comment 0

Weevil behavioral arena videos and custom scripts for MATLAB. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/R84DJB&version=3.0
The Dataset Collection
by Internet Archive Web Group
data

eye 67

favorite 0

comment 0

This item contains both metadata and fulltext PDF content (from the public web) related to research on COVID-19 and past influenza pandemics. This content backs the https://covid19.fatcat.wiki search interface. Rough numbers:
- over 51,000 metadata records from 2020-04-10 release of CORD19 corpus
- over 79,000 metadata records total (union of the above plus fatcat.wiki keyword matches)
- over 45,000 fulltext PDF files and derived PNG thumbnails and pdftotext text files
The upstream...
Topic: COVID-19, Coronavirus, SARS-CoV-2
Improving the political participation of immigrants could advance their interests and foster their integration into receiving countries. In this study, 23,800 citizens were randomly assigned to receive visits from political activists during the lead-up to the 2010 French regional elections. Treatment increased the turnout of immigrants without having any statistically significant effect on non-immigrants, while turnout was roughly equal in the control group. A postelectoral survey reveals that...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/DDCNEW&version=1.1
UNPAYWALL-PDF-CRAWL-2020-11
collection
199
ITEMS
1.6M
VIEWS
by Internet Archive Web Group
collection

eye 1.6M

The Dataset Collection
by Kaicun Wang; Can Wang; Wenjia Cai; Yidan Chen; Fang Guo; Jiachen Wang
data

eye 1

favorite 0

comment 0

Population Grid for China under SSP4RCP8.5 from 2010 to 2100
Source: https://springernature.figshare.com/articles/dataset/Population_Grid_for_China_SSP4RCP8_5/11317703/1
Bulk Bibliographic Metadata
by EuropePMC
data

eye 48

favorite 0

comment 0

Data mirrored from: https://europepmc.org/downloads
Contains a mapping between PubMed IDs (PMID), PubMedCentral IDs (PMCID), and DOI numbers, for over 29 million works.
Social Media Videos
by Alexis Ohanian Sr. 🚀
movies

eye 12

favorite 1

comment 0

This is from 2016. We need to #DoBetter https://t.co/58DYEnDFRA Source: https://twitter.com/alexisohanian/status/1267467074022125568 Uploader: Alexis Ohanian Sr. 🚀
Topics: Twitter, video, DoBetter
by Internet Archive Web Group
collection

eye 6,505

This collection contains web crawl data for a random selection of 500k (0.5 million) Crossref DOI redirects, including the doi.org redirect requests. The intent of this crawl is to gather loose statistics on the number of failing redirects, number of host websites that block automated crawling, and a corpus of HTML landing pages for metadata extraction (eg, "signposting" HTTP headers, linked data HTML metadata, semantic markup). Total size of (uncompressed) WARC data is 50 GB,...
Bulk Bibliographic Metadata
by Impactstory
data

eye 6

favorite 0

comment 0

The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 3

favorite 0

comment 0

Rat 0196, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0196_20171211a_videos/13173236/1
Bulk Bibliographic Metadata
by ORCID, Inc.
data

eye 102

favorite 0

comment 0

This item contains an annual copy of the ORCID public data file, as originally downloaded from: https://orcid.figshare.com/articles/dataset/ORCID_Public_Data_File_2020/13066970
More details about this content and its use are available at: https://orcid.org/content/orcid-public-data-file
This dataset is available under the public domain (CC-0).
Lost Posters
collection
102
ITEMS
4,315
VIEWS
collection

eye 4,315

Lost-and-found animals, items, and other messages transmitted on our public streets. Mostly in San Francisco.
Topic: poster
UNPAYWALL-PDF-CRAWL-2018-07
by Internet Archive Web Group
data

eye 1

favorite 0

comment 0

The Dataset Collection
by Gerard van der Schrier; Sergio Vicente-Serrano; Fernando Dominguez-Castro; Dhais Peña-Angulo; Enric Aguilar; Fergus Reig; Iván Noguera; Jesús Revuelto; Ahmed M. El Kenawy
data

eye 1

favorite 0

comment 0

INDECIS is a gridded dataset covering the whole of Europe, comprising 125 climate indices for the period 1950-2017. Climate indices were computed at different temporal scales (i.e., monthly, seasonal, and annual) and mapped at a grid interval of 0.25°.
Source: https://springernature.figshare.com/articles/dataset/INDECIS/11988309/1
The Dataset Collection
by PowerTAC
data

eye 1

favorite 0

comment 0

Log and boot files of game 105
Source: https://zenodo.org/record/1325414
Internet Archive Research Publication Crawls
by Wanfang Data
data

eye 6

favorite 0

comment 0

Metadata and some fulltext PDFs from Wanfang Data, downloaded 2020-03-29 from http://subject.med.wanfangdata.com.cn/Channel/7
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 40

favorite 0

comment 0

This is a snapshot of the AI2 (Semantic Scholar) "Open Research Corpus", as released May 3rd, 2018. These files were originally downloaded from AWS S3, via: http://labs.semanticscholar.org/corpus/ Note the restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, which includes field content. 'sample-S2-records.gz' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Community Texts
by LOCKSS
software

eye 14

favorite 0

comment 0

This item contains a mirror of the LOCKSS daemon software, as well as a Debian/Ubuntu package generated using the 'alien' package. To recreate and install these .deb files on Debian/Ubuntu, you would do something like:
    sudo apt install alien
    wget https://github.com/lockss/lockss-daemon/releases/download/release-candidate_1-73-b4/lockss-daemon-1.73.4-1.noarch.rpm
    sudo alien lockss-daemon-1.73.4-1.noarch.rpm
    sudo dpkg -i lockss-daemon_1.73.4-2_all.deb
These files are used as part of IA's...
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 54

favorite 0

comment 0

This is a backup of the "Open Academic Search" corpus, published by Semantic Scholar / Allen Institute for AI. For more info see http://labs.semanticscholar.org/corpus/. In particular, note the terms and conditions: Semantic Scholar Open Research Corpus is licensed under ODC-BY. When using the Semantic Scholar Open Research Corpus (“S2 ORC”) in a product or service, or including data in a redistribution, please cite the following paper: Waleed Ammar et al. 2018. Construction...
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

DeepLabCut output and processed 3D reconstructions of data for rat 0184. "_direct" zip files contain .csv files from the direct view and .mat files with cropping regions. "_right/left" zip files contain .csv files from the mirror view and .mat files with cropping regions. "_processed" files contain 3D reconstructions and reach analyses.
Source: https://figshare.com/articles/dataset/R0184-DLC_output/13204307/1
This item contains a transformed copy (single gzip'd JSON-per-line file, instead of tarball of xz-zipped JSON per-source files) of the metadata in item https://archive.org/details/core_oa_metadata_20180301. All the same licenses and caveats apply.
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0310, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0310_20191106a_videos/13110032/2
The Dataset Collection
by Gerardo Zegers
data

eye 3

favorite 0

comment 0

Post-event topography lidar scan (acquired in February-March 2017 by the Chilean Ministry of Public Works). This dataset has a 1 m × 1 m horizontal resolution and was post-processed to remove vegetation and buildings.
Source: https://figshare.com/articles/dataset/Lidar_DEM/12547709/1
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 17

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 4

favorite 0

comment 0

Mirrored from:  https://www.arc.gov.au/excellence-research-australia/era-2018-journal-list
The Dataset Collection
data

eye 1

favorite 0

comment 0

This project presents a dataset that is assembled from multiple sources between 2013 and 2017, including contaminants in drinking water, cancer incidence rates, public perception of the relationship between water contaminants and cancer on Twitter, and census data covering the population living in the United States. The units of analysis are 3,219 counties and 33,144 zip codes. The users of this dataset can address model-driven questions regarding water contaminants and cancer incidence rates...
Source: https://figshare.com/articles/dataset/A_dataset_integrating_water-related_public_health_social_media_census_and_administrative_data_in_the_United_States/12673157/2
The Dataset Collection
by Remote Sensing Group
data

eye 5

favorite 0

comment 0

IND_Dataset_V2
Source: https://figshare.com/articles/dataset/IND_Dataset_V2/12311249/1
The Dataset Collection
by Timothy Arnett; Pascale V. Guillot; Anna Maria Ranzoni; Michelangelo Corcelli
data

eye 1

favorite 0

comment 0

MicroCT of mouse tibiae-wild-type6
Source: https://springernature.figshare.com/articles/dataset/MicroCT_of_mouse_tibiae-wild-type6/5525449/1