
295
UPLOADS


Internet Archive Research Publication Crawls
by CNKI
data

eye 0

favorite 0

comment 0

Metadata about COVID-19 papers downloaded from:  http://en.gzbd.cnki.net/GZBT/brief/Default.aspx
Web PDF Training Sets
by Internet Archive Web Group
data

eye 15

favorite 0

comment 0

This item contains three .zip archives, each containing a sample corpus of about 10,000 (or more) HTML documents from the IA web archive. For each, there is some form of metadata (CDX or JSON) with information about the original URL and timestamp for each document, and then directories containing HTML, extracted TEI-XML, and extracted TXT for each document. There are some fraction of documents which failed to download or failed to extract, so there are fewer .html (and derivative) files than...
The Dataset Collection
by John Hildyard
data

eye 1

favorite 0

comment 0

This file contains supplementary figures 1-5 from the extended data of the manuscript Single-transcript multiplex in situ hybridisation reveals unique patterns of dystrophin isoform expression in the developing mammalian embryo John C.W. Hildyard, Abbe H. Crawford, Faye Rawson, Dominique O. Riddell, Rachel C.M. Harron, Richard J. Piercy
Source: https://figshare.com/articles/dataset/Dystrophin_multiplex_ISH_Extended_data/12040746/1
The Dataset Collection
by Broad DepMap
data

eye 1

favorite 0

comment 0

This dataset contains the results of Avana library CRISPR-Cas9 genome-scale knockout (prefixed with Achilles) as well as mutation, copy number and gene expression data (prefixed with CCLE) for cancer cell lines as part of the Broad Institute’s Cancer Dependency Map project. We have repackaged our fileset to include all quarterly-updating datasets produced by DepMap. The Avana CRISPR-Cas9 genome-scale knockout data has expanded to include 739 cell lines, the RNAseq data includes 1270 cell...
Source: https://figshare.com/articles/dataset/DepMap_20Q1_Public/11791698/2
These are solar wind in situ data arrays in python pickle format suitable for machine learning, i.e. the arrays consist only of numbers, no strings and no datetime objects. See AAREADME_insitu_ML.txt for more explanation. If you use these data for peer reviewed scientific publications, please get in touch concerning usage and possible co-authorship by the authors (C. Möstl, A. J. Weiss, R. L. Bailey, A. Isavnin): christian.moestl@oeaw.ac.at or twitter @chrisoutofspace Made with...
Source: https://figshare.com/articles/dataset/Solar_wind_in_situ_data_suitable_for_machine_learning_python_numpy_arrays_STEREO-A_B_Wind_Parker_Solar_Probe_Ulysses_Venus_Express_MESSENGER/12058065/2
The Dataset Collection
by Hao Luo
data

eye 1

favorite 0

comment 0

Annotation of alternative splicing events in GENCODE
Source: https://figshare.com/articles/dataset/GENCODE/12524393/7
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0197, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0197_20171205a_videos/12947543/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories data. Trajectories of WW domain.
Source: https://figshare.com/articles/dataset/GTT-1-protein-120_dcd/12162789/1
The Dataset Collection
by Tim Fischer
data

eye 1

favorite 0

comment 0

All recordings and source files for the measurements with this participant. For details, please see the "Methods" section of the publication "Multichannel acoustic source and image dataset for the cocktail party effect in hearing aid and implant users".
Source: https://figshare.com/articles/dataset/Human_Subjects_Audio_ID_09_zip/12771479/1
The Dataset Collection
by Simon Rasmussen
data

eye 1

favorite 0

comment 0

Near Complete bacterial genomes produced by VAMB from the Almeida et al., (Nature, 2019) benchmark dataset (1,000 human gut microbiome samples). This is part 5 of 5
Source: https://figshare.com/articles/dataset/Near_Complete_Bins_Almeida_dataset_part_E/13221743/1
The Dataset Collection
by Joan Pulupa
data

eye 1

favorite 0

comment 0

The p:s ratios of Nup54-mEGFP 494 fusion proteins with a flexible linker do not shift upon amino acid additions.
Source: https://figshare.com/articles/dataset/Figure1G_Nup54-mEGFP494_flex0_/13333757/1
The Dataset Collection
by Joel Sharbrough; Justin L. Conover; Corrinne Grover; Matheus Fernandes Gyorfy; Emma R. Miller; Jonathan F. Wendel; Daniel Sloan
data

eye 1

favorite 0

comment 0

Whole-genome duplications (WGDs), in which the number of nuclear genome copies is elevated as a result of autopolyploidy or allopolyploidy, are a prominent process of diversification in eukaryotes. The genetic and evolutionary forces that WGD imposes upon cytoplasmic genomes are not well understood, despite the central role that cytonuclear interactions play in eukaryotic function and fitness. Cellular respiration and photosynthesis depend upon successful interaction between the 3000+...
Source: https://figshare.com/articles/dataset/Global_patterns_of_subgenome_evolution_in_organelle-targeted_genes_of_six_allotetraploid_angiosperms/13473207/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0198, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0198_20171205a_videos/13174448/1
The Dataset Collection
by Aria Hahn
data

eye 1

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, which enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/DLF014_zip/3476849/1
Internet Archive Research Publication Crawls
by Wanfang Data
data

eye 4

favorite 0

comment 0

Metadata and some fulltext PDFs from Wanfang Data, downloaded 2020-03-29 from http://subject.med.wanfangdata.com.cn/Channel/7
The Dataset Collection
by Honorata Kraskiewicz; Maria Paprocka; Aleksandra Bielawska-Pohl; Agnieszka Krawczenko; Kinga Panek; Judyta Kaczyńska; Agnieszka Szyposzyńska; Mateusz Psurski; Piotr Kuropka; Aleksandra Klimczak
data

eye 1

favorite 0

comment 0

Additional file 4. Migration activity of native HATMSC supernatants. MSU-1.1 cell migration activity was investigated at 37 °C in an incubation chamber (PeCon GmbH, Erbach, Germany) with 1%O2, 5%CO2 mounted on an Axio Observer inverted microscope equipped with a dry 5x objective (Zeiss, Gottingen, Germany). The movement of the cells was time-lapse recorded for 44 h at intervals of 2 h using Zen 2.6 Blue Edition Software (Zeiss, Gottingen, Germany) as 6 separate movies (one for each...
Source: https://springernature.figshare.com/articles/dataset/MOESM4_of_Can_supernatant_from_immortalized_adipose_tissue_MSC_replace_cell_therapy_An_in_vitro_study_in_chronic_wounds_model/11686431/1
The Dataset Collection
data

eye 1

favorite 0

comment 0

KG-COVID-19 graph in KGX TSV format, built on Sep 1, 2020, with no CORD-19 data
Source: https://zenodo.org/record/4012578
Bulk Bibliographic Metadata
data

eye 7

favorite 0

comment 0

Mirrored from:  https://github.com/njahn82/vanished_journals/tree/master/data
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-155_dcdWW_domain_trajectories/12162894/2
The Dataset Collection
by Raymond Haggerty
data

eye 1

favorite 0

comment 0

Contains all the input and output files used to generate figure 4 Revision_IN_[motif].mat are the input files for each biologically relevant motif. OUT_[motif].mat are the output files run through MISC corresponding to each of the input files.
Source: https://figshare.com/articles/dataset/Figure_4_Reproduction_Files/12648905/1
The Dataset Collection
by Darcy Jones
data

eye 5

favorite 0

comment 0

All supplementary material and full resolution figures for the Predector pipeline manuscript. Figure 1: UpSet plot showing predictions of signal peptides, transmembrane domains, and effector-like properties for all known effectors in the training dataset (N=125). Rows indicate sets of proteins predicted to have a property related to effector prediction (e.g. a signal peptide), with the horizontal bar chart indicating set size. Columns indicate where the horizontal sets intersect with each...
Source: https://figshare.com/articles/dataset/Predector_-_supplementary_material/13325213/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 3

favorite 0

comment 0

Rat 0229, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0229_20181102a_videos/13010276/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0309, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0309_20191119a_videos/13110581/1
The Dataset Collection
by Vitalij Novickij; Irute Girkontaite
data

eye 2

favorite 0

comment 0

Raw data from experiments
Source: https://figshare.com/articles/dataset/Raw_dataset/13507140/1
This corpus consists of texts written in Chinese during the Ming and Qing dynasties, spanning roughly 1368 to the early 20th century (the newest text was written in 1916). The texts have been mostly pre-cleaned. The only English found in most of the files are indications of chapter or scroll breaks in the format ~~~START| Title of Chapter/Scroll |START~~~ This was done to ease breaking the texts into organic sections. Most of the corpus consists of texts found on wenxian.fanren8.com, a website...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/4ZVSKA&version=1.0
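The chapter and scroll markers described above lend themselves to a simple split. The following is a minimal sketch, assuming the markers appear exactly in the form ~~~START| Title |START~~~ quoted in the description; the function name split_sections and the sample text are hypothetical and are not taken from the corpus itself.

```python
import re

# Marker format as quoted in the item description:
#   ~~~START| Title of Chapter/Scroll |START~~~
# Assumption: the title never contains the closing "|START~~~" sequence.
MARKER = re.compile(r"~~~START\|(.*?)\|START~~~")


def split_sections(text):
    """Return (title, body) pairs, one per marker found in text.

    Text that precedes the first marker is ignored.
    """
    parts = MARKER.split(text)
    # With one capture group, re.split yields:
    #   [preamble, title1, body1, title2, body2, ...]
    titles = parts[1::2]
    bodies = parts[2::2]
    return [(t.strip(), b.strip()) for t, b in zip(titles, bodies)]


# Hypothetical sample, made up for illustration only.
sample = (
    "~~~START| 卷一 |START~~~\n第一段\n"
    "~~~START| 卷二 |START~~~\n第二段\n"
)
sections = split_sections(sample)
for title, body in sections:
    print(title, len(body))
```

Files in the corpus that lack markers would yield an empty list under this sketch, so a caller may want to fall back to treating the whole file as one section.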
Simulation data used in the paper CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/9JI57U&version=2.0
These include the relevant files needed for replicating George A. Krause and Matthew Zarit's "The Retraction of Policy Benefits Across U.S. Federal Agencies: Programmatic Cutbacks and Executive Control of U.S. Federal Grant Retrenchments." Forthcoming, Public Administration Review. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/AIICA1&version=1.0
The Lantern Library
movies

eye 45

favorite 0

comment 0

This is a tour of the Lantern Library (a leftist political literature collection in Cambridge, MA) by creator James Herod, as filmed by Simmons College Library Science students.
PLATFORM-CRAWL-2020
collection
649
ITEMS
379,588
VIEWS
by Internet Archive Web Group
collection

eye 379,588

OA-JOURNAL-CRAWL-2020-07
collection
1,923
ITEMS
9.5M
VIEWS
by Internet Archive Web Group
collection

eye 9.5M

Bulk Bibliographic Metadata
by Microsoft Academic
data

eye 58

favorite 0

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
Social Media Videos
movies

eye 7

favorite 1

comment 0

Hear the message from @FlightConley on the steps of @JSOPIO ⤵ https://t.co/CnbD7w4Fkc Source: https://twitter.com/Jaguars/status/1268905383218876417 Uploader: #DUUUVAL
Topics: Twitter, video
Bulk Bibliographic Metadata
by Harshdeep Singh, Robert West, & Giovanni Colavizza
data

eye 11

favorite 0

comment 0

Mirrored from: https://zenodo.org/record/3940692 Harshdeep Singh, Robert West, & Giovanni Colavizza. (2020). Wikipedia Citations: A comprehensive dataset of citations with identifiers extracted from English Wikipedia (Version 0.2) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3940692
The Dataset Collection
by Sarah Jaffe
data

eye 1

favorite 0

comment 0

This landsat scene is an example used in the sagebrush-ecosystem-modeling github workflow for NEON's Onaqui Mountains (ONAQ) site. Landsat-8 image courtesy of the U.S. Geological Survey
Source: https://figshare.com/articles/dataset/Landsat-L1-038_032-201710-ONAQ-scene/12525548/2
The Dataset Collection
by Ben Fulcher
data

eye 12

favorite 0

comment 0

A diverse selection of 1000 empirical time series, along with results of an hctsa feature extraction, using v1.03 of hctsa and Matlab 2019b, computed on a linux server at Sydney University. The results of the computation are in the hctsa file, HCTSA_Empirical1000.mat for use in Matlab using v1.03 of hctsa . The same data is available in .csv format (e.g., for use with non-Matlab computing environments) for the hctsa_datamatrix.csv (results of feature computation), with information about rows...
Source: https://figshare.com/articles/dataset/1000_Empirical_Time_series/5436136/7
Bulk Bibliographic Metadata
by Microsoft Academic Search
data

eye 319

favorite 0

comment 0

This is a copy of the Microsoft Academic Graph corpus of scholarly publications and citations, based on crawls from the open web. Metadata (authors, DOI numbers, journals, citations, keywords, affiliations, etc) is included for more than 125 million publications. The corpus is a single 27GB zipfile that extracts into about 96GB of flat tab-separated text files, cross-referenced using identifier columns. Schema information can be found in the `readme.txt` file, and usage restrictions can be...
Bulk Bibliographic Metadata
by Microsoft Academic
data

eye 127

favorite 1

comment 0

This is an updated snapshot of the Microsoft Academic Graph corpus. Microsoft generously makes this corpus available at no cost under the ODC-BY "open data license" ( https://opendatacommons.org/licenses/by/1.0/ ). See the link for details; at a minimum this license requires downstream users to acknowledge the creator. You can read more about the corpus, including how to obtain updated copies on Microsoft Azure, a schema reference, etc, at the following URLs and in the following...
The Dataset Collection
data

eye 14

favorite 1

comment 0

This is a mirror of the Mapping Police Violence spreadsheet, as downloaded from  https://mappingpoliceviolence.org/  on 2020-06-01. See that site for links to reporting, to donate, and for more context in general. A mirror of the website is also preserved in wayback:  http://web.archive.org/web/20200602001333/https://mappingpoliceviolence.org/
The Dataset Collection
by Annette Menzel; Tongli Wang; Andreas Hamann; Maurizio Marchi; Dante Castellanos-Acuña; Duncan Ray
data

eye 1

favorite 0

comment 0

Gridded data at 1km resolution, HadGem2 AOGCM, RCP 8.5, 2050s projections, monthly variables Tmin01-12 Tmax01-12 Tave01-12 Prec01-12. Unzip the archive with 7-zip.org.
Source: https://springernature.figshare.com/articles/dataset/Grids_1km_HadGem2_AOGCM_RCP_8_5_2050s_Monthly/11827572/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-275_dcdWW_domain_trajectories/12163527/2
tar file with numerical Euler solution and processed data used in plotting the pressure fields
Source: https://rs.figshare.com/articles/dataset/Euler_solution_at_t_0_0031_from_A_fluid_mechanic_s_analysis_of_the_teacup_singularity/12739523/1
The Dataset Collection
by Yungang Xu
data

eye 0

favorite 0

comment 0

The data and codes used to reproduce the Figures and Tables for Result section 4: ScIGANs enhances the inference of cellular trajectory.
Source: https://figshare.com/articles/dataset/Section_4_zip/11509800/1
Screen capture of the research about the public availability of medical literature concerning the corona virus and its medication. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IZ3VAU&version=1.0
Wbbyyr: FastText language models for Mandarin Chinese, trained on 14,440,000 Sina Weibo posts for each year in 2012-2018. The 14,440,000 posts from each year are split into 10 folds. Due to Zenodo size limit, this dataset contains only the first fold from each year. Each model is trained for 20 iterations. Each vector is 300 dimensions long.
Source: https://zenodo.org/record/3605209
The Dataset Collection
data

eye 1

favorite 0

comment 0

Is there more violence in the middle? Over 100 studies have analyzed whether violent outcomes such as civil war, terrorism, and repression are more common in regimes that are neither full autocracies nor full democracies, yet findings are inconclusive. While this hypothesis is ultimately about functional form, existing work uses models in which a particular functional form is assumed. Existing work also uses arbitrary operationalizations of “the middle”. This paper aims to resolve the...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/LNUYXZ&version=1.1
The Dataset Collection
by Trujillo, Milo
data

eye 1

favorite 0

comment 0

This dataset includes video metadata and comment data from the video platform BitChute. We believe that this data set contains all videos uploaded between June 28th and December 3rd 2019, outside of two brief outages due to power shutdowns on October 23rd and November 18th. For each video upload recorded, we visited the video one week after its upload to record view counts and comments. Therefore, comments posted more than one week after the video upload date are not included in our dataset....
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/9QCDHZ&version=1.0
Huntington referred to a ‘clash of civilizations’ revealing itself in international terrorism, particularly in the clash between the Islamic civilization and the West. The authors confront his hypotheses with ones derived from the strategic logic of international terrorism. They predict more terrorism against nationals from countries whose governments support the government of the terrorists’ home country. Like Huntington, they also predict excessive terrorism on Western targets, not...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/HOGQGD&version=1.0
The Dataset Collection
by Tobin, John
data

eye 1

favorite 0

comment 0

ALMA 13CO Measurement sets CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/PIDQM1&version=1.0
The Dataset Collection
data

eye 1

favorite 0

comment 0

This replication archive contains all data and code to replicate the results in "Measuring Political Positions from Legislative Speech" by Benjamin E. Lauderdale and Alexander Herzog. Article abstract : Existing approaches to measuring political disagreement from text data perform poorly except when applied to narrowly selected texts discussing the same issues and written in the same style. We demonstrate the first viable approach for estimating legislator-specific scores from the...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/RQMIV3&version=1.0
The data for a manuscript submitted to Geophysical Research Letters. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/TZM7CR&version=1.0
The Dataset Collection
by William Wint; Neil Alexander
data

eye 1

favorite 0

comment 0

The MOOD project (MOnitoring Outbreak events for Disease surveillance in a data science context. H2020) has geo-referenced the data Google has published as a series of PDF files presenting reports on national and subnational human mobility levels relative to a baseline data of late January 2020. The details and the PDF files can be found at https://www.google.com/covid19/mobility/ . More detail on these files can be found at https://www.moodspatialdata.com/humanmobilityforcovid19 The first set...
Source: https://figshare.com/articles/dataset/Maps_of_human_mobility_change_during_the_COVID-19_outbreak/12130980/65
The Dataset Collection
by Benoit Pasquier
data

eye 1

favorite 0

comment 0

See AIBECS.jl
Source: https://figshare.com/articles/dataset/OCIM2_KiLOW_He_bson/11911131/1
The Dataset Collection
by Mike R. James; Gilles Antoniazza
data

eye 1

favorite 0

comment 0

James et al. (2020) - Mitigating systematic error in topographic models for geomorphic change detection: Accuracy, precision and considerations beyond off-nadir imagery UAV-collected image dataset and associated image coordinates of GCP observations for surveys of la Borgne d'Arolla. See paper for details. 60m_10degr_3 : UAV image data File formats for image observations: .xml file contains GCP image observations (and other data) exported from Photoscan v1.4.2. Can be imported back into...
Source: https://figshare.com/articles/dataset/James_et_al_2020_BdA_60m_10degr_3/11786754/1
The Dataset Collection
by Sisi Chen
data

eye 1

favorite 0

comment 0

This file contains single-cell RNA-seq datasets associated with : Chen, S, et al. 2020. “Dissecting Heterogeneous Cell Populations across Drug and Disease Conditions with PopAlign.” bioRxiv . https://doi.org/10.1101/421354. ------------------------ DATASETS ------------------------ Each data folder includes: barcodes.tsv : cell barcodes genes.tsv : genes [ Name ].mtx : sparse matrix file of transcript counts Subfolders MM1-MM4: --------------- 22,294 Peripheral Blood Mononuclear Cells...
Source: https://figshare.com/articles/dataset/PopAlign_Data/11837097/3
The Dataset Collection
by Maria Izabel Cavassim; Sara Moeskjaer; Bryden Fields; Asger Bachmann; Bjarni Vilhjálmsson; Mikkel H Schierup; J. Peter W. Young; Stig Uggerhøj Andersen
data

eye 1

favorite 0

comment 0

Supplementary material (Tables and Figures) for the article: Cavassim et al. 2020: Symbiosis genes show a unique pattern of introgression and selection within a Rhizobium leguminosarum species complex. Data.zip: comprises gene alignments and SNP matrices of a Rhizobium complex. Further detail can be found in the article https://doi.org/10.1099/mgen.0.000351
Source: https://figshare.com/articles/dataset/Gene_alignments_and_SNP_matrices_of_a_Rhizobium_complex/11568894/5
The complete mitochondrial genome of Aesop slipper lobster Scyllarides haanii (De Haan, 1841) sequencing cleandata
Source: https://figshare.com/articles/dataset/The_complete_mitochondrial_genome_of_Aesop_slipper_lobster_Scyllarides_haanii_De_Haan_1841_sequencing_cleandata/12805703/1
100-dimensional word2vec CBOW negative sampling word embeddings for the Cyrillic Uzbek language. Trained using the webcrawl corpus v1.
Source: https://figshare.com/articles/dataset/uzb-cyrl-webcrawl-v1-word2vec-cbow-ns-100d/12991472/2
The Dataset Collection
by Li Gao
data

eye 1

favorite 0

comment 0

Gene matrix for CESC
Source: https://figshare.com/articles/dataset/Gene_matrix_for_CESC/12751418/1
The Dataset Collection
by Vera Terblanche
data

eye 1

favorite 0

comment 0

Expression of prohormone convertase 1/3 (RNA) in the larval brain of Tribolium. Neuropile stained with an antibody against synapsin; nuclear stain.
Source: https://figshare.com/articles/dataset/PC13_expression_in_the_Tribolium_larval_brain/13072415/1
The Dataset Collection
data

eye 1

favorite 0

comment 0

Reconstructions of the Large Ensemble Testbed generated by NN, XGB, and RF algorithms.
Source: https://figshare.com/articles/dataset/pCO2_reconstructions_-_Large_Ensemble_Testbed_-_XG_-_GFDL/12925631/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-211_dcdWW_domain_trajectories/12163065/2
The Dataset Collection
by Paolo Mignone
data

eye 2

favorite 0

comment 0

Fold 7 test set
Source: https://figshare.com/articles/dataset/test7_zip/13372976/1
Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based,...
Source: https://figshare.com/articles/dataset/A_genome-scale_investigation_of_how_sequence-_function-_and_tree-based_gene_properties_influence_phylogenetic_inference/1597710/2
The Dataset Collection
by Alexander V. Georgiev; Diana Christie; Kevin A. Rosenfield; Angelina V. Ruiz-Lambides; Elizabeth Maldonado; Melissa Emery Thompson; Dario Maestripieri
data

eye 1

favorite 0

comment 0

Explaining intraspecific variation in reproductive tactics hinges on measuring associated costs and benefits. Yet, this is difficult if alternative (purportedly less optimal) tactics remain unobserved. We describe a rare alpha-position take-over by an immigrant male rhesus macaque in a population where males typically gain rank via succession. Unusually, male aggressiveness after the take-over correlated with rank and mating success. The new alpha achieved the highest mating and reproductive...
Source: https://brill.figshare.com/articles/dataset/Movie_file_for_Breaking_the_succession_rule_the_costs_and_benefits_of_an_alpha_status_take_over_by_an_immigrant_rhesus_macaque_on_Cayo_Santiago/2198989/1
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 32

favorite 0

comment 0

DOAJ-CRAWL-2020-11
collection
102
ITEMS
854,580
VIEWS
by Internet Archive Web Group
collection

eye 854,580

UNPAYWALL-PDF-CRAWL-2020-03
collection
344
ITEMS
1.7M
VIEWS
by Internet Archive Web Group
collection

eye 1.7M

SEMSCHOLAR-DIRECT-PDF-CRAWL-2020-02
collection
1,011
ITEMS
1.4M
VIEWS
by Internet Archive Web Group
collection

eye 1.4M

The Dataset Collection
by Ghosh, Arindam
data

eye 1

favorite 0

comment 0

Data for Figures 3, 4, 5 and 6. All data files are standard .mat files. These files can be opened in MATLAB. CC0 Waiver
Source: https://data.goettingen-research-online.de/dataset.xhtml?persistentId=doi:10.25625/NIDERQ&version=1.0
The Dataset Collection
by Blackwell, Jody
data

eye 1

favorite 0

comment 0

Presentation Date: Tuesday, July 24, 2018. Location: PRISE Distinguished Speaker Presentation, Harvard University. Abstract: These are the slides from Dr. Alyssa Goodman's PRISE Distinguished Speaker Presentation "What does "data science" mean to me, and to you?" given on July 24, 2018. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/77QB4R&version=1.1
Using a variety of inputs, IFPRI's Spatial Production Allocation Model (SPAM) uses a cross-entropy approach to make plausible estimates of crop distribution within disaggregated units. Moving the data from coarser units such as countries and sub-national provinces, to finer units such as grid cells, reveals spatial patterns of crop performance, creating a global grid-scape at the confluence between geography and agricultural production systems. Improving spatial understanding of crop production...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/PRFF8V&version=4.0
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 15

favorite 0

comment 0

The Dataset Collection
data

eye 0

favorite 0

comment 0

Raw data for the experiments in Section 7 conducted for the paper https://papers.nips.cc/paper/5872-efficient-and-robust-automated-machine-learning
Source: https://figshare.com/articles/dataset/Efficient_and_Robust_Automated_Machine_Learning_-_Section_6/3824103/2
Bulk Bibliographic Metadata
by dblp
data

eye 25

favorite 0

comment 0

The Dataset Collection
by Jie Lyu
data

eye 1

favorite 0

comment 0

see README.md
Source: https://figshare.com/articles/dataset/Epigenetics_processing_zip/12199029/1
The Dataset Collection
by Wallace H. Liu; Jie Zheng; Jessica L. Feldman; Mark A. Klein; Vyacheslav I. Kuznetsov; Craig L. Peterson; Patrick Robert Griffin; John Denu
data

eye 1

favorite 0

comment 0

The protein deacetylase SIRT6 maintains cellular homeostasis through multiple pathways that include the deacetylation of histone H3 and repression of transcription. Prior work suggests that SIRT6 is associated with chromatin and can substantially reduce global levels of H3 acetylation, but how SIRT6 is able to accomplish this feat is unknown. Here, we describe an exquisitely tight interaction between SIRT6 and nucleosome core particles, in which a 2:1 enzyme:nucleosome complex assembles via...
Source: https://figshare.com/articles/dataset/Multivalent_Interactions_Drive_Nucleosome_Binding_and_Efficient_Chromatin_Deacetylation_by_SIRT6/12937103/1
The Dataset Collection
by Jinxi Huo
data

eye 1

favorite 0

comment 0

RNA-seq of B16F10 cells. S303, S304, S305 are SH-1, SH-2, SH-3. S306, S307, S308 are CK-1, CK-2, CK-3
Source: https://figshare.com/articles/dataset/RNA-seq_raw_data_of_B16F10_cells_/12792491/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories data. Trajectories of WW domain.
Source: https://figshare.com/articles/dataset/GTT-1-protein-116_dcd/12162777/1
The Dataset Collection
by Yuan Wang
data

eye 1

favorite 0

comment 0

URD object
Source: https://figshare.com/articles/dataset/URD_object/11880582/1
Supplementary dataset
Source: https://figshare.com/articles/dataset/Genome-wide_CRISPR_screens_reveal_fitness_genes_in_the_Hippo_pathway_for_oral_squamous_cell_carcinoma/11859249/3
The Dataset Collection
by Xiaojie Yu; Xinyu Guo; Huiwang Gao
data

eye 1

favorite 0

comment 0

This database includes the model result in the manuscript "Detachment of low-salinity water from the Yellow River plume in summer". All the data can be read by the Matlab files, which are also included.
Source: https://figshare.com/articles/dataset/data/12925832/1
The Dataset Collection
by Andrea Serra-Marques; Maud Martin; Eugene A. Katrukha; Ilya Grigoriev; Cathelijn Peeters; Qingyang Liu; Peter Jan Hooikaas; Yao Yao; Veronika Solianova; Ihor Smal; Lotte B. Pedersen; Erik Meijering; Lukas C Kapitein; Anna Akhmanova; Kapitein Lab
data

eye 1

favorite 0

comment 0

Supplementary code, raw data and figures from "Concerted action of kinesin-1 KIF5B and kinesin-3 KIF13B promotes efficient transport of exocytotic vesicles to microtubule plus ends"
Source: https://figshare.com/articles/dataset/Supplementary_code_raw_data_and_figures_from_Concerted_action_of_kinesin-1_KIF5B_and_kinesin-3_KIF13B_promotes_efficient_transport_of_exocytotic_vesicles_to_microtubule_plus_ends_/13103372/1
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 1

favorite 0

comment 0

Rat 0197, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0197_20171209a_videos/12949304/1
The Dataset Collection
by Nick Curtis; Kyle Niemeyer
data

eye 1

favorite 0

comment 0

A data-set of partially stirred reactor states generated by the pypasr (https://github.com/kyleniemeyer/pypasr) code for GRI-Mech 3.0 (http://combustion.berkeley.edu/gri-mech/version30/text30.html). The data is saved in the numpy .npz format, and is organized as follows: The keys of the .npz file are the original filenames, e.g. pasr_out_0.npy, etc. The files correspond to the following inlet conditions: 0, 1, 2: 400, 600, 800 K at 1 atm 3, 4, 5: 400, 600, 800 K at 10 atm 6, 7, 8: 400, 600, 800...
Source: https://figshare.com/articles/dataset/ch4_pasr_data_bin/4007418/2
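The key layout described for this item can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names `inlet_conditions` and `load_states` are hypothetical, the key pattern is inferred from the single example `pasr_out_0.npy`, and the pressure for files 6 to 8 is left unknown because the description is truncated at that point.

```python
import re

# Temperatures (K) repeat over each group of three file indices, per the description.
TEMPERATURES_K = (400, 600, 800)
# Pressures (atm) for the two groups the description spells out. The text is
# truncated after "6, 7, 8: 400, 600, 800...", so that pressure stays unknown.
PRESSURES_ATM = {0: 1, 1: 10}

# Assumed key pattern, based on the one example key given in the description.
_KEY_PATTERN = re.compile(r"pasr_out_(\d+)\.npy$")


def inlet_conditions(key):
    """Return (temperature_K, pressure_atm) for a key such as 'pasr_out_4.npy'.

    Either value is None when the description does not state it.
    """
    match = _KEY_PATTERN.search(key)
    if match is None:
        raise ValueError(f"unrecognized key: {key!r}")
    index = int(match.group(1))
    if index > 8:
        # Indices past 8 are not covered by the visible part of the description.
        return (None, None)
    return (TEMPERATURES_K[index % 3], PRESSURES_ATM.get(index // 3))


def load_states(path):
    """Yield (key, conditions, array) for every array stored in the .npz file."""
    import numpy as np  # imported lazily so the mapping above works without NumPy

    with np.load(path) as archive:
        for key in archive.files:
            yield key, inlet_conditions(key), archive[key]
```

For example, `inlet_conditions("pasr_out_4.npy")` gives `(600, 10)`, matching the "3, 4, 5: 400, 600, 800 K at 10 atm" line of the description. The shape and meaning of each array are not stated in the listing, so `load_states` returns them unchanged.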
The Dataset Collection
by Aria S Hahn
data

eye 1

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, which enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS049164_scaffolds_zip/3478697/1
The Dataset Collection
by Lima, Alvaro
data

eye 2

favorite 0

comment 0

Newspaper CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/DWN6PA&version=1.0
The Dataset Collection
data

eye 3

favorite 0

comment 0

These videos are used as a recruitment tool for the Ovulation and Menstruation Health Study. A participant who navigates to the study website may watch this video for more information about the purpose of the study. The study team ensured that anyone interested in participating had this information available before determining if they wanted to join. The videos are available in English and in Spanish. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ODETL6&version=1.0
The Dataset Collection
by Tobin, John
data

eye 1

favorite 0

comment 0

Ka-band VLA data for Per-emb-1/HH211-MMS CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/8Z9AXP&version=2.0
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 54

favorite 0

comment 0

This item contains an example corpus of citations between scholarly documents, as extracted from the fatcat (https://fatcat.wiki) corpus as of the 2020-08-05 bulk release export. This corpus itself was generated from a fatcat-scholar "intermediate" fulltext dump which is not public, using software in the fatcat-scholar repository in mid-September 2020. See also the README for some more notes, and the "sample" file.
The Dataset Collection
by Charles Tapley Hoyt
data

eye 1

favorite 0

comment 0

This dataset is now maintained on Zenodo. See: https://zenodo.org/record/4020486
Source: https://figshare.com/articles/dataset/Ooh_Na_Na/12149241/1
This dataset was created for static occlusion analysis. It covers 133 subjects, each with 8 sequences: 4 sequences of normal walking without occlusion and 4 sequences with static occlusion. It is a set of normalized silhouette sequences. The data was recorded with a personal camera configured similarly to a Powershot-SX430-IS.
Source: https://figshare.com/articles/dataset/Silhouette_frames_Selected_big_blob_Extracted_Centered_Alinged_Directed_splitted_Alinged_Directed_rar/12469655/1
The Dataset Collection
by Xiaojie Yu
data

eye 1

favorite 0

comment 0

This database includes the model results from the manuscript "Detachment of low-salinity water from the Yellow River plume in summer". All the data can be read with the MATLAB files, which are also included.
Source: https://figshare.com/articles/dataset/data2/12924515/1
The Dataset Collection
by Kresten Lindorff-Larsen
data

eye 1

favorite 0

comment 0

Trajectories of WW domain. Trajectories data.
Source: https://figshare.com/articles/dataset/GTT-1-protein-294_dcdWW_domain_trajectories/12163773/2
Simply all the supporting scripts, data and documentation relating to this work. Get in touch if you want to build on this and need clarity.
Source: https://figshare.com/articles/dataset/Raw_data_scripts_and_dissertation_related_to_Hijacking_Internet-connected_Devices_to_Provoke_Harmful_Oscillations_in_an_Electrical_Network_a_Feasibility_Assessment_/7218722/3
The Dataset Collection
by Denghua Yan; Baisha Weng; Tianling Qin; Hao wang; Xiangnan Li; Yuheng Yang; Kun Wang
data

eye 1

favorite 0

comment 0

Global population and water withdrawal
Source: https://figshare.com/articles/dataset/A_data_set_of_distributed_global_population_and_water_withdrawal_from_1960_to_2017/8063150/4
The Dataset Collection
by Kennelly, Patrick J.; Patterson, Tom; Jenny, Bernhard; Huffman, Daniel P.; Marston, Brooke E.; Bell, Sarah; Tait, Alexander M.
data

eye 1

favorite 0

comment 0

An elevation model of Great Sand Dunes, Colorado, USA Landform features: active dune field, sand sheet, sabkha Resolution: 3.3 meter, 5,300 x 5,300 height samples File format: GeoTIFF This is one model of a set of elevation models: https://doi.org/10.5281/zenodo.3938020 . Please cite the entire set of models. When using this elevation model in an academic publication, please cite the following article, which describes the process and rationale for compiling elevation models: Kennelly, P. J.,...
Source: https://zenodo.org/record/3940434
The Dataset Collection
by Helene Moran; Lena Karlin; Elsie Lauchlan; Sarah Rappaport; Ben Bleasdale; Lucy Wild; Josh Dorr
data

eye 1

favorite 0

comment 0

We present extended data related to the research on understanding research culture conducted by Wellcome and Shift Learning. This includes: a document detailing how the e-survey data was transformed to protect anonymity, a flowchart that indicates how participants were guided to answer questions in the e-survey, an e-survey guide, an interview guide, a sample plan for the qualitative phase of the research, and a document outlining some ethical considerations guiding the research design.
Source: https://wellcome.figshare.com/articles/dataset/Understanding_Research_Culture_extended_data/12191550/2
Bulk Bibliographic Metadata
by Impactstory
data

eye 113

favorite 0

comment 0

A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
The Dataset Collection
by Daniel Leventhal; Alexandra Bova
data

eye 2

favorite 0

comment 0

Rat 0219, skilled reaching data collected in Leventhal laboratory by Alexandra Bova
Source: https://figshare.com/articles/dataset/R0219_20180306a_videos/13175141/1
Fatcat Database Snapshots and Bulk Metadata Exports
by Internet Archive Web Group
data

eye 9

favorite 0

comment 0