Skip to main content

7,930
UPLOADS


More right-solid

More right-solid

Show sorted alphabetically

More right-solid

Show sorted alphabetically

More right-solid

SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
The Dataset Collection
by A. Murat Eren
data

eye 1

favorite 0

comment 0

The archive file contains the merged anvi'o profile, and the contigs database for the Infant Gut data from Sharon et al. that is suitable to analyze with anvi'o v4 or later. Please see http://merenlab.org/tutorials/infant-gut/ for details.
Source: https://figshare.com/articles/dataset/Infant_Gut_Data_v2/3502445/14
The Dataset Collection
by A. Murat Eren; Florian Trigodet; Karen Lolans
data

eye 1

favorite 0

comment 0

Assembled Illumina short reads from human oral cavity samples processed using the state-of-the-art DNA extraction strategies for shotgun metagenomics described in the following study: https://doi.org/10.1101/2021.03.03.433801 These two metagenomes are from the same individual's oral samples and generated from (1) the sample used for long-read sequencing of the same material to test HMW DNA extraction method 01 in our study (here named ORAL_ILLUMINA_METHOD_01_REPL_02_ASSEMBLED) and (2) the...
Source: https://figshare.com/articles/dataset/Assembled_Illumina_Short_Reads/14141819/1
Accompanying data to "The complexity of high-frequency electric fields impairs jamming avoidance: a potential trade-off in electric sensing" (submitted)
Source: https://figshare.com/articles/dataset/The_complexity_of_high-frequency_electric_fields_impairs_jamming_avoidance_a_potential_trade-off_in_electric_sensing/5361007/1
The Dataset Collection
by Abel Gomes
data

eye 1

favorite 0

comment 0

Gaussian Finder's cavity dataset in CSV. This dataset describes the protein cavities output by a protein cavity detection method called Gaussian Finder. This method is described in the article available at: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-017-1913-4
Source: https://figshare.com/articles/dataset/Gaussian_Finder_s_cavity_dataset_in_CSV/9916745/1
The Dataset Collection
data

eye 1

favorite 0

comment 0

This is an archive of the raw data and analysis source code for the paper "A crop yield change emulator for use in GCAM and similar models: Persephone v1.0".  The archive contains: data.zip:  All source code for analysis, input data for analysis, and results of analysis persephone.proj :  R project for ease of reproducing analysis
Source: https://zenodo.org/record/1414423
The Dataset Collection
by Adac_uon Adac_uon
data

eye 1

favorite 0

comment 0

test "input" data for PIMMS pipeline
Source: https://figshare.com/articles/dataset/test_IN_R2_fastq_zip/1032716/1
The Dataset Collection
by Adrià Pérez Culubret; Gianni De Fabritiis
data

eye 1

favorite 0

comment 0

The initial structure of Chignolin was generated starting from the cln025 peptide, with sequence TYR-TYR-ASP-PRO-GLU-THR-GLY-THR-TRP-TYR. The structure was solvated in a cubic box of 40A, containing 1881 water molecules and two Na+ ions to neutralize the peptide's negative charge. MD simulations were performed with ACEMD, using CHARMM22* force field and TIP3P water model at 350K temperature. A Langevin integrator was used with a damping constant of 0.1 1/ps. Integration time step was set to 4...
Source: https://figshare.com/articles/dataset/Chignolin_Simulations/13858898/1
This data provides the topics identified by our approach BOUN-TI, on the data collected from Twitter while the 2012 U.S.A. presidential debates were holding. The dataset also provides tf values of words in a Wikipedia snapshot, and the values required to gain idf values of words. Word frequency distribution of an interval of Twitter english public stream tweets' is provided.
Source: https://figshare.com/articles/dataset/data_tar_gz/2068665/2
The Dataset Collection
by Akanksha Jain
data

eye 1

favorite 0

comment 0

Cartographic projections of a reconstructed multiview lightsheet dataset. The embryo was labeled with LifeAct-eGFP and imaged with multi-view SPIM from 3 views.
Source: https://figshare.com/articles/dataset/Supplementary_6B_LifeAct-eGFP_SPIM/13110260/1
The Dataset Collection
by Akshay Yadav; David Fernández-Baca; Steven Cannon, scannon@iastate.edu
data

eye 1

favorite 0

comment 0

Yeast (YGOB) and legume gene families used for testing methods for detecting and correcting under-clustered and over-clustered gene families ygob_proteomes.tar.gz : Complete yeast proteomes from YGOB database ygob_family_fasta.tar.gz : Complete yeast families from the YGOB database ygob_family_fasta_delete.tar.gz : Intentionally under-clustered yeast families with missing 20% sequences ygob_family_fasta_insert_delete.tar.gz : Intentionally under-clustered yeast families with missing 20%...
Source: https://figshare.com/articles/dataset/Methods_for_analyzing_comparing_and_correcting_gene_families/12115305/1
The Dataset Collection
by Aldo Hernandez-Corchado
data

eye 1

favorite 0

comment 0

bigwigs
Source: https://figshare.com/articles/dataset/2-1_bw/13526330/1
The Dataset Collection
by Aleksandr Zhernakov
data

eye 1

favorite 0

comment 0

The raw NGSequencing data of a library prepared by MACE technology from five pea plants with mutant phenotype of F2 (NGB1238 × N24) mapping population.
Source: https://figshare.com/articles/dataset/M2-library/7409270/1
The Dataset Collection
by Alex Detappe
data

eye 1

favorite 0

comment 0

Raw data, Figure 4, Qi et al., Nat Comm 2017
Source: https://figshare.com/articles/dataset/3_Figure_4_All_Raw_Data_Files/11728848/1
The Dataset Collection
by Alexander V. Georgiev; Diana Christie; Kevin A. Rosenfield; Angelina V. Ruiz-Lambides; Elizabeth Maldonado; Melissa Emery Thompson; Dario Maestripieri
data

eye 1

favorite 0

comment 0

Explaining intraspecific variation in reproductive tactics hinges on measuring associated costs and benefits. Yet, this is difficult if alternative (purportedly less optimal) tactics remain unobserved. We describe a rare alpha-position take-over by an immigrant male rhesus macaque in a population where males typically gain rank via succession. Unusually, male aggressiveness after the take-over correlated with rank and mating success. The new alpha achieved the highest mating and reproductive...
Source: https://brill.figshare.com/articles/dataset/Movie_file_for_Breaking_the_succession_rule_the_costs_and_benefits_of_an_alpha_status_take_over_by_an_immigrant_rhesus_macaque_on_Cayo_Santiago/2198989/1
The Dataset Collection
by Ali Safaya
data

eye 1

favorite 0

comment 0

trwiki-67 dataset trwiki-67 is a language modeling dataset that contain 67 million words of raw wikipedia articles. It can be utilized as a benchmark for different language modeling tasks on character, subword, or word level. This dataset was extracted from a Turkish wikipedia dump on 20 July 2021. Preprocessing All lists and tables were removed from the articles, and the initial extraction from .xml dump was done using wikiextractor : Additionally, further preprocessing was applied to get rid...
Source: https://zenodo.org/record/5146001
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 14

favorite 0

comment 0

This is a backup of the "Open Academic Search" corpus, published by Semantic Scholar / Allen Institute for AI. For more info see http://labs.semanticscholar.org/corpus/. In particular, note the terms and conditions, and the request: We request that any published research that makes use of this data cites the following paper: Waleed Ammar et al. 2018. Construction of the Literature Graph in Semantic Scholar. NAACL. ...
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 23

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus". These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-*-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 23

favorite 0

comment 0

This is a backup of the "Open Academic Search" corpus, published by Semantic Scholar / Allen Institute for AI. For more info see http://labs.semanticscholar.org/corpus/. In particular, note the terms and conditions, and the request: We request that any published research that makes use of this data cites the following paper: Waleed Ammar et al. 2018. Construction of the Literature Graph in Semantic Scholar. NAACL. ...
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 40

favorite 0

comment 0

This is a snapshot of the AI2 (Semantic Scholar') "Open Research Corpus", as release May 3rd, 2018. These files originally downloaded from AWS S3, via: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'sample-S2-records.gz' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 15

favorite 0

comment 0

This is a mirror of the Semantic Scholar Graph of References in Context (GORC) dataset. Use of this dataset is under terms of the Semantic Scholar Dataset License: http://web.archive.org/web/20200118202545/http://api.semanticscholar.org/corpus/legal/ See also: https://github.com/allenai/s2-gorc https://arxiv.org/abs/1911.02782
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 51

favorite 0

comment 0

This is a snapshot of the AI@ (Semantic Scholar') "Open Research Corpus", as downloaded June 26th, 2017. These files originally downloaded from: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, that includes field content. 'papers-2017-02-21-sample.zip' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 52

favorite 0

comment 0

This is a backup of the "Open Academic Search" corpus, published by Semantic Scholar / Allen Institute for AI. For more info see http://labs.semanticscholar.org/corpus/. In particular, note the terms and conditions: Semantic Scholar Open Research Corpus is licensed under  ODC-BY . When using the Semantic Scholar Open Research Corpus (“S2 ORC”) in a product or service, or including data in a redistribution, please cite the following paper: Waleed Ammar et al. 2018. Construction...
Bulk Bibliographic Metadata
by Allen Institute for Artificial Intelligence
data

eye 177

favorite 0

comment 0

Semantic Scholar Open Research Corpus is licensed under  ODC-BY . When using the Semantic Scholar Open Research Corpus (“S2 ORC”) in a product or service, or including data in a redistribution, please cite the following paper: Waleed Ammar et al. 2018. Construction of the Literature Graph in Semantic Scholar. NAACL https://www.semanticscholar.org/paper/09e3cf5704bcb16e6657f6ceed70e93373a54618 This site is provided by The Allen Institute for Artificial Intelligence (“AI2”) as a service...
The Dataset Collection
by Allen Institute for Artificial Intelligence, PubMed, bioRxiv, medRxiv, et al
data

eye 84

favorite 1

comment 0

This is a mirror of the "COVID-19 Open Research Dataset (CORD-19)", published by the Allen Institute for Artificial Research (AI2 / Semantic Scholar) on 2020-03-16. This item contains thousands of parsed open access research papers about the COVID-19 coronavirus, and may be of interest as a bulk research corpus. This dataset is *NOT INTENDED AS MEDICAL ADVICE* or as an informational resource for the general public. For details, including licensing information, see the included .html...
Topic: COVID-19, Coronavirus
The Dataset Collection
by Amir AghaKouchak; Mojtaba Sadegh; Ehsan Raei; Mohammad Reza Nikoo; Omid Mazdiyasni
data

eye 0

favorite 0

comment 0

Daily binary (0/1) occurrence records of heatwaves/warm-spells using Constant Threshold, EHF, ETCCDI, PDF and SHI Methods for the entire globe between 1979 to 2017.
Source: https://springernature.figshare.com/articles/dataset/GHWR_-_Record_-_Const_Thresh_EHF_ETCCDI_PDF_and_SHI_Methods/5885224/1
The Dataset Collection
by Andrea Capiluppi
data

eye 3

favorite 0

comment 0

These are the raw java classes of the projects parsed off SourceForge. The projects are categorised by application domain
Source: https://figshare.com/articles/dataset/raw_data_Java_source_code_for_MSR_data_track_2019/7673264/1
The Dataset Collection
by Andrea Serra-Marques; Maud Martin; Eugene A. Katrukha; Ilya Grigoriev; Cathelijn Peeters; Qingyang Liu; Peter Jan Hooikaas; Yao Yao; Veronika Solianova; Ihor Smal; Lotte B. Pedersen; Erik Meijering; Lukas C Kapitein; Anna Akhmanova; Kapitein Lab
data

eye 1

favorite 0

comment 0

Supplementary code, raw data and figures from "Concerted action of kinesin-1 KIF5B and kinesin-3 KIF13B promotes efficient transport of exocytotic vesicles to microtubule plus ends"
Source: https://figshare.com/articles/dataset/Supplementary_code_raw_data_and_figures_from_Concerted_action_of_kinesin-1_KIF5B_and_kinesin-3_KIF13B_promotes_efficient_transport_of_exocytotic_vesicles_to_microtubule_plus_ends_/13103372/1
The Dataset Collection
by Andrea Zonca
data

eye 1

favorite 0

comment 0

TMT IRIS simulated observations used for testing the software pipeline
Source: https://figshare.com/articles/dataset/TMT_IRIS_test_simulations/9941939/1
The Dataset Collection
by Andrew D. Richardson; David Y. Hollinger; Julie Shoemaker; Holly Hughes; Kathleen Savage; Eric A. Davidson
data

eye 1

favorite 0

comment 0

Carbon dioxide (CO 2 ), methane (CH 4 ), and nitrous oxide (N 2 O) are the greenhouse gases largely responsible for anthropogenic climate change. Natural plant and microbial metabolic processes play a major role in the global atmospheric budget of each. We have been studying ecosystem-atmosphere trace gas exchange at a sub-boreal forest in the northeastern United States for over two decades. Historically our emphasis was on turbulent fluxes of CO 2 and water vapor. In 2012 we embarked on an...
Source: https://figshare.com/articles/dataset/Tower-_and_chamber-based_greenhouse_gas_flux_measurements_from_Howland_Forest_Maine_2012-2018_/7445657/1
The Dataset Collection
by André Kashiwabara
data

eye 1

favorite 0

comment 0

This dataset was used to evaluate MYOP using 5 fold cross-validation.
Source: https://figshare.com/articles/dataset/MYOP_cross-validation_dataset/4254578/1
The Dataset Collection
by André R. A. Marques; Alessandro Di Spiezio; Niklas Thießen; Lina Schmidt; Joachim Grötzinger; Renate Lüllmann-Rauch; Markus Damme; Steffen E. Storck; Claus U. Pietrzik; Jens Fogh; Julia Bär; Marina Mikhaylova; Markus Glatzel; Mahmoud Bassal; Udo Bartsch; Paul Saftig
data

eye 2

favorite 0

comment 0

CTSD (cathepsin D) is one of the major lysosomal proteases indispensable for the maintenance of cellular proteostasis by turning over substrates of endocytosis, phagocytosis and autophagy. Consequently, CTSD deficiency leads to a strong impairment of the lysosomal-autophagy machinery. In mice and humans CTSD dysfunction underlies the congenital variant (CLN10) of neuronal ceroid lipofuscinosis (NCL). NCLs are distinct lysosomal storage disorders (LSDs) sharing various hallmarks, namely...
Source: https://tandf.figshare.com/articles/dataset/Enzyme_replacement_therapy_with_recombinant_pro-CTSD_cathepsin_D_corrects_defective_proteolysis_and_autophagy_in_neuronal_ceroid_lipofuscinosis/8798045/1
The Dataset Collection
by Annette Menzel; Tongli Wang; Andreas Hamann; Maurizio Marchi; Dante Castellanos-Acuña; Duncan Ray
data

eye 1

favorite 0

comment 0

Gridded data at 1km resolution, HadGem2 AOGCM, RCP 8.5, 2050s projections, monthly variables Tmin01-12 Tmax01-12 Tave01-12 Prec01-12. Unzip the archive with 7-zip.org.
Source: https://springernature.figshare.com/articles/dataset/Grids_1km_HadGem2_AOGCM_RCP_8_5_2050s_Monthly/11827572/1
The Dataset Collection
by Annette Menzel; Tongli Wang; Andreas Hamann; Maurizio Marchi; Dante Castellanos-Acuña; Duncan Ray
data

eye 1

favorite 0

comment 0

Gridded data at 1km resolution, MPI AOGCM, RCP 8.5, 2080s projections, monthly variables Tmin01-12 Tmax01-12 Tave01-12 Prec01-12. Unzip the archive with 7-zip.org.
Source: https://springernature.figshare.com/articles/dataset/Grids_1km_MPI_AOGCM_RCP_8_5_2080s_Monthly/11827695/1
The Dataset Collection
by Annette Menzel; Tongli Wang; Andreas Hamann; Maurizio Marchi; Dante Castellanos-Acuña; Duncan Ray
data

eye 1

favorite 0

comment 0

Gridded data at 1km resolution, MPI AOGCM, RCP 4.5, 2050s projections, bioclimatic variables. Unzip the archive with 7-zip.org.
Source: https://springernature.figshare.com/articles/dataset/Grids_1km_MPI_AOGCM_RCP_4_5_2050s_Bioclim/11827674/1
Supplementary dataset
Source: https://figshare.com/articles/dataset/Genome-wide_CRISPR_screens_reveal_fitness_genes_in_the_Hippo_pathway_for_oral_squamous_cell_carcinoma/11859249/3
The Dataset Collection
by Anonymous
data

eye 1

favorite 0

comment 0

Open-source apps from F-droid used for the evaluation of the tool TNBDroid. We only use the apps for research. Commercial apps from google play used for the evaluation of the tool TNBDroid. We only use the apps for research.
Source: https://figshare.com/articles/dataset/socialApps/11932743/5
The Dataset Collection
by Antonis Michalas
data

eye 9

favorite 0

comment 0

Text files of different size and structure. More precisely, we selected random data from the Gutenberg dataset. This artefact contains five different datasets with random text files (i.e. e-books in .txt format) from the Gutenberg database. The datasets that we selected ranged from text files with a total size of 184MB to a set of text files with a total size of 1.7GB. More precisely, the following datasets can be found in this package: 1. 184MB 2. 357MB 3. 670MB 4. 1GB 5. 1.7GB In our case, we...
Source: https://zenodo.org/record/3360392
The Dataset Collection
data

eye 1

favorite 0

comment 0

The dataset contains genotype (tped and tfam files) and phenotype data GWAS analyses on canine hip dysplasia. See README file for more information.
Source: https://figshare.com/articles/dataset/Genetic_dissection_of_canine_hip_dysplasia_phenotypes_and_osteoarthritis_reveals_three_novel_loci/10096595/1
The Dataset Collection
by Aria Hahn
data

eye 1

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, that enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/DLF014_zip/3476849/1
The Dataset Collection
by Aria S Hahn
data

eye 1

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, that enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS016056_zip/3478451/1
The Dataset Collection
by Aria S Hahn
data

eye 1

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, that enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS016954_zip/3478478/1
The Dataset Collection
by Aria S Hahn
data

eye 2

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, that enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS017191_zip/3478499/1
The Dataset Collection
by Aria S Hahn
data

eye 1

favorite 0

comment 0

This sample is part of the GutCyc Collection ( www.gutcyc.org ), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, that enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS049164_scaffolds_zip/3478697/1
The Dataset Collection
by Ariane Morassi Sasso
data

eye 1

favorite 0

comment 0

File to be used in the: https://github.com/arianesasso/aime-2020 , notebook: processing_predictions/extract_blfeatures_and_predict_bp_from_ppg_eval.ipynb
Source: https://figshare.com/articles/dataset/Processed_EVAL_Dataset_30_secs_window_-_bfill_/12649691/1
Genome alignments for data generated in the project " Whole transcriptome analysis of thousands of FACS-sorted single cells with the single cell nanoCAGE protocol – Optimization of the protocol. " Files names indicate unique identifiers of MOIRAI workflow runs, with the following structure: library name, dot, workflow ID (OP-WORKFLOW-CAGEscan-short-reads-v2.0.), dot, timestamp. The raw (FASTQ) data of each library is also deposited in Zenodo ( 10.5281/zenodo.250156 ). Library names...
Source: https://zenodo.org/record/3340196
The Dataset Collection
by Arrian Gibson-Khademi
data

eye 1

favorite 0

comment 0

scATAC-seq dataset processed for inclusion in scATAC.Explorer R package. scATAC.Explorer is an R package containing a curated collection of publicly available scATAC-seq datasets that can easily be searched and retrieved within R. Included datasets are processed into a consistent format for ease of analysis. This dataset was not generated by our lab. Please also give credit to the source the dataset was retrieved from, included in reference field.
Source: https://figshare.com/articles/dataset/FreshMouseBrainCellRanger1_2_0/14357105/1
The Dataset Collection
by Asier Erramuzpe
data

eye 1

favorite 0

comment 0

Neurovault's statmaps for benchmarking
Source: https://figshare.com/articles/dataset/images_tar/3425759/1
The Dataset Collection
by Atticus Stovall
data

eye 1

favorite 0

comment 0

Data for: Tree height explains mortality risk during an intense drought https://rdcu.be/bU8go Tree-level Information Zone: tree ID number Zmax: tree height (m) Count: integer value (1) Yearly Pixel-based Mortality Estimates mort2009: % of “dead” pixels in 2009 mort2010: as above for 2010 mort2012: as above for 2012 mort2014: as above for 2014 mort2016: as above for 2016 Tree-level Mortality Estimates x: x coordinate (UTM Zone 11N) y: y coordinate (UTM Zone 11N) dead: 0 = Live; 1 = Dead,...
Source: https://figshare.com/articles/dataset/CA_lidar_tree_mortality/7609193/4
The Dataset Collection
by Axel Séguret
data

eye 1

favorite 0

comment 0

Each directory stores 12 trials with the number of zebrafish displayed in the name of the directory (1 AB is for 12 trials of 1 zebrafish AB) for the directory 20AB, the data show only the positions of the individuals without their identities. The first column is for the time, then columns go by two (for x position then y position) For the directories 2AB, 3AB, 5AB, 7AB and 10AB, the data are ranked to gap and no gap. Gap means there are nan in the files when the tracker can not identify the...
Source: https://figshare.com/articles/dataset/seguret_dryad_2017_rar/5151247/1
The Dataset Collection
by Aydin Ayanzadeh; Özden Yalçın Özuysal; Devrim Pesen Okvur; Behçet Uğur Töreyin; Devrim Ünay; Sevgi Önal
data

eye 1

favorite 0

comment 0

Phase contrast Microscopy
Source: https://figshare.com/articles/dataset/Phase_Contrast_Microscopy_of_cells_with_annotation/8965820/1
100-dimensional word2vec CBOW negative sampling word embeddings for the Cyrillic Uzbek language. Trained using the webcrawl corpus v1.
Source: https://figshare.com/articles/dataset/uzb-cyrl-webcrawl-v1-word2vec-cbow-ns-100d/12991472/2
The data and programs replicate tables and figures from "The Global Distribution of Economic Activity: Nature, History, and the Role of Trade", by Henderson, Squires, Storeygard, and Weil. Data were constructed from various sources. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/MO6RJT&version=1.1
The data and programs replicate tables and figures from "Human Capital and Development Accounting: New Evidence from Wage Gains at Migration", by Hendricks and Schoellman. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IPIBQP&version=1.1
The data and programs replicate tables and figures from "From Hyperinflation to Stable Prices: Argentina's Evidence on Menu Cost Models", by Alvarez, Beraja, Gonzalez-Rozada, and Neumeyer. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/C8ZOAS&version=2.2
The Dataset Collection
by Baranga, Thomas
data

eye 1

favorite 0

comment 0

The data and programs replicate tables and figures from "The Return to Protectionism", by Fajgelbaum, Goldberg, Kennedy, and Khandelwal. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/KSOVSE&version=1.1
The Dataset Collection
data

eye 1

favorite 0

comment 0

Replication Files (datasets and codes in Stata format): - comparative survey data with individual-level analyses - second-level data of estimates from individual-level analyses - TESS experimental study - MTurk experimental study (pilot) CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/E852VT&version=1.0
The Dataset Collection
by Ben Dichter; Edward F Chang
data

eye 1

favorite 0

comment 0

The enclosed data is collected using a high-density 256-channel electrocorticography array implanted in a human patient during treatment for epilepsy. The subjects are reading aloud consonant-vowel syllables from a list. The data was collected by Dr. Edward Chang at the University of California, San Francisco, and curated by Ben Dichter. Data is organized by subject ID, and each file is a continuous recording session in Neurodata Without Borders: Neurophysiology (NWB:N) 2.0 format. Voltage...
Source: https://figshare.com/articles/dataset/EC9_B46_nwb/9631880/1
The Dataset Collection
by Ben Fulcher
data

eye 12

favorite 0

comment 0

A diverse selection of 1000 empirical time series, along with results of an hctsa feature extraction, using v1.03 of hctsa and Matlab 2019b, computed on a linux server at Sydney University. The results of the computation are in the hctsa file, HCTSA_Empirical1000.mat for use in Matlab using v1.03 of hctsa . The same data is available in .csv format (e.g., for use with non-Matlab computing environments) for the hctsa_datamatrix.csv (results of feature computation), with information about rows...
Source: https://figshare.com/articles/dataset/1000_Empirical_Time_series/5436136/7
Campylobacter is the leading bacterial cause of gastroenteritis worldwide and despite high incidence in low- to middle-income countries, where infection can be fatal, surveillance is rare and the genotypes responsible for disease have not been identified. The epidemiology of disease is different to the developed world, where infection is mostly associated with consumption of contaminated meat products. Infection is endemic among children and asymptomatic carriage is thought to be common....
Source: https://figshare.com/articles/dataset/Global_similarity_and_local_differences_in_Campylobacter_jejuni_lineages_associated_with_asymptomatic_paediatric_infection_from_the_Peruvian_Amazon/10352375/4
The Dataset Collection
by Benedikt Geier
data

eye 1

favorite 0

comment 0

CLSM of the main dataset "MPIMM_054_QE_P_BP_CF" apllying FISH after AP-MALDI-MSI and after widefield fluorescence overviews were made
Source: https://figshare.com/articles/dataset/CLSM_of_FISH_sample/6887315/2
The Dataset Collection
by Benjamin Arnold Krekeler Hartz
data

eye 5

favorite 0

comment 0

Radiation: Interface for Matlab Spectroscopy Calculations (RadISpeC) is a program that use HITRAN/HITEMP data to generate Line-by-Line (LBL) spectra at a wanted spectral resolution and environment. These data can be used with Discrete Transfer Methods (DTM), Ray Tracing (RT) etc. solvers to estimate high precision radiation in heated flow problems. It was created as a part of the article "Experimental and theoretical evaluation of spectral radiative transfer in high-pressure flames"...
Source: https://figshare.com/articles/dataset/RadISpeC_program_code_m_files_and_GUI_for_Matlab_IDE_/6988997/4
The Dataset Collection
by Benjamin Judkewitz; Markus Schuelke; Mykola Kadobianskyi; Lisanne Schulze
data

eye 1

favorite 0

comment 0

FASTA sequence file with the Danionella translucida (DT) assembly scaffolds.
Source: https://springernature.figshare.com/articles/dataset/Genome_assembly_file/8003693/1
The Dataset Collection
by Benjamin Judkewitz; Markus Schuelke; Mykola Kadobianskyi; Lisanne Schulze
data

eye 1

favorite 0

comment 0

IGV viewer-compatible 25 bp sliding window RNA-seq coverage tracks file for adult RNA-seq library.
Source: https://springernature.figshare.com/articles/dataset/RNA-seq_coverage_IGV_track_adult_fish/8003696/1
The Dataset Collection
by Benoit Pasquier
data

eye 1

favorite 0

comment 0

See AIBECS.jl
Source: https://figshare.com/articles/dataset/OCIM2_KiLOW_He_bson/11911131/1
The Dataset Collection
by Benoit Pasquier
data

eye 1

favorite 0

comment 0

See AIBECS.jl
Source: https://figshare.com/articles/dataset/OCIM2_KiHIGH_He_bson/11911125/2
The Dataset Collection
by Bjorn Herrmann
data

eye 1

favorite 0

comment 0

sub-18
Source: https://figshare.com/articles/dataset/sub-18_rar/16564113/1
The Dataset Collection
by Bjorn Herrmann
data

eye 2

favorite 0

comment 0

sub-47
Source: https://figshare.com/articles/dataset/sub-47_rar/16566861/1
The Dataset Collection
by Bjorn Herrmann
data

eye 1

favorite 0

comment 0

sub-49
Source: https://figshare.com/articles/dataset/sub-49_rar/16566870/1
The Dataset Collection
by Bjorn Herrmann
data

eye 1

favorite 0

comment 0

sub-19
Source: https://figshare.com/articles/dataset/sub-19_rar/16564116/1
The Dataset Collection
by Blackwell, Jody
data

eye 1

favorite 0

comment 0

Presentation Date: Thursday, June 8, 2017. Location: Harvard University IT Summit, Cambridge, MA. Abstract: Twenty-five faculty from across Harvard’s schools are working with each other, as well as with a handful of outside experts, to create an extensive collection of online resources (interview-style and demonstration videos, text, interactive visualizations, and computer simulations) focused on “The Past and Present of the Future,” known as PredictionX. All of the included material is...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/9HWRK8&version=1.0
The Dataset Collection
by Blackwell, Jody
data

eye 1

favorite 0

comment 0

Presentation Date: Tuesday, July 24, 2018. Location: PRISE Distinguished Speaker Presentation, Harvard University. Abstract: These are the slides from Dr. Alyssa Goodman's PRISE Distinguished Speaker Presentation "What does "data science" mean to me, and to you?" given on July 24, 2018. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/77QB4R&version=1.1
Dataset including questionnaire results and physiological linkage indices for 38 couples of participants in a randomized controlled study. Nineteen couples were in the control condition and nineteen in the mediation condition. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/SVXIPH&version=1.0
The Dataset Collection
by Bolivar Samuel Sosa Madrid; agustin blasco; noelia ibañez escriche
data

eye 1

favorite 0

comment 0

Genomic Data from divergently selected lines for intramuscular fat in rabbits. This database was used for the analyses of the article titled: "Genomic regions influencing intramuscular fat in divergently selected rabbit lines." Animal Genetics - Journal, accepted in 2019.
Source: https://figshare.com/articles/dataset/Genomic_Data_from_divergently_selected_lines_for_intramuscular_fat_in_rabbits/9934058/2
The Dataset Collection
by Brian Horsak; Djordje Slijepcevic; Anna-Maria Raberger; Caterine Schwab; Matthias Zeppelzauer; Marianne Worisch
data

eye 3

favorite 0

comment 0

This file is structured as a matrix with N rows x M columns. Each row holds the data of one subject and trial. The first column identifies each subject "SUBJECT_ID", the second column each recording session "SESSION_ID", and the third column each single trial within a recording session "TRIAL_ID".
Source: https://springernature.figshare.com/articles/dataset/GRF_F_AP_PRO_left/11394816/1
The Dataset Collection
by Broad DepMap
data

eye 1

favorite 0

comment 0

This dataset contains the results of Avana library CRISPR-Cas9 genome-scale knockout (prefixed with Achilles) as well as mutation, copy number and gene expression data (prefixed with CCLE) for cancer cell lines as part of the Broad Institute’s Cancer Dependency Map project. We have repackaged our fileset to include all quarterly-updating datasets produced by DepMap. The Avana CRISPR-Cas9 genome-scale knockout data has expanded to include 739 cell lines, the RNAseq data includes 1270 cell...
Source: https://figshare.com/articles/dataset/DepMap_20Q1_Public/11791698/2
Bulk Bibliographic Metadata
by Bruns A, Lenke C, Schmidt C, Taubert NC
data

eye 20

favorite 0

comment 0

ISSN-GOLD-OA provides a matching list of ISSN for Gold Open Access (OA) journals. The intention was to compile a matching table that is as complete as possible by using different publicly available sources. The data set offers a basis for various journal-related issues in bibliometric studies on Gold OA. The list is an updated version of ISSN-GOLD-OA . For a detailed description of the method, data sources used and the definition of the table fields, please refer to the original...
Web PDF Training Sets
by Bryan Newbold
data

eye 8

favorite 0

comment 0

Fatcat Database Snapshots and Bulk Metadata Exports
by Bryan Newbold
data

eye 19

favorite 0

comment 0

This item contains compiled binaries and packages (for apt and homebrew) for the fatcat-cli utility. Source code available at: https://gitlab.com/bnewbold/fatcat-cli
The Dataset Collection
by Bubba Brooks
data

eye 1

favorite 0

comment 0

OTU table generated from the Lotus run for this manuscript: "The developing premature infant gut microbiome is a major factor shaping the microbiome of neonatal intensive care unit rooms."
Source: https://figshare.com/articles/dataset/OTU_txt/6225686/1
The Dataset Collection
by Bubba, Tatiana A.; Juvonen, Markus; Lehtonen, Jonatan; März, Maximilian; Meaney, Alexander; Purisha, Zenith; Siltanen, Samuli
data

eye 1

favorite 0

comment 0

This is an open-access dataset of tomographic X-ray data of a carved cheese. The dataset consists of the X-ray sinogram of a single 2D slice of the cheese slice with three different resolutions, and the corresponding measurement matrices modeling the linear operation of the X-ray transform. Each of the sinograms was obtained from a measured 360-projection fan-beam sinogram by down-sampling and taking logarithms. The original (measured) sinogram is also provided in its original form and...
Source: https://zenodo.org/record/1254210
Model weights / tensorflow checkpoints for Max F. Burg et al. (2021): Learning Divisive Normalization in Primary Visual Cortex CC0 Waiver
Source: https://data.goettingen-research-online.de/dataset.xhtml?persistentId=doi:10.25625/0JCXYO&version=2.0
We explore how support for radical parties of both the left and right may be shaped by what we call ‘positional deprivation’, where growth in income of individuals at a given point in the income distribution is outpaced by income growth elsewhere in that distribution. We argue that positional deprivation captures the combination of over-time and relative misfortune that can be expected to distinctly spur support for radical left and right parties. We explore this possibility by matching new...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/0FBCTJ&version=1.0
The Dataset Collection
by Bushra Zafar; Nouman Ali
data

eye 3

favorite 0

comment 0

The SIRI-WHU dataset comprises of 2400 images organized into 12 categories. Dataset is uploaded in parts. Complete dataset can be obtained by combining SIRI_WHU1 and SIRI_WHU2 respectively. Zhao B, Zhong Y, Xia GS, Zhang L. Dirichlet-derived multiple topic scene classification model for high spatial resolution remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing. 2016;54(4):2108{2123.
Source: https://figshare.com/articles/dataset/SIRI_WHU_Dataset/8796980/2
Internet Archive Research Publication Crawls
by CNKI
data

eye 0

favorite 0

comment 0

Metadata about COVID-19 papers downloaded from:  http://en.gzbd.cnki.net/GZBT/brief/Default.aspx
Bulk Bibliographic Metadata
by CORE
data

eye 17

favorite 0

comment 0

This item contains mappings between CORE (https://core.ac.uk/) internal identifiers (simple integer numbers) and DOIs. This listing (a simple two-column TSV file) is derived from their publicly available metadata corpus.
Bulk Bibliographic Metadata
by CORE.ac.uk
data

eye 76

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset CORE Dataset to Microsoft Academic Graph (MAG) mapping (80MB compressed, 173 MB in total) - 8.9M items License: Open Data Commons Attribution (ODC-By) license.
Bulk Bibliographic Metadata
by CORE.ac.uk
data

eye 17

favorite 0

comment 0

Mirrored from: https://core.ac.uk/documentation/dataset Dataset created for Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings (LREC 2020) (62 MB compressed, 204 MB in total) License: Open Data Commons Attribution (ODC-By) license.
The Dataset Collection
by Caitlin Kowalsky
data

eye 1

favorite 0

comment 0

Sequencing file used for data analysis. See README.txt for more information.
Source: https://figshare.com/articles/dataset/SPP_CG_F/1270759/1
The Dataset Collection
data

eye 1

favorite 0

comment 0

Second experimental replicate. GC-MS of the co-culture, S. cerevisiae alone, and A. malorum alone for initial putative identification of metabolites. Methanol extraction of XAD-4 beads Includes standards and samples used for EICs in paper.
Source: https://figshare.com/articles/dataset/GC-MS_Data_experimental_replicate_2_co-culture_and_individual_cultures/3124891/1
Bulk Bibliographic Metadata
by Cariniana
data

eye 20

favorite 0

comment 0

Downloaded from, eg:  https://cariniana.ibict.br/index.php/preservacao-de-publicacoes-digitais/periodicos-eletronicos
The Dataset Collection
by Carolina Osuna Mascaró
data

eye 1

favorite 0

comment 0

Here we present the floral transcriptomes of 18 populations from seven species of the genus Erysimum (Brassicaceae). Transcriptomes were de novo assembled.
Source: https://figshare.com/articles/dataset/Erysimum_assemblies/11877786/2
The Dataset Collection
by Carson Witte
data

eye 1

favorite 0

comment 0

MODIS sst 15-year timeseries in greater Kotzebue sound
Source: https://figshare.com/articles/dataset/sst_modis_day_nc/7423247/1
The Dataset Collection
data

eye 4

favorite 0

comment 0

Replication materials for "A Framework for Measuring Leaders' Willingness to Use Force" by Jeff Carter and Charles E. Smith, Jr. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/7WFX1K&version=1.0
This is an agreement (“Agreement”) between you the downloader (“Downloader”) and the owner of the materials (“User”) governing the use of the materials (“Materials”) to be downloaded. I. Acceptance of this Agreement By downloading or otherwise accessing the Materials, Downloader represents his/her acceptance of the terms of this Agreement.   II. Modification of this Agreement Users may modify the terms of this Agreement at any time. However, any modifications to this Agreement...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IAH6Z6&version=6.1
The Dataset Collection
by Cefan Zhou; Xuehong Qian; Miao Hu; Rui Zhang; Nanxi Liu; Yuan Huang; Jing Yang; Juan Zhang; Hua Bai; Yuyan Yang; Yefu Wang; Declan Ali; Marek Michalak; Xing-Zhen Chen; Jingfeng Tang
data

eye 1

favorite 0

comment 0

Macroautophagy/autophagy plays key roles in development, oncogenesis, and cardiovascular and metabolic diseases. Autophagy-specific class III phosphatidylinositol 3-kinase complex I (PtdIns3K-C1) is essential for autophagosome formation. However, the regulation of this complex formation requires further investigation. Here, we discovered that STYK1 (serine/threonine/tyrosine kinase 1), a member of the receptor tyrosine kinases (RTKs) family, is a new upstream regulator of autophagy. We...
Source: https://tandf.figshare.com/articles/dataset/STYK1_promotes_autophagy_through_enhancing_the_assembly_of_autophagy-specific_class_III_phosphatidylinositol_3-kinase_complex_I/10265156/1
The Dataset Collection
by ChanSu Park
data

eye 1

favorite 0

comment 0

Practice building additional datasets using the datasets of sklearn
Source: https://figshare.com/articles/dataset/lfw_funneled_tar/7699448/2
The Dataset Collection
by Chaoxiang Ren
data

eye 1

favorite 0

comment 0

A part of transcriptome sequencing data of safflower flowers cultivated under different light intensities.
Source: https://figshare.com/articles/dataset/HS3_2_2_fq_gz/8281109/1
The Dataset Collection
by Charles Tapley Hoyt
data

eye 1

favorite 0

comment 0

This dataset is now maintained on Zenodo. See: https://zenodo.org/record/4020486
Source: https://figshare.com/articles/dataset/Ooh_Na_Na/12149241/1
The Dataset Collection
data

eye 1

favorite 0

comment 0

This project presents a dataset that is assembled from multiple sources between 2013 and 2017, including contaminants in drinking water, cancer incidence rates, public perception of the relationship between water contaminants and cancer on Twitter, and census data covering the population living in the United States. The units of analysis are 3,219 counties and 33,144 zip codes. The users of this dataset can address model-driven questions regarding water contaminants and cancer incidence rates...
Source: https://figshare.com/articles/dataset/A_dataset_integrating_water-related_public_health_social_media_census_and_administrative_data_in_the_United_States/12673157/2
The Dataset Collection
by Chi Chen; Weike Ye; Yunxing Zuo; Shyue Ping Ong
data

eye 1

favorite 0

comment 0

This file contains the graph representation of structures in the Materials Project (www.materialsproject.org) and target properties, including formation energy per atom, band gap, and for a subset of 5830 structures, the shear moduli G_{VRH} and bulk moduli K_{VRH}. This data is part of the our paper "Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals". Change log: v3. For the graph dictionaries, we modify the "node" key to "atom"...
Source: https://figshare.com/articles/dataset/Graphs_of_materials_project/7451351/3