15M
15M
Jul 17, 2018
07/18
by
Internet Archive Web Group
Web archive data from a crawl of open access PDF URLs provided by Unpaywall.
11.1M
11M
Apr 9, 2018
04/18
by
Internet Archive Web Group
1.7M
1.7M
Oct 31, 2018
10/18
by
Internet Archive Web Group
Crawl of "upstream" URLs from CORE (core.ac.uk) metadata dump. Only a partial seedlist of files crawled.
3.3M
3.3M
Jun 1, 2018
06/18
by
Internet Archive Web Group
7,194
7.2K
Sep 6, 2018
09/18
by
"Paywall The Movie"
movies
"Paywall: The Business of Scholarship" is a documentary film released in 2018 about the scholarly publishing industry and the Open Access movement. More information available from https://paywallthemovie.com/paywall Website blurb: "Paywall: The Business of Scholarship is a documentary which focuses on the need for open access to research and science, questions the rationale behind the $25.2 billion a year that flows into for-profit academic publishers, examines the 35-40% profit...
Topics: Open Access, Copyright, Publishing
6,567
6.6K
May 7, 2018
05/18
by
Internet Archive Web Group
This collection contains web crawl data for a random selection of 500k (0.5 million) Crossref DOI redirects, including the doi.org redirect requests. The intent of this crawl is to gather loose statistics on the number of failing redirects and the number of host websites that block automated crawling, and to build a corpus of HTML landing pages for metadata extraction (e.g., "signposting" HTTP headers, linked data HTML metadata, semantic markup). Total size of (uncompressed) WARC data is 50 GB,...
linux.conf.au is a conference about the Linux operating system, and all aspects of the thriving ecosystem of Free and Open Source Software that has grown up around it. Run since 1999, in a different Australian or New Zealand city each year, by a team of local volunteers, LCA invites more than 500 people to learn from the people who shape the future of Open Source. For more information on the conference see https://linux.conf.au/
Topic: linux
Lost-and-found animals, items, and other messages transmitted on our public streets. Mostly in San Francisco.
Topic: poster
Downloaded from: https://zenodo.org/record/1438356
This is intended to be an exact copy of the item "stackexchange" (https://archive.org/details/stackexchange) as of 2018-03-14. That item is continuously updated by Stack Exchange (which is great!); this snapshot could be helpful if something goes wrong with that process, or might be helpful for researchers if the upstream schema changes or to check for missing/changed data. See the "upstream" item for details and license/policy details.
Bathymetry TIFF, Lake Victoria Bathymetry, raster, 2017
Reference Information and Units:
Projected Coordinate System: Africa Lambert Conformal Conic ESRI:102024 (https://epsg.io/102024)
Geographic Datum: D_WGS_1984
Pixel Size: 100 meters
Units: Meters
File Naming Convention: LV_Bathy_V7.tif
Data Origin: The point data was obtained from an Admiral Bathymetry map and points collected in the field. In addition, we used points from other maps and acoustic sounding data. The final input point total...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/SOEKNR&version=10.2
This item contains a copy of the 2018-09-03 snapshot of bibliographic metadata extracted from Wikidata. These datasets were downloaded from: http://uri.gbv.de/wikicite/20180903/ More information at: https://github.com/wikicite/wikicite-data#readme and http://wikicite.org/
Downloaded from: https://www.ebsco.com/sites/g/files/nabnos191/files/acquiadam-assets/Jan-Szczepanski-Open-Access-Journals-2018_0.docx
This is a mapping between:
- DOIs (Crossref)
- PubMed PMID and PMCID (NIH)
- CORE record identifier (core.ac.uk)
- Wikidata QIDs
See README and scripts for details.
Academic Data and Datasets
1
1.0
Dec 18, 2021
12/21
by
D. Louis Collins; Gabriel Allan Devenyi; Raihaan Patel; Stephanie Tullo; Min Tae M Park; M. Mallar Chakravarty
data
Pseudo-MRIs were created by manually assigning an intensity value to each label value of the histologically derived atlas, based on the intensity of the matching structure in the MRI template to which it is registered (Colin27 MRI template); done separately for each hemisphere.
Source: https://springernature.figshare.com/articles/dataset/Pseudo-MRI_for_Colin27_MRI_template_right_hemisphere_MINC_format_/6067994/1
The raw NGSequencing data of a library prepared by MACE technology from five pea plants with mutant phenotype of F2 (NGB1238 × N24) mapping population.
Source: https://figshare.com/articles/dataset/M2-library/7409270/1
uv (visibility) continuum and spectral line data for Per-emb-13, also known as NGC 1333 IRAS 4B. Field includes IRAS 4B'. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/YPGSYV&version=1.1
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2639599
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2625771
Ka-band VLA data for Per-emb-1/HH211-MMS CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/8Z9AXP&version=2.0
Newspaper CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/DWN6PA&version=1.0
In Thailand, India, Libya, and elsewhere, governments arm the populace or call up volunteers in irregular armed groups despite the risks this entails. The widespread presence of these militias, outside the context of state failure, challenges the expectation that governments uniformly consolidate the tools of violence. Drawing on the logic of delegation, we resolve this puzzle by arguing that governments have multiple incentives to form armed groups with a recognized link to the state but...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/N6M2OQ&version=1.1
OpenAIRE LOD Dump
Source: https://zenodo.org/record/1321718
identifier-refinery: Tools and assets for easy and reproducible gene identifier conversion.
Methods: This repository is used to build matrices which can convert between different gene identifiers. These conversion matrices are built by:
- Randomly choosing raw CEL files from NCBI GEO for a given platform accession code (in /cels)
- Reading the CEL header and joining Brainarray (e.g., hgu133plus2hsensgprobe) and Bioconductor (e.g., hgu133plus2.db) (x, y) coordinates
- Finding intersecting probe...
Source: https://zenodo.org/record/1327563
The eCLIP data provided here is a subset of the eCLIP data of RBFOX2 from a study published by Nostrand et al. (2016, http://dx.doi.org/10.1038/nmeth.3810). The dataset contains the first biological replicate of RBFOX2 CLIP-seq and the input control experiment (*fastq files). The data was changed and downsampled to reduce data processing time, so the datasets do not correspond to the original data pulled from Nostrand et al. (2016, http://dx.doi.org/10.1038/nmeth.3810). Also included is...
Source: https://zenodo.org/record/1327423
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=_djVv6OPEso strain : OW939 timestamp : 2014-03-23T19:18:59+01:00 strain_description : zgIs113[P(dat-1)::alpha-Synuclein::YFP] sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who :...
Source: https://zenodo.org/record/1192659
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=wgWwkUBLjWI strain : AQ2947 timestamp : 2014-05-08T09:24:31+02:00 strain_description : CGC N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1191135
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=qEjGiv6BP1Q strain : OW953 timestamp : 2014-03-27T15:03:21+01:00 strain_description : zgIs138[P(dat-1)::YFP] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1193450
This sample is part of the GutCyc Collection (www.gutcyc.org), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, which enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS049164_scaffolds_zip/3478697/1
Raw data for the experiments in Section 7 conducted for the paper https://papers.nips.cc/paper/5872-efficient-and-robust-automated-machine-learning
Source: https://figshare.com/articles/dataset/Efficient_and_Robust_Automated_Machine_Learning_-_Section_6/3824103/2
A data-set of partially stirred reactor states generated by the pypasr (https://github.com/kyleniemeyer/pypasr) code for GRI-Mech 3.0 (http://combustion.berkeley.edu/gri-mech/version30/text30.html). The data is saved in the numpy .npz format, and is organized as follows: The keys of the .npz file are the original filenames, e.g. pasr_out_0.npy, etc. The files correspond to the following inlet conditions: 0, 1, 2: 400, 600, 800 K at 1 atm 3, 4, 5: 400, 600, 800 K at 10 atm 6, 7, 8: 400, 600, 800...
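The key-to-condition layout described above can be sketched in Python as a small lookup. This is only an illustration: it assumes every key follows the `pasr_out_N.npy` pattern given as an example, the function name `inlet_conditions` is hypothetical, and the pressure for keys 6 to 8 is left as `None` because the description is cut off before stating it.

```python
import re
from typing import Optional, Tuple

# Temperatures repeat in groups of three, as listed in the description.
TEMPERATURES_K = (400, 600, 800)
# Only the pressures the description spells out; the third group is truncated there.
PRESSURES_ATM = {0: 1, 1: 10}


def inlet_conditions(key: str) -> Tuple[int, Optional[int]]:
    """Map an .npz key such as 'pasr_out_4.npy' to (temperature in K, pressure in atm)."""
    match = re.fullmatch(r"pasr_out_(\d+)\.npy", key)
    if match is None:
        raise ValueError(f"unexpected key: {key!r}")
    index = int(match.group(1))
    if index > 8:
        raise ValueError(f"index {index} is not covered by the description")
    group, offset = divmod(index, 3)
    # Pressure is None when the description does not state it.
    return TEMPERATURES_K[offset], PRESSURES_ATM.get(group)
```

The keys themselves would come from opening the `.npz` file, for example with `numpy.load`, and iterating over the names it exposes.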
Source: https://figshare.com/articles/dataset/ch4_pasr_data_bin/4007418/2
Molecular phylogenetic inference is inherently dependent on choices in both methodology and data. Many insightful studies have shown how choices in methodology, such as the model of sequence evolution or optimality criterion used, can strongly influence inference. In contrast, much less is known about the impact of choices in the properties of the data, typically genes, on phylogenetic inference. We investigated the relationships between 52 gene properties (24 sequence-based, 19 function-based,...
Source: https://figshare.com/articles/dataset/A_genome-scale_investigation_of_how_sequence-_function-_and_tree-based_gene_properties_influence_phylogenetic_inference/1597710/2
Academic Data and Datasets
1
1.0
Dec 16, 2021
12/21
by
Alexander V. Georgiev; Diana Christie; Kevin A. Rosenfield; Angelina V. Ruiz-Lambides; Elizabeth Maldonado; Melissa Emery Thompson; Dario Maestripieri
data
Explaining intraspecific variation in reproductive tactics hinges on measuring associated costs and benefits. Yet, this is difficult if alternative (purportedly less optimal) tactics remain unobserved. We describe a rare alpha-position take-over by an immigrant male rhesus macaque in a population where males typically gain rank via succession. Unusually, male aggressiveness after the take-over correlated with rank and mating success. The new alpha achieved the highest mating and reproductive...
Source: https://brill.figshare.com/articles/dataset/Movie_file_for_Breaking_the_succession_rule_the_costs_and_benefits_of_an_alpha_status_take_over_by_an_immigrant_rhesus_macaque_on_Cayo_Santiago/2198989/1
Scholars, practitioners, and pundits often leave their assessments of uncertainty vague when debating foreign policy, arguing that clearer probability estimates would provide arbitrary detail instead of useful insight. We provide the first systematic test of this claim using a data set containing 888,328 geopolitical forecasts. We find that coarsening numeric probability assessments in a manner consistent with common qualitative expressions—including expressions currently recommended for use...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/D9FAZL&version=1.0
uv (visibility) continuum and spectral line data for Per-emb-15 CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/DZWOXZ&version=1.0
Huntington referred to a ‘clash of civilizations’ revealing itself in international terrorism, particularly in the clash between the Islamic civilization and the West. The authors confront his hypotheses with ones derived from the strategic logic of international terrorism. They predict more terrorism against nationals from countries whose governments support the government of the terrorists’ home country. Like Huntington, they also predict excessive terrorism on Western targets, not...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/HOGQGD&version=1.0
This replication archive contains all data and code to replicate the results in "Measuring Political Positions from Legislative Speech" by Benjamin E. Lauderdale and Alexander Herzog. Article abstract: Existing approaches to measuring political disagreement from text data perform poorly except when applied to narrowly selected texts discussing the same issues and written in the same style. We demonstrate the first viable approach for estimating legislator-specific scores from the...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/RQMIV3&version=1.0
Computational methods that automatically extract knowledge from data are critical for enabling data-driven materials science. A reliable identification of lattice symmetry is a crucial first step for materials characterization and analytics. Current methods require a user-specified threshold, and are unable to detect "average symmetries" for defective structures. Here, we propose a new machine-learning-based approach to automatically classify structures by crystal symmetry. First, we...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ZDKBRF&version=2.0
DNS data in forced turbulence for case 1-part a
Source: https://figshare.com/articles/dataset/entrainment3df_case1_tar_gzaa/5821086/1
Human brain tissues were obtained from the Wuhan brain bank in accordance with the brain bank protocol. Ethical agreements were obtained from the donors or their relatives by written informed consent. Total RNA was isolated from the frozen prefrontal cortex tissue using the Trizol (Invitrogen, USA) protocol with no modifications. Low molecular weight RNA was isolated, ligated to the adapters, amplified, and sequenced following the Small RNA preparation protocol (Illumina, USA) with no...
Source: https://figshare.com/articles/dataset/human_brain_smRNA_seq_bz2/6893138/1
Refinement of supervised Vizbin binning strategy to reconstruct metagenome assembled genomes (MAGs).
Source: https://figshare.com/articles/dataset/Anvio_database/6170420/1
Data for Figure 2, Elsden & Wright, 2018b
Source: https://figshare.com/articles/dataset/Data_for_Figure_2_Elsden_Wright_2018/7235153/2
MODIS sst 15-year timeseries in greater Kotzebue sound
Source: https://figshare.com/articles/dataset/sst_modis_day_nc/7423247/1
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=IRcoeqEf3JQ strain : CB1112 timestamp : 2014-03-26T09:16:50+01:00 strain_description : cat-2(e1112)II sex : hermaphrodite stage : adult ventral_side : anticlockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1191686
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=qYtdVcAFUL8 strain : AQ2947 timestamp : 2014-04-10T10:17:57+02:00 strain_description : CGC N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1200948
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=Z5d0z3g9sDA strain : AQ2947 timestamp : 2014-05-08T08:22:32+02:00 strain_description : CGC N2 (Bristol, UK) sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who : Celine N. Martineau,...
Source: https://zenodo.org/record/1192651
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2641306
Academic Data and Datasets
5
5.0
Dec 16, 2021
12/21
by
Ian Hinder; Kidder, Larry; Pfeiffer, Harald; Scheel, Mark; Boyle, Michael; Hemberger, Dan; Lovelace, Geoffrey; Szilagyi, Bela
data
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2625759
This item contains a snapshot of the "Norwegian Register for Scientific Journals, Series and Publishers", as downloaded from https://dbh.nsd.uib.no/publiseringskanaler/AlltidFerskListe. As the name indicates, this is a registry of international Journals (aka "titles", or "serials"); the scope is not limited to Norwegian or Nordic publications.
This is a mirror of the RDF dump posted at: http://ma-graph.org/rdf-dumps/ The license provided with this metadata is: Open Data Commons Attribution License (ODC-By) v1.0
This is a copy of the Microsoft Academic Graph corpus of scholarly publications and citations, based on crawls from the open web. Metadata (authors, DOI numbers, journals, citations, keywords, affiliations, etc) is included for more than 125 million publications. The corpus is a single 27GB zipfile that extracts into about 96GB of flat tab-separated text files, cross-referenced using identifier columns. Schema information can be found in the `readme.txt` file, and usage restrictions can be...
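Since the corpus extracts to roughly 96GB, it may be convenient to stream rows straight out of the zipfile rather than unpacking it first. The following Python sketch shows one way to do that with the standard library. It assumes the files are UTF-8 text with one record per line and no quoting; the function name and any archive or member names passed to it are hypothetical, and `readme.txt` remains the authority on the actual schema.

```python
import csv
import io
import zipfile
from typing import Iterator, List


def iter_tsv_rows(zip_path: str, member: str) -> Iterator[List[str]]:
    """Yield each tab-separated row of one file stored inside a zip archive."""
    with zipfile.ZipFile(zip_path) as archive:
        with archive.open(member) as raw:
            # Decode lazily so the member is never loaded into memory in full.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            # QUOTE_NONE treats quote characters as ordinary data.
            reader = csv.reader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
            for row in reader:
                yield row
```

Rows come back as lists of strings, so the identifier columns used for cross-referencing can be picked out by position once the schema has been checked.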
Downloaded from: https://grid.ac/downloads
6
6.0
May 30, 2017
05/17
by
yjerem
software
creates a static html archive that mixes external links with internal mirroring
To restore the repository download the bundle yjerem-estate_-_2016-01-22_22-50-06.bundle and run:
git clone yjerem-estate_-_2016-01-22_22-50-06.bundle -b master
Source: https://github.com/yjerem/estate
Uploader: yjerem
Upload date: 2016-01-22
Topics: GitHub, code, software, git
This file is a snapshot dump of the Crossref DOI metadata API, containing entries for over 99 million DOIs. It was generated by running the scripts at: https://github.com/greenelab/crossref (git commit: 768a49ba1d8ba1971f00471950514716a9f699c8). The script completed on 2018-09-20. Format is xz-compressed JSON (one JSON object per line).
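A file in this format (xz-compressed, one JSON object per line) can be read incrementally with the Python standard library. The sketch below is a minimal illustration, not the tooling used to produce the dump; the function name `iter_records` is hypothetical and the file is assumed to be UTF-8 encoded.

```python
import json
import lzma
from typing import Iterator


def iter_records(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of an xz-compressed file."""
    # Text mode decompresses on the fly, so the file is never held in memory whole.
    with lzma.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)
```

Because records are yielded one at a time, counting or filtering entries does not require decompressing the whole dump to disk.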
Downloaded from https://core.ac.uk/services "The data aggregated from repositories by the CORE system can be accessed in two ways, through the CORE API or by downloading the data to your computer. The former option is practical if you want to build a service on top of CORE while the latter is something we recommend to those who would like to analyse the CORE dataset and/or apply some computationally intensive batch processes. If you use CORE in your work, we kindly request you to cite one...
37
37
Nov 2, 2019
11/19
by
Paul Michael Deutchman, Jess Sullivan
data
This is an OSF registration, part of a Center for Open Science (COS) / Internet Archive (IA) partnership. For more, read the blog post. You can browse the full collection at https://archive.org/details/cos-dev-sandbox. See the original registration at: osf.io That's all, folks! This was a dry run.
This experiment is part of the C. elegans behavioural database. For more information and the complete collection of experiments visit http://movement.openworm.org preview link : https://www.youtube.com/watch?v=2YoQTP7l9IM strain : OW949 timestamp : 2014-03-19T08:41:41+01:00 strain_description : zgIs125[P(dat-1)::alpha-Synuclein::YFP] sex : hermaphrodite stage : adult ventral_side : clockwise media : NGM agar low peptone arena : style : petri size : 35 orientation : away food : OP50 who :...
Source: https://zenodo.org/record/1192123
Globcover 2009 (Global Land Cover. European Space Agency and the Université catholique de Louvain, 2010 - http://due.esrin.esa.int/page_globcover.php). Resampled from original spatial resolution of 250 m to 100 m and provided as 1201x1201 pixel tiles. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/XALRAG&version=1.0
uv (visibility) continuum and spectral line data for Per-emb-7
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/A8XY2V&version=1.0
Log and boot files of game 67
Source: https://zenodo.org/record/1325331
This is model output from GEPIC for wheat as part of AgMIP's Global Gridded Crop Model Intercomparison (GGCMI) phase 1 output data set. The data have been generated following the modeling protocol of Elliott et al. (2015) and have been used to evaluate the models (Müller et al., 2017). A data description paper has been published in Scientific Data (Müller et al. 2019). References: Elliott J, Müller C, Deryng D, Chryssanthacopoulos J, Boote KJ, Büchner M, Foster I, Glotter M, Heinke J, Iizumi...
Source: https://zenodo.org/record/1408571
Simulation of a black-hole binary system evolved by the SpEC code.
Source: https://zenodo.org/record/2642028
OTU table generated from the Lotus run for this manuscript: "The developing premature infant gut microbiome is a major factor shaping the microbiome of neonatal intensive care unit rooms."
Source: https://figshare.com/articles/dataset/OTU_txt/6225686/1
This dataset includes raw downscaled climate projections (1950-2100) for multiple locations across the U.S. with estimations of temperature humidity index, heat stress frequency, dry matter intake and milk production loss with or without heat abatement. This is the first set of data
Source: https://figshare.com/articles/dataset/Dataset_Dairy_Heat_Stress_Gunn_et_al_2018_PLOS_ONE_part_1/7148651/1
Academic Data and Datasets
1
1.0
Dec 19, 2021
12/21
by
Friederike Ehrhart; Martina Kutmon; Egon Willighagen; Chris T. Evelo; Jonathan Mélius
data
Gene-to-variant BridgeDb mapping database for Ensembl version 92
Source: https://figshare.com/articles/dataset/BridgeDb_SNP_collections_Ensembl_version_92/7326671/1
This Dataset accompanies: Benigni, Matthew, Joseph, Kenneth, and Carley, Kathleen. n.d. “Online Threat-Group-Supporting Community Detection: Uncovering the ISIS Supporting Community on Twitter.” Under Review Plos One. and can be analyzed using R source code provided at: https://github.com/mbenigni/OSNThreatGroups The following files are contained in this dataset: Files: deIdentified_attributes.csv - contains node attribute information for users associated with the 2 hop snowball sample...
Source: https://figshare.com/articles/dataset/PLOS_One_Data_Accompanies_Online_Threat_Group_Supporting_Community_Detection_Uncovering_the_ISIS_Supporting_Community_on_Twitter_/3166798/1
This folder contains raw Illumina MiSeq sequence files. We amplified a 313 bp fragment of the mitochondrial Cytochrome c Oxidase Subunit I gene (COI) for benthic coral reef samples collected in the Red Sea. Details of samples are provided in the metadata file "Pearmanetal_HiddenMajority_COI_metadata.xls".
Source: https://figshare.com/articles/dataset/Pearmanetal_HiddenMajority_Archive-1_zip/5549365/1
A mirror of the Unpaywall (aka oaDOI.org) metadata corpus, primarily consisting of public open access flags for a large number of Crossref-registered DOIs (identifiers representing published journal articles and other works). For more information see: http://unpaywall.org/products/snapshot
Data-munged title-level metadata combined from: DOAJ, ROAD, Norwegian Register, and Internet Archive crawled metadata. See SOURCES.md for URLs of upstream metadata, and ISSN_matching.html for Jupyter notebook used to derive this dataset.
111
111
Jun 4, 2018
06/18
by
Internet Archive Web Group
software
This item contains re-compiled .jar files for JVM (Java, Scala, etc) software packages used by the archive's "sandcrawler" journal ingest pipeline.
Downloaded from https://doaj.org/csv and the OAI-PMH interface. File names encode the date when data was downloaded.
Downloaded from https://core.ac.uk/services "The data aggregated from repositories by the CORE system can be accessed in two ways, through the CORE API or by downloading the data to your computer. The former option is practical if you want to build a service on top of CORE while the latter is something we recommend to those who would like to analyse the CORE dataset and/or apply some computationally intensive batch processes. If you use CORE in your work, we kindly request you to cite one...
Replication files for "Ex Post Lobbying" CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/CWQYIB&version=1.0
Equivital SEM raw files for Integrated Biomedical System project CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/FD4B6C&version=1.0
The data and programs replicate tables and figures from "The Global Distribution of Economic Activity: Nature, History, and the Role of Trade", by Henderson, Squires, Storeygard, and Weil. Data were constructed from various sources. Please see the Readme file for additional details. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/MO6RJT&version=1.1
Enumeration of all blocks in the blockchain of Namecoin, divided by year. The data are from Namecoin Explorer, which is available from: https://bitinfocharts.com/de/namecoin/explorer/. CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/M9K5OJ&version=2.0
Log and boot files of game 223
Source: https://zenodo.org/record/1325662
Advertising expenditures in congressional campaigns are made not directly by campaigns themselves but indirectly through intermediary firms. Using a new dataset of revenues and costs of these firms, we study the markups that these firms charge candidates. We find that markups are higher for inexperienced candidates relative to experienced candidates, and PACs relative to candidates. We also find significant differences across the major parties: firms working for Republicans charge higher prices,...
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/XEXISR&version=1.1
Viral communities upstream and downstream of the River Murray Illumina MiSeq paired-end metagenomics CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/YTZCEO&version=1.0
Ustilago maydis FB1 sequence described in Kamper J et al. "Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis ". Nature 444, 97-101.
Source: https://figshare.com/articles/dataset/Ustilago_maydis_FB1_sequence/1444438/1
Textured surface model of section T3/39 to T3/41 of the T3 theropod dinosaur trackway at Münchehagen (Lower Cretaceous, Germany), preserved as natural casts
Source: https://figshare.com/articles/dataset/Surface_model_of_section_T3_39_to_T3_41_of_the_T3_theropod_dinosaur_trackway_at_M_nchehagen_Lower_Cretaceous_Germany_preserved_as_natural_casts/3027949/1
Second experimental replicate. GC-MS of the co-culture, S. cerevisiae alone, and A. malorum alone for initial putative identification of metabolites. Methanol extraction of XAD-4 beads. Includes standards and samples used for EICs in paper.
Source: https://figshare.com/articles/dataset/GC-MS_Data_experimental_replicate_2_co-culture_and_individual_cultures/3124891/1
Metaproteomic profiling of saliva in subjects with periodontitis, dental caries and orally healthy controls
Source: https://figshare.com/articles/dataset/20150722_QE5_UPLC8_RJC_COLLAB_DB_4067_01_raw/3807984/1
Academic Data and Datasets
0
0.0
Dec 20, 2021
12/21
by
Amir AghaKouchak; Mojtaba Sadegh; Ehsan Raei; Mohammad Reza Nikoo; Omid Mazdiyasni
data
Daily binary (0/1) occurrence records of heatwaves/warm-spells using Constant Threshold, EHF, ETCCDI, PDF and SHI Methods for the entire globe between 1979 to 2017.
Source: https://springernature.figshare.com/articles/dataset/GHWR_-_Record_-_Const_Thresh_EHF_ETCCDI_PDF_and_SHI_Methods/5885224/1
A data-set of partially stirred reactor states generated by the pypasr (https://github.com/kyleniemeyer/pypasr) code for GRI-Mech 3.0. Used in several works for error validation and performance testing
Source: https://figshare.com/articles/dataset/h2_pasr_data_bin/4007427/1
Mirrored from: https://www.arc.gov.au/excellence-research-australia/era-2018-journal-list
Contains (at least) a list of DOIs cited by various language Wikipedias as of March 2018. Transformed by Charles using lists linked from https://blog.wikimedia.org/2018/04/05/ten-most-cited-sources-wikipedia/
uv (visibility) continuum and spectral line data for Per-emb-24 CC0 Waiver
Source: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/SLWLWJ&version=1.0
Academic Data and Datasets
1
1.0
Dec 16, 2021
12/21
by
Timothy Arnett; Pascale V. Guillot; Anna Maria Ranzoni; Michelangelo Corcelli
data
MicroCT of mouse tibiae-wild-type6
Source: https://springernature.figshare.com/articles/dataset/MicroCT_of_mouse_tibiae-wild-type6/5525449/1
Academic Data and Datasets
1
1.0
Dec 16, 2021
12/21
by
Zhanwei Du; Yongjian Yang; Zeynep Ertem; Chao Gao; Liping Huang; Qiuyang Huang; Yuan Bai
data
Movement between locations
Source: https://figshare.com/articles/dataset/Week-Mobility-Network/7066685/2
Stress of worm under flow
Source: https://figshare.com/articles/dataset/cervantes_data23_shun_newmicron_shun_042717_flowworm_5X_4_/7523426/2
Redundant transcriptome assembly based on reads obtained from the library prepared from a cloacal tissue.
Source: https://figshare.com/articles/dataset/Redundant_transcriptome_assembly_-_Cloaca_Dataset_6_/6819617/1
Additional tables: the compositional variation of the Formula Diet fed to the experimental mice (Table 1); differentially expressed genes by effect of RS, DJ, and DJ526 (Table 2); the most highly significant up-regulated and down-regulated pathways in the livers of mice on RS, DJ, and DJ526 relative to those in the Ctrl groups (Table 3). Excel file: globally normalized data (fold change raw data), Z-transformed data (Z-ratio raw data), and GSEA results.
Source: https://figshare.com/articles/dataset/New_draft_item/3115552/3
This sample is part of the GutCyc Collection (www.gutcyc.org), a compendium of environmental pathway / genome databases. GutCyc was constructed from 418 human microbiome assemblies using the open-source MetaPathways pipeline, which enables reproducible metagenomic annotation.
Source: https://figshare.com/articles/dataset/SRS017191_zip/3478499/1
The archive file contains the merged anvi'o profile, and the contigs database for the Infant Gut data from Sharon et al. that is suitable to analyze with anvi'o v4 or later. Please see http://merenlab.org/tutorials/infant-gut/ for details.
Source: https://figshare.com/articles/dataset/Infant_Gut_Data_v2/3502445/14
Log and boot files of game 105
Source: https://zenodo.org/record/1325414
This is a snapshot of the AI2 (Semantic Scholar) "Open Research Corpus", as released May 3rd, 2018. These files were originally downloaded from AWS S3, via: http://labs.semanticscholar.org/corpus/ Note restrictions in the 'license.txt' file. 'index.html' is a backup of the landing page, which includes field content. 'sample-S2-records.gz' is a subset of the data useful for exploration. Semantic Scholar is a project of the Allen Institute for Artificial Intelligence.