Platform to facilitate sharing, discovery, and secure access to UCSF biomedical data. It''s powered by the Dataverse Network platform, which supports a variety of data types, as well as attribution and licensing needs. Researchers may share datasets, discover data from other labs, and reuse data. Links to tools and information that help scientists properly organize, manage, and document their datasets are also provided.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This repository contains a range of Arena-Hard-Auto benchmark artifacts sourced as part of the 2024 paper Style Outweighs Substance. Repository Structure Model Responses for Arena Hard Auto Questions: data/ArenaHardAuto/model_answer
Our standard reference model for pairwise comparisons was gpt-4-0314.
Our standard set of comparison models was:
Llama-3-8B Variants: bagel-8b-v1.0, Llama-3-8B-Magpie-Align-SFT-v0.2, Llama-3-8B-Magpie-Align-v0.2, Llama-3-8B-Tulu-330K, Llama-3-8B-WildChat… See the full description on the dataset page: https://huggingface.co/datasets/DataShare/sos-artifacts.
DataShare/QwQ_32B_setting_7 dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset compiles growing-season methane (CHâ‚„) flux measurements from permafrost-affected wetlands across the Northern Hemisphere, covering 154 observations from 40 published studies. It includes associated environmental variables such as permafrost type, water table depth, soil temperature, vegetation presence, and site climate conditions. The data support analyses of how permafrost continuity (continuous, discontinuous, sporadic) influences CHâ‚„ emissions through hydrological and ecological pathways. This resource is intended to support research on greenhouse gas dynamics in high-latitude ecosystems and to inform model development for predicting methane fluxes under permafrost thaw and climate change scenarios. We employed Webplot Digitalzer version 4 to extract numeric data from corresponding literature :" https://github.com/automeris-io/WebPlotDigitizer"
This is the data and code associated with the Publication: Han, B., Reidy, A., & Li, A. (2021). Modeling nutrient release with compiled data in a typical Midwest watershed. Ecological indicators, 121, 107213.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The sample data consist of return of 50 Baht Gold Futures and 10 Baht Gold Futures from the period August 3, 2010 to February 26, 2019 for the nearest even month contracts with 2,097 sample data points.
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Explore the historical Whois records related to datashare.link (Domain). Get insights into ownership history and changes over time.
Website which allows data from completed clinical trials to be distributed to investigators and public. Researchers can download de-identified data from completed NIDA clinical trial studies to conduct analyses that improve quality of drug abuse treatment. Incorporates data from Division of Therapeutics and Medical Consequences and Center for Clinical Trials Network.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data in relation with a study in the journal Land Degradation & Development
The Human Know-How Dataset describes 211,696 human activities from many different domains. These activities are decomposed into 2,609,236 entities (each with an English textual label). These entities represent over two million actions and half a million pre-requisites. Actions are interconnected both according to their dependencies (temporal/logical orders between actions) and decompositions (decomposition of complex actions into simpler ones). This dataset has been integrated with DBpedia (259,568 links). For more information see: - The project website: http://homepages.inf.ed.ac.uk/s1054760/prohow/index.htm - The data is also available on datahub: https://datahub.io/dataset/human-activities-and-instructions ---------------------------------------------------------------- * Quickstart: if you want to experiment with the most high-quality data before downloading all the datasets, download the file '9of11_knowhow_wikihow', and optionally files 'Process - Inputs', 'Process - Outputs', 'Process - Step Links' and 'wikiHow categories hierarchy'. * Data representation based on the PROHOW vocabulary: http://w3id.org/prohow# Data extracted from existing web resources is linked to the original resources using the Open Annotation specification * Data Model: an example of how the data is represented within the datasets is available in the attached Data Model PDF file. The attached example represents a simple set of instructions, but instructions in the dataset can have more complex structures. For example, instructions could have multiple methods, steps could have further sub-steps, and complex requirements could be decomposed into sub-requirements. ---------------------------------------------------------------- Statistics: * 211,696: number of instructions. From wikiHow: 167,232 (datasets 1of11_knowhow_wikihow to 9of11_knowhow_wikihow). From Snapguide: 44,464 (datasets 10of11_knowhow_snapguide to 11of11_knowhow_snapguide). * 2,609,236: number of RDF nodes within the instructions From wikiHow: 1,871,468 (datasets 1of11_knowhow_wikihow to 9of11_knowhow_wikihow). From Snapguide: 737,768 (datasets 10of11_knowhow_snapguide to 11of11_knowhow_snapguide). * 255,101: number of process inputs linked to 8,453 distinct DBpedia concepts (dataset Process - Inputs) * 4,467: number of process outputs linked to 3,439 distinct DBpedia concepts (dataset Process - Outputs) * 376,795: number of step links between 114,166 different sets of instructions (dataset Process - Step Links)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ARC-Lake v2.0 - Per-Lake contains data products on a lake-by-lake basis. These data products contain observations of Lake Surface Water Temperature (LSWT) and Lake Ice Cover (LIC) from the series of (Advanced) Along-Track Scanning Radiometers ((A)ATSRs). ARC-Lake v2.0 data products cover the period from 1st August 1991 to 31st December 2011. A number of different data products are available for each lake and are grouped together into a zip archive for each lake. A summary of the types of data product available is given on http://datashare.is.ed.ac.uk/handle/10283/88 and full details of the file naming convention and file contents are given in the ARC-Lake Data Product Description document (ARCLake_DPD_v1_1_2.pdf). Individual lake archives are grouped into larger zip archives by continent (with the exception of the Caspian Sea). Details of the methods used and a list of all lakes and their locations are given in the ARC-Lake Algorithm Theoretical Basis Document (ARC-Lake-ATBD-v1.3.pdf). Additional information about the ARC-Lake project and some basic data analysis tools can be found on the project website: http://www.geos.ed.ac.uk/arclake Please cite both this dataset and the related publication: * 'MacCallum, Stuart N; Merchant, Christopher J. (2013). ARC-Lake v2.0 - Per-Lake, 1991-2011 [Dataset]. University of Edinburgh. School of GeoSciences / European Space Agency. https://doi.org/10.7488/ds/161.' * 'MacCallum, S.N. and Merchant, C.J. (2012) Surface water temperature observations of large lakes by optimal estimation. Canadian Journal of Remote Sensing, 38 (1). pp. 25-45. ISSN 1712-7971 doi: 10.5589/m12-010'
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ARC-Lake v2.0 - Global contains data products with global coverage, i.e. data for all (available) lakes are included in each product. These data products contain observations of Lake Surface Water Temperature (LSWT) and Lake Ice Cover (LIC) from the series of (Advanced) Along-Track Scanning Radiometers ((A)ATSRs). ARC-Lake v2.0 data products cover the period from 1st August 1991 to 31st December 2011. A number of different data products are available and are grouped together into eight zip archives, by product type. A summary of the types of data product available is given on http://datashare.is.ed.ac.uk/handle/10283/88 and full details of the file naming convention and file contents are given in the ARC-Lake Data Product Description document (ARCLake_DPD_v1_1_2.pdf). Note that not all types of data product available on a per-lake basis are available as a global product. Details of the methods used and a list of all lakes and their locations are given in the ARC-Lake Algorithm Theoretical Basis Document (ARC-Lake-ATBD-v1.3.pdf). Additional information about the ARC-Lake project and some basic data analysis tools can be found on the project website: http://www.geos.ed.ac.uk/arclake/ Please cite both this dataset and the related publication: * 'MacCallum, Stuart N; Merchant, Christopher J. (2013). ARC-Lake v2.0 - Global, 1991-2011 [Dataset]. University of Edinburgh. School of GeoSciences / European Space Agency. https://doi.org/10.7488/ds/110.' * 'MacCallum, S.N. and Merchant, C.J. (2012) Surface water temperature observations of large lakes by optimal estimation. Canadian Journal of Remote Sensing, 38 (1). pp. 25-45. ISSN 1712-7971 doi: 10.5589/m12-010'
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data underlying the figures in the eScaf manuscript.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This is data from a published paper evaluating the impact of weight loss combined with exercise training on cardiometabolic risk factors (e.g. insulin resistance, body composition, fitness, lipids). The final paper is published in Plos ONE (Title:Effects of aerobic training with and without weight loss on insulin sensitivity and lipids)
Zip file with 5 Roam exercises in PDF and PPTX formats, plus trainer guide and Quick Guide. GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2014-04-10 and migrated to Edinburgh DataShare on 2017-02-22.
https://dataverse.nl/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=doi:10.34894/SVEUWIhttps://dataverse.nl/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=doi:10.34894/SVEUWI
The R-scripts and data in this study can be used to reproduce figures in the associated paper, which is based on a simulation of a scientific community. Model description: We construct a model in which there is a cost associated with sharing datasets whereas reusing such sets implies a benefit. In our calculations conflicting interests appear for researchers. Individual researchers are al ways better off not sharing and omitting the sharing cost, at the same time both sharing and not sharing researchers are better off if (almost) all researchers share. Namely, the more researchers share, the more benefit can be gained by the reuse of those datasets. We simulated several policy measures to increase benefits for researchers sharing or reusing datasets. Results point out that, although policies should be able to increase the rate of sharing researchers, and increased discoverability and dataset quality could partly compensate for costs, a better measure would be to directly lower the cost for sharing, or even turn it into a (citation-) benefit.
Environment Roam exercise. Zip file contains exercise plus annotations file and Quick Guide. GIS vector data. This dataset was first accessioned in the EDINA ShareGeo Open repository on 2014-04-10 and migrated to Edinburgh DataShare on 2017-02-22.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Recent observations demonstrate that misalignments and other out-of-plane structures are common in protoplanetary discs. Many of these have been linked to a central host binary with an orbit that is inclined with respect to the disc. We present simulations of misaligned circumbinary discs with a range of parameters to gain a better understanding of the link between those parameters and the disc morphology in the wave-like regime of warp propagation that is appropriate to protoplanetary discs. The simulations confirm that disc tearing is possible in protoplanetary discs as long as the mass ratio, m, and disc-binary inclination angle, i, are not too small. For the simulations presented here this corresponds to m>0.1 and i40. For highly eccentric binaries, tearing can occur for discs with smaller misalignment. Existing theoretical predictions provide an estimate of the radial extent of the disc in which we can expect breaking to occur. However, there does not seem to be a simple relationship between the disc properties and the radius within the circumbinary disc at which the breaks appear, and furthermore the radius at which the disc breaks can change as a function of time in each case. We discuss the implications of our results for interpreting observations and suggest some considerations for modelling misaligned discs in the future. The dataset contains files for generating the models presented in the upcoming publication Alison K. Young, Struan Stevenson, C.J. Nixon, Ken Riceand (in submission) 'On the conditions for warping and breaking protoplanetary discs'. Files provided here relate to simulations A-K. See also a second DataShare deposit (https://datashare.ed.ac.uk/handle/10283/8516), which contains files for the simulations L-S.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Platform to facilitate sharing, discovery, and secure access to UCSF biomedical data. It''s powered by the Dataverse Network platform, which supports a variety of data types, as well as attribution and licensing needs. Researchers may share datasets, discover data from other labs, and reuse data. Links to tools and information that help scientists properly organize, manage, and document their datasets are also provided.