43 datasets found

n
Data from: Data Management and Sharing: Practices and Perceptions of...
data.niaid.nih.gov
datadryad.org
zip
Updated Jun 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John Borghi; Ana Van Gulick (2020). Data Management and Sharing: Practices and Perceptions of Psychology Researchers [Dataset]. http://doi.org/10.5061/dryad.6wwpzgmw3
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.6wwpzgmw3
Dataset updated
Jun 24, 2020
Dataset provided by
Stanford University
Figshare (United Kingdom)
Authors
John Borghi; Ana Van Gulick
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Research data is increasingly viewed as an important scholarly output. While a growing body of studies have investigated researcher practices and perceptions related to data sharing, information about data-related practices throughout the research process (including data collection and analysis) remains largely anecdotal. Building on our previous study of data practices in neuroimaging research, we conducted a survey of data management practices in the field of psychology. Our survey included questions about the type(s) of data collected, the tools used for data analysis, practices related to data organization, maintaining documentation, backup procedures, and long-term archiving of research materials. Our results demonstrate the complexity of managing and sharing data in psychology. Data is collected in multifarious forms from human participants, analyzed using a range of software tools, and archived in formats that may become obsolete. As individuals, our participants demonstrated relatively good data management practices, however they also indicated that there was little standardization within their research group. Participants generally indicated that they were willing to change their current practices in light of new technologies, opportunities, or requirements.

Methods To investigate the data-related practices of psychology researchers, we adapted a survey developed as part of our previous study of neuroimaging researchers. The survey was distributed via Qualtrics (http://www.qualtrics.com) from January 25 to March 25, 2019. Before beginning the survey, participants were required to verify that they were at least 18 years old and gave their informed consent to participate. Participants who did not meet these inclusion criteria or who did not complete at least the first section of the survey were not included in the final data analysis. After filtering, 274 psychology researchers from 31 countries participated in our survey.

All code for data collection and visualization is included in the Jupyter notebooks included here.
r
SciTran
rrid.site
neuinfo.org
+2more
Updated Nov 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). SciTran [Dataset]. http://identifiers.org/RRID:SCR_013666
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_013666
Dataset updated
Nov 30, 2025
Description
Scientific Transparency (SciTran) is a software project that has grown out of the Project on Scientific Transparency at Stanford University. At the heart of SciTran is a scientific data management system – SDM – designed to enable and foster reproducible research. SciTran SDM delivers efficient and robust archiving, organization, and sharing of scientific data. We have developed the system around neuroimaging data, but our goal is to build a system that is flexible enough to accomodate all types of scientific data – from paper-and-pencil tests to genomics data. SDM will also allow for the sharing of data and computations between remote sites. SciTran is open-source software, released under the MIT license. Our code is hosted on GitHub. Feel free to try it out or to contribute. Commercial support for SciTran SDM is available through our partners at Flywheel. Check out their demo, if you''d like to give SDM a quick try.
s
NeuroPlant
purl.stanford.edu
Updated May 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Emily Fryer; Sujay Guha; Lucero Rogel; Theresa Logan-Garbisch; Hodan Farah; Ehsan Rezaei; Iris Mollhoff; Adam Nekimken; Angela Xu; Lara Selin Seyahi; Sylvia Fechner; Shaul Druckmann; Thomas Clandinin; Seung Rhee; Miriam B. Goodman (2024). NeuroPlant [Dataset]. http://doi.org/10.25740/rh109rs1058
Explore at:
Unique identifier
https://doi.org/10.25740/rh109rs1058
Dataset updated
May 6, 2024
Authors
Emily Fryer; Sujay Guha; Lucero Rogel; Theresa Logan-Garbisch; Hodan Farah; Ehsan Rezaei; Iris Mollhoff; Adam Nekimken; Angela Xu; Lara Selin Seyahi; Sylvia Fechner; Shaul Druckmann; Thomas Clandinin; Seung Rhee; Miriam B. Goodman
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
We developed an efficient screening platform to evaluate chemotactic behavioral responses (attraction/repulsion) in C. elegans when exposed to plant derived small molecules. This platform uses multiwell plates, standard liquid handling equipment, flatbed scanners, a custom data management workflow, and computer-vision based software for data analysis. With this system, we screened 96 conditions (90 test and 6 reference compounds) in triplicate against the N2 wild-type C. elegans strain in 3 experimental days. We identified 37 SMs that evoke detectable responses in chemotaxis assays and advanced these 37 compounds into screens using tax-4, osm-9, and tax-4;osm-9 mutants. Taken together these initial results move us closer to our goal of better understanding the molecular mechanisms of chemosensory signal transduction and the identification of SM/receptor pairs. Overall, this repository contains the raw image files (.tiff), metadata and worm location files (.csv) for all analyzed images associated with the screen described above.
d
Single Nucleotide Polymorphism (SNP) data from Palumbi laboratory at...
dataone.org
bco-dmo.org
+1more
Updated Dec 5, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stephen R. Palumbi; Kristen Milligan (2021). Single Nucleotide Polymorphism (SNP) data from Palumbi laboratory at Stanford University Hopkins Marine Station from 2010 to 2013 (OMEGAS-MaS project) [Dataset]. https://dataone.org/datasets/sha256%3A3b0ddc25af9a7b67ef14c2686705a5f3de681ce39f03f43a19848b1c59a2fc7b
Explore at:
Dataset updated
Dec 5, 2021
Dataset provided by
Biological and Chemical Oceanography Data Management Office (BCO-DMO)
Authors
Stephen R. Palumbi; Kristen Milligan
Description
Single Nucleotide Polymorphism for each of the Strongylocentrotus purpuratus genes from the GLEAN3 genome build.

Data structure: Fractions are allele frequency values. Location, Day samples (D1 or D7), and CO2 level (385 or 1000) are listed.

Data are for each of the S. purpuratus genes from the GLEAN3 genome build.

Parameter Names/Definitions: GLEAN ID: ID code POS: base position Random trails larger than observed value (running record): no units Sample ID code format: State_Site_Tidal Height_: OR: Oregon FC: Fogarty Creek Site ST: Strawberry Hill BD: Bodega Bay NC: Northern California VD: Van Damme CC: Central California SH: Sand Hill Bluff TP: Terrace Point SB: Santa Barbara region AL: Alegria

Resulting Publications:
2013. Pespeni, M. E. Sanford, B. Gaylord, T. M. Hill, J. D. Hosfelt, H. Jaris, M. LaVigne, E. Lenz, A. D. Russell, M. K. Young, S. R. Palumbi^. Evolutionary change during experimental ocean acidification. Published online before print April 8, 2013, http://dx.doi.org/10.1073/pnas.1220673110

Pespeni MH, Barney BT, Palumbi SR (2012) Differences in the regulation of growth and biomineralization genes revealed through long-term common garden acclimation and experimental genomics in the purple sea urchin. Evolution 67(7): 1901–1914. doi:10.1111/evo.12036
Global Youth Tobacco Survey
redivis.com
application/jsonl +7
Updated Feb 21, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford Center for Population Health Sciences (2020). Global Youth Tobacco Survey [Dataset]. http://doi.org/10.57761/8s4w-6z42
Explore at:
arrow, stata, csv, application/jsonl, avro, spss, parquet, sasAvailable download formats
Unique identifier
https://doi.org/10.57761/8s4w-6z42
Dataset updated
Feb 21, 2020
Dataset provided by
Redivis Inc.
Authors
Stanford Center for Population Health Sciences
Description
Abstract

The Global Youth Tobacco Survey (GYTS) is a school-based survey designed to enhance the capacity of countries to monitor tobacco use among youth and to guide the implementation and evaluation of tobacco prevention and control programmes. The information generated from the GYTS can be used to stimulate the development of tobacco control programmes and can serve as a means to assess progress in meeting programme goals. In addition, GYTS data can be used to monitor seven Articles in the WHO FCTC.

Documentation

Please visit GTSSData that houses and displays data from four tobacco-related surveys conducted around the world, including India.

Methodology

In December 1998, TFI convened a meeting in Geneva with the Centers for Disease Control and Prevention (CDC), the United Nations Children’s Fund (UNICEF), the World Bank and representatives from countries in each of the six WHO regions to discuss the need for standardized mechanisms to collect youth tobacco use information on a global basis. The outcome of this meeting was the development by WHO and CDC of a Global Tobacco Surveillance System, which uses the Global Youth Tobacco Survey (GYTS) as its data collection mechanism.

The GYTS uses a standard methodology for constructing the sampling frame, selecting schools and classes, preparing questionnaires, following consistent field procedures, and using consistent data management procedures for data processing and analysis.

GYTS is composed of 56 "core" questions designed to gather data on the following seven domains. The questionnaire also allows countries to insert their own country-specific questions.

Knowledge and attitudes of young people towards cigarette smoking

Prevalence of cigarette smoking and other tobacco use among young people

Role of the media and advertising in young people’s use of cigarettes

Access to cigarettes

Tobacco-related school curriculum

Environmental tobacco smoke (ETS)

Cessation of cigarette smoking

%3C!-- --%3E

Centers for Disease Control and Prevention. Global Youth Tobacco Survey Data. Retrieved on 2020 February 21 from https://nccd.cdc.gov/GTSSDataSurveyResources/Ancillary/DataReports.aspx?CAID=2
f
Data Sheet 1_Surgical management of complicated Stanford type A aortic...
figshare.com
pdf
Updated Aug 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Weibo He; Dongmei Sun; Shiyuan Yao; Yuan Wu; Linglin Fan (2025). Data Sheet 1_Surgical management of complicated Stanford type A aortic dissection with subdural haematoma due to constrictive pericarditis—a case report.pdf [Dataset]. http://doi.org/10.3389/fcvm.2025.1583332.s007
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fcvm.2025.1583332.s007
Dataset updated
Aug 13, 2025
Dataset provided by
Frontiers
Authors
Weibo He; Dongmei Sun; Shiyuan Yao; Yuan Wu; Linglin Fan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Constrictive pericarditis, often resulting from pericardial adhesions secondary to chest radiotherapy or cardiac surgery, generally carries a favorable prognosis when surgical intervention is performed early. In contrast, extensive acute aortic dissection is associated with a significantly higher mortality rate, highlighting the critical importance of timely surgical intervention to improve patient outcomes. Here, we present a case involving a patient with a ruptured Stanford type A aortic dissection complicated by constrictive pericarditis, which led to the formation of a large pseudoaneurysm causing severe compression of the right atrium and markedly increasing the risk of mortality. The patient underwent successful emergency one-stage surgery and was discharged with a favorable recovery. This report uniquely highlights the management of two life-threatening conditions—type A aortic dissection and constrictive pericarditis—in a single surgical procedure, aiming to enhance patient survival and minimize complications. This case not only illustrates a successful life-saving intervention but also provides a strategic framework for cardiac surgeons, encouraging them to address complex, high-risk cases with an integrated approach.
w
Dataset of environmental score (ESG) and governance score (ESG) of companies...
workwithdata.com
Updated May 6, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2025). Dataset of environmental score (ESG) and governance score (ESG) of companies in Stanford [Dataset]. https://www.workwithdata.com/datasets/companies?col=company%2Cwwd_environment_score%2Cwwd_governance_score&f=1&fcol0=city&fop0=%3D&fval0=Stanford
Explore at:
Dataset updated
May 6, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about companies in Stanford. It has 56 rows. It features 3 columns: environmental score (ESG), and governance score (ESG).
Government Technology Solutions Survey
redivis.com
application/jsonl +7
Updated Feb 21, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford University Libraries (2024). Government Technology Solutions Survey [Dataset]. http://doi.org/10.57761/j7vq-mk06
Explore at:
arrow, parquet, avro, stata, sas, spss, csv, application/jsonlAvailable download formats
Unique identifier
https://doi.org/10.57761/j7vq-mk06
Dataset updated
Feb 21, 2024
Dataset provided by
Redivis Inc.
Authors
Stanford University Libraries
Description
Abstract

Technology solutions have become essential to residential services, business development, and other local government priorities. The International City/County Management Association (ICMA), in partnership with OnBase by Hyland, conducted a 2017 survey to assess the information technology and e‐government solutions being used by local governments. The results of this survey are meant to give an overview of the capabilities, priorities, and difficulties local governments have in implementing technology solutions.

Methodology

ICMA’s database of local governments includes approximately 11,000 U.S. municipalities and 2,900 U.S. counties with populations of 2,500 or greater, as well as a majority of municipalities and counties with populations under 2,500 (https://icma.org/survey-research).

This survey was delivered to 1,422 US local governments, addressed to chief administrative officers. The sample was limited to counties with at least 50,000 residents and municipalities with at least 75,000 residents. This yielded 190 responses for a 13.4% response rate.

Usage

Available documentation is contained in zip files labelled by survey year (see

Supporting Files). Documentation will always include the survey instrument; where available, documentation may also include codebooks and response rates.
s
Data from: Full-System Power Analysis and Modeling
purl.stanford.edu
Updated Aug 19, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dimitris Economou; Christos Kozyrakis; Stanford University, Stanford University. Department of Electrical Engineering. (2013). Full-System Power Analysis and Modeling [Dataset]. https://purl.stanford.edu/wp105cx4248
Explore at:
Dataset updated
Aug 19, 2013
Authors
Dimitris Economou; Christos Kozyrakis; Stanford University, Stanford University. Department of Electrical Engineering.
License
Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
License information was derived automatically
Description
The increasing costs of power delivery and cooling, as well as the trend toward higher-density computer systems, have created a growing demand for better power management in server environments. Despite the increasing interest in this issue, little work has been done in quantitatively understanding power consumption trends and developing models to predict full-system power and component power breakdown that are not hardware specific. To quantitatively understand power consumption trends in server systems, we study the component-level power breakdown and variation, as well as temporal workload-specific power consumption of two instrumented systems: a power-optimized blade system and a compute-optimized Itanium-2 system. Using this analysis, we examine the validity of prior ad-hoc approaches to understanding power breakdown and quantify several interesting trends important for power modeling and management. We also introduce Mantis, a non-intrusive method for modeling full-system power consumption and providing real-time power prediction. Mantis is not hardware-specific and uses a one-time calibration phase to generate a model correlating AC power measurements with standard user-level system utilization metrics. We experimentally validate Mantis on the two instrumented systems using a variety of workloads (CPU, memory, or I/O intensive) and hardware configurations (number of processors, power supply and frequency settings). Mantis provides power estimates with high accuracy for both overall and temporal power consumption, making it a valuable non-hardware-specific tool for power-aware scheduling and analysis.
u
Bloom and Van Reenen Management Practices Dataset
datacatalogue.ukdataservice.ac.uk
Updated Mar 11, 2008
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Van Reenen, J., London School of Economics and Political Science, Centre for Economic Performance; Bloom, N., Stanford University (California), Department of Economics (2008). Bloom and Van Reenen Management Practices Dataset [Dataset]. http://doi.org/10.5255/UKDA-SN-5799-1
Explore at:
Unique identifier
https://doi.org/10.5255/UKDA-SN-5799-1
Dataset updated
Mar 11, 2008
Dataset provided by
UK Data Servicehttps://ukdataservice.ac.uk/
Authors
Van Reenen, J., London School of Economics and Political Science, Centre for Economic Performance; Bloom, N., Stanford University (California), Department of Economics
Time period covered
Jan 1, 1994 - Jan 1, 2004
Area covered
United States, United Kingdom, Germany (October 1990-), France
Description
This study comprises the data used for the following published academic journal articles:
Bloom, N. and Van Reenen, J. (2007) 'Measuring and explaining management practices across firms and countries', Quarterly Journal of Economics, 122(4), pp.1351–1408.

The study used an innovative survey tool to collect management practice data from 732 medium sized firms in the USA, France, Germany and the UK. These measures of managerial practice are strongly associated with firm-level productivity, profitability, 'Tobin’s Q' and firm survival rates. Management practices also display significant cross-country differences, with US firms on average better managed than European firms, and significant within-country differences, with a long tail of extremely badly managed firms. The study found that poor management practices are more prevalent when product market competition is weak and/or when family-owned firms pass management control down to the eldest child (primogeniture). For further details, see documentation.

A survey paper based on the same research was published in 2006:
Bloom, N. and Van Reenen, J. (2006) 'Management practice, work-life balance and productivity: a review of some recent evidence', Oxford Review of Economic Policy, 22(4), pp.457-482.

Users of these data are advised to consult the journal articles mentioned above.
Michigan Longitudinal Study
openneuro.org
Updated Mar 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael I. Demidenko; Ryan Klaus; Mary Soules; Mary M. Heitzeg (2024). Michigan Longitudinal Study [Dataset]. http://doi.org/10.18112/openneuro.ds005027.v1.0.0
Explore at:
Unique identifier
https://doi.org/10.18112/openneuro.ds005027.v1.0.0
Dataset updated
Mar 11, 2024
Dataset provided by
OpenNeurohttps://openneuro.org/
Authors
Michael I. Demidenko; Ryan Klaus; Mary Soules; Mary M. Heitzeg
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Area covered
Michigan
Description
Michigan Longitudinal Study: Monetary Incentive Delay Task

Structural: Low and High resolution. Functional: Monetary Incentive Delay task BOLD

Uploaded onto Openn-Neuro by

Michael Demidenko

Provenance

Collected at the University of Michigan under the grant: R01AA012217. Original Owners and curators: Mary Heitzeg, PhD; Data Manager: Mary Soules. Data transferred from University of Michigan to Stanford University November 2022.

Relevant info for data: NIHReporter [Neuropsych]: https://reporter.nih.gov/project-details/8896367 pub: https://doi.org/10.1523/JNEUROSCI.1390-11.2012 pub: https://doi.org/10.1111/j.1530-0277.2007.00605.x pub: https://doi.org/10.1093/scan/nsx021

Access Restrictions

Any questions regarding other data use should be made to Mary Soules and Mary Heitzeg, PhD.

Data notes

Converted with mids2bids by Ryan Klaus and Mary Soules with support from Krisanne Litinas For some subjects MID runs for a session are unavailable/corrupt, so alternative session combo was recommended for use by MLS team.

Command used for fMRIPrep v23.1.4:

singularity run --cleanenv \ -B ${DATA_DIR}:/data:ro -B ${DATA_OUT}:/out ${SIMG_DIR}/fmriprep-23.1.4.simg /data /out participant --skip_bids_validation --participant-label sub-${subj} --fs-license-file ${FS_LICENSE} --fd-spike-threshold .9 --output-space MNI152NLin2009cAsym:res-2 --project-goodvoxels --cifti-output 91k --work-dir ${SCRATCH}

Command used for MRIQC v23.1.0:

singularity run --cleanenv -B $OAK:$OAK -B $OAK/data/templateflow:/opt/templateflow -B ${L_SCRATCH}:/tmp ${GROUP_HOME}/singularity_images/mriqc_23.1.0rc0.simg ${BIDS_DIR} ${OUTPUT_DIR} participant --participant-label ${sub} --ants-nthreads 8 --nprocs 12 --mem_gb 30 -vv --dsname MLS --verbose-reports -w /tmp/work -m bold T1w T2w

Command used for taskdescribe_v1.0:

For each session and run, there is a derivatives folder summarizing the behavioral data in an JSON or PNG image. These were calculated based the associated events.tsv files. The PNG files are meant to be used as a quick summary of the behavioral performance throughout the task/run/session
s
Flood Data: Russian River Basin, California, (Q3) 1996
searchworks.stanford.edu
zip
Updated Nov 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Flood Data: Russian River Basin, California, (Q3) 1996 [Dataset]. https://searchworks.stanford.edu/view/yh376xf2882
Explore at:
zipAvailable download formats
Dataset updated
Nov 16, 2024
Area covered
Russian River, California
Description
This polygon shapefile represents the portion of the Federal Emergenecy Management Q3 Flood Data Management maps within the Russian River basin region of California. Flood Data are derived from the 1:24000 Flood Insurance Rate Maps (FIRMS) published by FEMA.
Alternative Service Delivery Survey
redivis.com
application/jsonl +7
Updated Aug 22, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford University Libraries (2023). Alternative Service Delivery Survey [Dataset]. http://doi.org/10.57761/fbcn-vc40
Explore at:
sas, stata, spss, csv, parquet, arrow, application/jsonl, avroAvailable download formats
Unique identifier
https://doi.org/10.57761/fbcn-vc40
Dataset updated
Aug 22, 2023
Dataset provided by
Redivis Inc.
Authors
Stanford University Libraries
Description
Abstract

This ICMA signature survey, conducted every five years, examines the service delivery choices, practices, and policies of local governments. Survey topics covered include adopting, evaluating private service delivery and obstacles in private service delivery.

Methodology

ICMA’s database of local governments includes approximately 11,000 U.S. municipalities and 2,900 U.S. counties with populations of 2,500 or greater, as well as a majority of municipalities and counties with populations under 2,500 (https://icma.org/survey-research). The survey was distributed to chief administrative officers of local governments.

Usage

Available documentation is contained in zip files labelled by survey year (see

Supporting Files). Documentation will always include the survey instrument; where available, documentation may also include codebooks and response rates.

For all years except 2017, each year's survey data was split across multiple files. Consequently, in Redivis, a single year of data is split into multiple tables. Users can JOIN the tables on shared variables (keys) like IMISID, USTATE, etc.
WONDERBREAD: A Benchmark + Dataset for Business Process Management (BPM)...
zenodo.org
csv, json, zip
Updated Nov 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Michael Wornow; Michael Wornow (2024). WONDERBREAD: A Benchmark + Dataset for Business Process Management (BPM) Tasks [Dataset]. http://doi.org/10.5281/zenodo.14094162
Explore at:
zip, csv, jsonAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14094162
Dataset updated
Nov 12, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Michael Wornow; Michael Wornow
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jul 6, 2024
Description
Paper: WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management Tasks

Background

The WONDERBREAD dataset contains 2,928 human demonstrations of 598 web navigation workflows across 6 types of BPM tasks. These tasks measure the ability of a model to generate accurate documentation, assist in knowledge transfer, and improve the efficiency of workflows.

Please see our website for more details: https://wonderbread.stanford.edu/

Quick Start

To start, download debug_demos.zip (1 GB). It contains a subset of 24 demonstrations which can give you a sense of how the dataset is structured.

To reproduce the paper, download gold_demos.zip (33 GB). It contains 724 demonstrations corresponding to the 162 "Gold" tasks which were used for all the evaluations in the original paper.

To obtain the full dataset, download demos.zip (133 GB). This contains all 2,928 demonstrations and can be used for training, fine-tuning, and evaluating models.

Dataset Structure

The dataset contains several files, defined below.

Raw Data (useful for training/fine-tuning/evaluation)

debug_demos.zip (1 GB) -- a subset of only 24 demonstrations taken from the full dataset. Useful to get a sense of the dataset and for debugging.

gold_demos.zip (22 GB) -- a subset of only 724 demonstrations corresopnding to the 162 "Gold" tasks. This is the dataset that was used for all evaluations in the original WONDERBREAD paper.

demos.zip (133 GB) -- all 2,928 demonstrations across 598 tasks. Useful for training your own models.

Modality-Specific Subsets of Raw Data (useful for specific types of training/fine-tuning/evaluation)

All Demos

demos_sop_only.zip (4 MB)-- only the SOP .txt files for all 2,928 demonstrations

demos_sop_and_trace_only.zip (770 MB)-- only the SOP .txt files and action trace .json files for all 2,928 demonstrations

demos_sop_and_trace_and_screenshots_only.zip (22 GB)-- only the SOP .txt files and action trace .json files and screenshot images for all 2,928 demonstrations

"Gold" Demos

gold_demos_sop_only.zip (1 MB)-- only the SOP .txt files for the 724 demonstrations in the "Gold" tasks.

gold_demos_sop_and_trace_only.zip (190 MB) -- only the SOP .txt files and action trace .json files for the 724 demonstrations in the "Gold" tasks.

gold_demos_sop_and_trace_and_screenshots_only.zip (6 GB) -- only the SOP .txt files and action trace .json files and screenshot images for the 724 demonstrations in the "Gold" tasks

Evaluation (useful for evaluation)

qa_dataset.csv -- contains all 120 questions and ground truth answers used in the "Knowlege Transfer" evaluation.

df_rankings.csv -- contains the rankings of all "Gold" tasks used in the "SOP Ranking" evaluation.

Metadata (can be safely ignored)

Process Mining Task Demonstrations.xlsx -- maps human annotators to specific demonstrations; also contains "Gold" task rankings used in the "SOP Ranking" evaluation.

metadata.json -- maps Google Drive URLs to Google Drive Folder IDs to demonstration names

df_valid.csv -- tracks assets associated with each demonstration
w
Correlation of LinkedIn followers and governance score (ESG) by company in...
workwithdata.com
Updated May 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2025). Correlation of LinkedIn followers and governance score (ESG) by company in Stanford [Dataset]. https://www.workwithdata.com/charts/companies?chart=scatter&f=1&fcol0=city&fop0=%3D&fval0=Stanford&x=wwd_governance_score&y=linkedin_followers
Explore at:
Dataset updated
May 6, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This scatter chart displays LinkedIn followers (followers) against governance score (ESG) (/ 100) in Stanford. The data is about companies.
b
Theoretical dislodgement data for mussels analyzed at Hopkins Marine Station...
bco-dmo.org
search.dataone.org
csv
Updated Jan 15, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr Mark W Denny; Dr Bengt J Allen (2016). Theoretical dislodgement data for mussels analyzed at Hopkins Marine Station in 2014 (Experiments in a Model Ecosystem project) [Dataset]. https://www.bco-dmo.org/dataset/632681
Explore at:
csv(4.10 MB)Available download formats
Dataset updated
Jan 15, 2016
Dataset provided by
Biological and Chemical Data Management Office
Authors
Dr Mark W Denny; Dr Bengt J Allen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
amplif, sample, step_hgt, fraction_dislodged, log10_ReturnTime_yr
Description
This dataset includes the number and attachment sites of byssal threads produced by individual mussels.

Related Reference:

Cole, A. and Denny MW. (2014) United we fail: group versus individual strength in the ribbed mussel Mytilus californianus. Biol. Bull. 227: 61-67.

These data are also available at the Stanford Digital Repository: http://purl.stanford.edu/ph942zz5524

Related Datasets:

mussel size vs. byssal count
mussel size vs. byssal width
mussel byssus tenacity
Municipal form of government (FOG)
redivis.com
stanford.redivis.com
application/jsonl +7
Updated Apr 14, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stanford University Libraries (2023). Municipal form of government (FOG) [Dataset]. http://doi.org/10.57761/0kkg-bc08
Explore at:
spss, parquet, avro, application/jsonl, csv, stata, sas, arrowAvailable download formats
Unique identifier
https://doi.org/10.57761/0kkg-bc08
Dataset updated
Apr 14, 2023
Dataset provided by
Redivis Inc.
Authors
Stanford University Libraries
Description
Abstract

The International City/County Management Association conducts surveys on local government practices and policies, independently and with partners, including Form of Government (FOG) surveys. Form of government refers to the legal structure under which municipalities and counties in the United States organize (for example, the council-manager vs. the mayor-council form of government). The topic also refers to governance issues and how a local government operates. The Municipal Form of Government (FOG) surveys cover form of government, initiatives for referenda and recall, and the selection and composition of elected officials in cities/municipalities. The survey is conducted once every five years.

Methodology

ICMA’s database of local governments includes approximately 11,000 U.S. municipalities and 2,900 U.S. counties with populations of 2,500 or greater, as well as a majority of municipalities and counties with populations under 2,500 (https://icma.org/survey-research). The county or municipal clerk responds to the Form of Government survey.

Usage

Available documentation is contained in zip files labelled by survey year (see Supporting Files). Documentation will always include the survey instrument; where available, documentation may also include codebooks and response rates.
E
Executive Education Program Report
datainsightsmarket.com
doc, pdf, ppt
Updated Jun 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Data Insights Market (2025). Executive Education Program Report [Dataset]. https://www.datainsightsmarket.com/reports/executive-education-program-1987238
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Jun 10, 2025
Dataset authored and provided by
Data Insights Market
License
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global executive education market is a dynamic and rapidly growing sector, driven by the increasing demand for upskilling and reskilling among professionals in a constantly evolving business landscape. While precise market sizing data is unavailable, considering the presence of numerous prestigious universities like Harvard, Stanford, and Wharton, along with a substantial number of other prominent institutions, a reasonable estimate places the 2025 market value at approximately $30 billion. The Compound Annual Growth Rate (CAGR) is expected to remain robust, potentially around 7-8% for the forecast period of 2025-2033, propelled by several key factors. These include the growing adoption of online and blended learning formats, expanding focus on leadership development programs tailored to specific industry needs, and the increasing recognition of executive education as a critical investment for both individual career advancement and organizational success. The market is segmented by program type (e.g., open enrollment, custom programs), industry focus (e.g., finance, technology), and geographic region. Key players are continually innovating their offerings, incorporating new technologies such as virtual reality and gamification to enhance the learning experience and cater to the evolving preferences of participants. The competitive landscape is fiercely competitive, with leading business schools vying for market share through strategic partnerships, curriculum development, and brand recognition. The North American market currently dominates, accounting for a substantial portion of the global revenue, but regions like Asia-Pacific are showing promising growth potential. Key restraints to market growth include the high cost of executive education programs, which may limit accessibility for some participants, and the need for continuous curriculum adaptation to remain relevant in a fast-paced world. However, the long-term outlook for the executive education market remains positive, driven by sustained demand for high-quality leadership and management development across industries globally. Further market penetration is expected in emerging economies, fueled by increased economic activity and a growing middle class.
b
Experimental results describing the tenacity of byssal threads produced by...
bco-dmo.org
search.dataone.org
csv
Updated Jan 14, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr Mark W Denny; Dr Bengt J Allen (2016). Experimental results describing the tenacity of byssal threads produced by mussels analyzed at Hopkins Marine Station during 2014 (Experiments in a Model Ecosystem project) [Dataset]. https://www.bco-dmo.org/dataset/632647
Explore at:
csv(2.25 KB)Available download formats
Dataset updated
Jan 14, 2016
Dataset provided by
Biological and Chemical Data Management Office
Authors
Dr Mark W Denny; Dr Bengt J Allen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
comment, num_muss, tenacity
Description
This dataset includes the number and attachment sites of byssal threads produced by individual mussels.

Related Reference:

Cole, A. and Denny MW. (2014) United we fail: group versus individual strength in the ribbed mussel Mytilus californianus. Biol. Bull. 227: 61-67.

These data are also available at the Stanford Digital Repository: http://purl.stanford.edu/ph942zz5524

Related Datasets:

mussel size vs. byssal count
mussel size vs. byssal width
mussel dislodgement data
b
Data describing individual aquatic respiration rates for limpets in trials...
bco-dmo.org
search.dataone.org
csv
Updated Jan 13, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr Mark W Denny; Dr Bengt J Allen (2016). Data describing individual aquatic respiration rates for limpets in trials conducted at the Hopkins Marine Station in 2013 (Experiments in a Model Ecosystem project) [Dataset]. https://www.bco-dmo.org/dataset/630089
Explore at:
csv(17.45 KB)Available download formats
Dataset updated
Jan 13, 2016
Dataset provided by
Biological and Chemical Data Management Office
Authors
Dr Mark W Denny; Dr Bengt J Allen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
temp, well, limpet, species, rate_raw, r_squared, rate_corr, resid_var, date_trial, rate_blank, and 2 more
Measurement technique
Water Temperature Sensor, Oxygen Sensor
Description
Growth data for animals on experimental plates in the field during 2013 was collected monthly via digital photographs and measured using ImageJ software. Respiration of limpets collected from the field during summer 2013 was measured in air or seawater at a range of temperatures for one hour or two hours.

Related Reference:

Miller, L.P., B.J. Allen, F.A. King, D.R. Chilin, V.M. Reynoso and M.W. Denny (2015). Warm microhabitats drive both increased respiration and growth rates of intertidal consumers. Marine Ecology Progress Series 522: 127-143 doi: http://dx.doi.org/10.3354/meps11117

Related Datasets:

limpet aerial respiration
limpet mass and body volume
limpet R-code and images

These data are also available at the Stanford Digital Repository: https://purl.stanford.edu/mz343tz6255

Facebook

Twitter

Click to copy link

Link copied

Cite

John Borghi; Ana Van Gulick (2020). Data Management and Sharing: Practices and Perceptions of Psychology Researchers [Dataset]. http://doi.org/10.5061/dryad.6wwpzgmw3

Data from: Data Management and Sharing: Practices and Perceptions of Psychology Researchers

Explore at:

zipAvailable download formats

Unique identifier

https://doi.org/10.5061/dryad.6wwpzgmw3

Dataset updated

Jun 24, 2020

Dataset provided by

Stanford University
Figshare (United Kingdom)

Authors

John Borghi; Ana Van Gulick

License

https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

Description

Research data is increasingly viewed as an important scholarly output. While a growing body of studies have investigated researcher practices and perceptions related to data sharing, information about data-related practices throughout the research process (including data collection and analysis) remains largely anecdotal. Building on our previous study of data practices in neuroimaging research, we conducted a survey of data management practices in the field of psychology. Our survey included questions about the type(s) of data collected, the tools used for data analysis, practices related to data organization, maintaining documentation, backup procedures, and long-term archiving of research materials. Our results demonstrate the complexity of managing and sharing data in psychology. Data is collected in multifarious forms from human participants, analyzed using a range of software tools, and archived in formats that may become obsolete. As individuals, our participants demonstrated relatively good data management practices, however they also indicated that there was little standardization within their research group. Participants generally indicated that they were willing to change their current practices in light of new technologies, opportunities, or requirements.

Methods To investigate the data-related practices of psychology researchers, we adapted a survey developed as part of our previous study of neuroimaging researchers. The survey was distributed via Qualtrics (http://www.qualtrics.com) from January 25 to March 25, 2019. Before beginning the survey, participants were required to verify that they were at least 18 years old and gave their informed consent to participate. Participants who did not meet these inclusion criteria or who did not complete at least the first section of the survey were not included in the final data analysis. After filtering, 274 psychology researchers from 31 countries participated in our survey.

All code for data collection and visualization is included in the Jupyter notebooks included here.

Clear search

Close search

Google apps

Main menu

Data from: Data Management and Sharing: Practices and Perceptions of...

SciTran

NeuroPlant

Single Nucleotide Polymorphism (SNP) data from Palumbi laboratory at...

Global Youth Tobacco Survey

Abstract

Documentation

Data Sheet 1_Surgical management of complicated Stanford type A aortic...

Dataset of environmental score (ESG) and governance score (ESG) of companies...

Government Technology Solutions Survey

Abstract

Methodology

Usage

Data from: Full-System Power Analysis and Modeling

Bloom and Van Reenen Management Practices Dataset

Michigan Longitudinal Study

Michigan Longitudinal Study: Monetary Incentive Delay Task

Uploaded onto Openn-Neuro by

Provenance

Access Restrictions

Data notes

Flood Data: Russian River Basin, California, (Q3) 1996

Alternative Service Delivery Survey

Abstract

Methodology

Usage

WONDERBREAD: A Benchmark + Dataset for Business Process Management (BPM)...

Background

Quick Start

Dataset Structure

Correlation of LinkedIn followers and governance score (ESG) by company in...

Theoretical dislodgement data for mussels analyzed at Hopkins Marine Station...

Municipal form of government (FOG)

Abstract

Methodology

Usage

Executive Education Program Report

Experimental results describing the tenacity of byssal threads produced by...

Data describing individual aquatic respiration rates for limpets in trials...

Data from: Data Management and Sharing: Practices and Perceptions of Psychology Researchers