43 datasets found
  1. n

    Data from: Data Management and Sharing: Practices and Perceptions of...

    • data.niaid.nih.gov
    • datadryad.org
    zip
    Updated Jun 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Borghi; Ana Van Gulick (2020). Data Management and Sharing: Practices and Perceptions of Psychology Researchers [Dataset]. http://doi.org/10.5061/dryad.6wwpzgmw3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 24, 2020
    Dataset provided by
    Stanford University
    Figshare (United Kingdom)
    Authors
    John Borghi; Ana Van Gulick
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Research data is increasingly viewed as an important scholarly output. While a growing body of studies have investigated researcher practices and perceptions related to data sharing, information about data-related practices throughout the research process (including data collection and analysis) remains largely anecdotal. Building on our previous study of data practices in neuroimaging research, we conducted a survey of data management practices in the field of psychology. Our survey included questions about the type(s) of data collected, the tools used for data analysis, practices related to data organization, maintaining documentation, backup procedures, and long-term archiving of research materials. Our results demonstrate the complexity of managing and sharing data in psychology. Data is collected in multifarious forms from human participants, analyzed using a range of software tools, and archived in formats that may become obsolete. As individuals, our participants demonstrated relatively good data management practices, however they also indicated that there was little standardization within their research group. Participants generally indicated that they were willing to change their current practices in light of new technologies, opportunities, or requirements.

    Methods To investigate the data-related practices of psychology researchers, we adapted a survey developed as part of our previous study of neuroimaging researchers. The survey was distributed via Qualtrics (http://www.qualtrics.com) from January 25 to March 25, 2019. Before beginning the survey, participants were required to verify that they were at least 18 years old and gave their informed consent to participate. Participants who did not meet these inclusion criteria or who did not complete at least the first section of the survey were not included in the final data analysis. After filtering, 274 psychology researchers from 31 countries participated in our survey.

    All code for data collection and visualization is included in the Jupyter notebooks included here.

  2. r

    SciTran

    • rrid.site
    • neuinfo.org
    • +2more
    Updated Nov 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). SciTran [Dataset]. http://identifiers.org/RRID:SCR_013666
    Explore at:
    Dataset updated
    Nov 30, 2025
    Description

    Scientific Transparency (SciTran) is a software project that has grown out of the Project on Scientific Transparency at Stanford University. At the heart of SciTran is a scientific data management system – SDM – designed to enable and foster reproducible research. SciTran SDM delivers efficient and robust archiving, organization, and sharing of scientific data. We have developed the system around neuroimaging data, but our goal is to build a system that is flexible enough to accomodate all types of scientific data – from paper-and-pencil tests to genomics data. SDM will also allow for the sharing of data and computations between remote sites. SciTran is open-source software, released under the MIT license. Our code is hosted on GitHub. Feel free to try it out or to contribute. Commercial support for SciTran SDM is available through our partners at Flywheel. Check out their demo, if you''d like to give SDM a quick try.

  3. s

    NeuroPlant

    • purl.stanford.edu
    Updated May 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emily Fryer; Sujay Guha; Lucero Rogel; Theresa Logan-Garbisch; Hodan Farah; Ehsan Rezaei; Iris Mollhoff; Adam Nekimken; Angela Xu; Lara Selin Seyahi; Sylvia Fechner; Shaul Druckmann; Thomas Clandinin; Seung Rhee; Miriam B. Goodman (2024). NeuroPlant [Dataset]. http://doi.org/10.25740/rh109rs1058
    Explore at:
    Dataset updated
    May 6, 2024
    Authors
    Emily Fryer; Sujay Guha; Lucero Rogel; Theresa Logan-Garbisch; Hodan Farah; Ehsan Rezaei; Iris Mollhoff; Adam Nekimken; Angela Xu; Lara Selin Seyahi; Sylvia Fechner; Shaul Druckmann; Thomas Clandinin; Seung Rhee; Miriam B. Goodman
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    We developed an efficient screening platform to evaluate chemotactic behavioral responses (attraction/repulsion) in C. elegans when exposed to plant derived small molecules. This platform uses multiwell plates, standard liquid handling equipment, flatbed scanners, a custom data management workflow, and computer-vision based software for data analysis. With this system, we screened 96 conditions (90 test and 6 reference compounds) in triplicate against the N2 wild-type C. elegans strain in 3 experimental days. We identified 37 SMs that evoke detectable responses in chemotaxis assays and advanced these 37 compounds into screens using tax-4, osm-9, and tax-4;osm-9 mutants. Taken together these initial results move us closer to our goal of better understanding the molecular mechanisms of chemosensory signal transduction and the identification of SM/receptor pairs. Overall, this repository contains the raw image files (.tiff), metadata and worm location files (.csv) for all analyzed images associated with the screen described above.

  4. d

    Single Nucleotide Polymorphism (SNP) data from Palumbi laboratory at...

    • dataone.org
    • bco-dmo.org
    • +1more
    Updated Dec 5, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stephen R. Palumbi; Kristen Milligan (2021). Single Nucleotide Polymorphism (SNP) data from Palumbi laboratory at Stanford University Hopkins Marine Station from 2010 to 2013 (OMEGAS-MaS project) [Dataset]. https://dataone.org/datasets/sha256%3A3b0ddc25af9a7b67ef14c2686705a5f3de681ce39f03f43a19848b1c59a2fc7b
    Explore at:
    Dataset updated
    Dec 5, 2021
    Dataset provided by
    Biological and Chemical Oceanography Data Management Office (BCO-DMO)
    Authors
    Stephen R. Palumbi; Kristen Milligan
    Description

    Single Nucleotide Polymorphism for each of the Strongylocentrotus purpuratus genes from the GLEAN3 genome build.

    Data structure: Fractions are allele frequency values. Location, Day samples (D1 or D7), and CO2 level (385 or 1000) are listed.

    Data are for each of the S. purpuratus genes from the GLEAN3 genome build.

    Parameter Names/Definitions:
    GLEAN ID: ID code
    POS: base position
    Random trails larger than observed value (running record): no units
    Sample ID code format: State_Site_Tidal Height_:
      OR: Oregon
      FC: Fogarty Creek Site
      ST: Strawberry Hill
      BD: Bodega Bay
      NC: Northern California
      VD: Van Damme
      CC: Central California
      SH: Sand Hill Bluff
      TP: Terrace Point
      SB: Santa Barbara region
      AL: Alegria
    

    Resulting Publications:
    2013. Pespeni, M. E. Sanford, B. Gaylord, T. M. Hill, J. D. Hosfelt, H. Jaris, M. LaVigne, E. Lenz, A. D. Russell, M. K. Young, S. R. Palumbi. Evolutionary change during experimental ocean acidification. Published online before print April 8, 2013, http://dx.doi.org/10.1073/pnas.1220673110

    Pespeni MH, Barney BT, Palumbi SR (2012) Differences in the regulation of growth and biomineralization genes revealed through long-term common garden acclimation and experimental genomics in the purple sea urchin. Evolution 67(7): 1901–1914. doi:10.1111/evo.12036

  5. Global Youth Tobacco Survey

    • redivis.com
    application/jsonl +7
    Updated Feb 21, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford Center for Population Health Sciences (2020). Global Youth Tobacco Survey [Dataset]. http://doi.org/10.57761/8s4w-6z42
    Explore at:
    arrow, stata, csv, application/jsonl, avro, spss, parquet, sasAvailable download formats
    Dataset updated
    Feb 21, 2020
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford Center for Population Health Sciences
    Description

    Abstract

    The Global Youth Tobacco Survey (GYTS) is a school-based survey designed to enhance the capacity of countries to monitor tobacco use among youth and to guide the implementation and evaluation of tobacco prevention and control programmes. The information generated from the GYTS can be used to stimulate the development of tobacco control programmes and can serve as a means to assess progress in meeting programme goals. In addition, GYTS data can be used to monitor seven Articles in the WHO FCTC.

    Documentation

    Please visit GTSSData that houses and displays data from four tobacco-related surveys conducted around the world, including India.

    Methodology

    In December 1998, TFI convened a meeting in Geneva with the Centers for Disease Control and Prevention (CDC), the United Nations Children’s Fund (UNICEF), the World Bank and representatives from countries in each of the six WHO regions to discuss the need for standardized mechanisms to collect youth tobacco use information on a global basis. The outcome of this meeting was the development by WHO and CDC of a Global Tobacco Surveillance System, which uses the Global Youth Tobacco Survey (GYTS) as its data collection mechanism.

    The GYTS uses a standard methodology for constructing the sampling frame, selecting schools and classes, preparing questionnaires, following consistent field procedures, and using consistent data management procedures for data processing and analysis.

    GYTS is composed of 56 "core" questions designed to gather data on the following seven domains. The questionnaire also allows countries to insert their own country-specific questions.

    • Knowledge and attitudes of young people towards cigarette smoking
    • Prevalence of cigarette smoking and other tobacco use among young people
    • Role of the media and advertising in young people’s use of cigarettes
    • Access to cigarettes
    • Tobacco-related school curriculum
    • Environmental tobacco smoke (ETS)
    • Cessation of cigarette smoking

    %3C!-- --%3E

    Centers for Disease Control and Prevention. Global Youth Tobacco Survey Data. Retrieved on 2020 February 21 from https://nccd.cdc.gov/GTSSDataSurveyResources/Ancillary/DataReports.aspx?CAID=2

  6. f

    Data Sheet 1_Surgical management of complicated Stanford type A aortic...

    • figshare.com
    pdf
    Updated Aug 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weibo He; Dongmei Sun; Shiyuan Yao; Yuan Wu; Linglin Fan (2025). Data Sheet 1_Surgical management of complicated Stanford type A aortic dissection with subdural haematoma due to constrictive pericarditis—a case report.pdf [Dataset]. http://doi.org/10.3389/fcvm.2025.1583332.s007
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Aug 13, 2025
    Dataset provided by
    Frontiers
    Authors
    Weibo He; Dongmei Sun; Shiyuan Yao; Yuan Wu; Linglin Fan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Constrictive pericarditis, often resulting from pericardial adhesions secondary to chest radiotherapy or cardiac surgery, generally carries a favorable prognosis when surgical intervention is performed early. In contrast, extensive acute aortic dissection is associated with a significantly higher mortality rate, highlighting the critical importance of timely surgical intervention to improve patient outcomes. Here, we present a case involving a patient with a ruptured Stanford type A aortic dissection complicated by constrictive pericarditis, which led to the formation of a large pseudoaneurysm causing severe compression of the right atrium and markedly increasing the risk of mortality. The patient underwent successful emergency one-stage surgery and was discharged with a favorable recovery. This report uniquely highlights the management of two life-threatening conditions—type A aortic dissection and constrictive pericarditis—in a single surgical procedure, aiming to enhance patient survival and minimize complications. This case not only illustrates a successful life-saving intervention but also provides a strategic framework for cardiac surgeons, encouraging them to address complex, high-risk cases with an integrated approach.

  7. w

    Dataset of environmental score (ESG) and governance score (ESG) of companies...

    • workwithdata.com
    Updated May 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Dataset of environmental score (ESG) and governance score (ESG) of companies in Stanford [Dataset]. https://www.workwithdata.com/datasets/companies?col=company%2Cwwd_environment_score%2Cwwd_governance_score&f=1&fcol0=city&fop0=%3D&fval0=Stanford
    Explore at:
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about companies in Stanford. It has 56 rows. It features 3 columns: environmental score (ESG), and governance score (ESG).

  8. Government Technology Solutions Survey

    • redivis.com
    application/jsonl +7
    Updated Feb 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford University Libraries (2024). Government Technology Solutions Survey [Dataset]. http://doi.org/10.57761/j7vq-mk06
    Explore at:
    arrow, parquet, avro, stata, sas, spss, csv, application/jsonlAvailable download formats
    Dataset updated
    Feb 21, 2024
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford University Libraries
    Description

    Abstract

    Technology solutions have become essential to residential services, business development, and other local government priorities. The International City/County Management Association (ICMA), in partnership with OnBase by Hyland, conducted a 2017 survey to assess the information technology and e‐government solutions being used by local governments. The results of this survey are meant to give an overview of the capabilities, priorities, and difficulties local governments have in implementing technology solutions.

    Methodology

    ICMA’s database of local governments includes approximately 11,000 U.S. municipalities and 2,900 U.S. counties with populations of 2,500 or greater, as well as a majority of municipalities and counties with populations under 2,500 (https://icma.org/survey-research).

    This survey was delivered to 1,422 US local governments, addressed to chief administrative officers. The sample was limited to counties with at least 50,000 residents and municipalities with at least 75,000 residents. This yielded 190 responses for a 13.4% response rate.

    Usage

    Available documentation is contained in zip files labelled by survey year (see

    Supporting Files). Documentation will always include the survey instrument; where available, documentation may also include codebooks and response rates.

  9. s

    Data from: Full-System Power Analysis and Modeling

    • purl.stanford.edu
    Updated Aug 19, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dimitris Economou; Christos Kozyrakis; Stanford University, Stanford University. Department of Electrical Engineering. (2013). Full-System Power Analysis and Modeling [Dataset]. https://purl.stanford.edu/wp105cx4248
    Explore at:
    Dataset updated
    Aug 19, 2013
    Authors
    Dimitris Economou; Christos Kozyrakis; Stanford University, Stanford University. Department of Electrical Engineering.
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Description

    The increasing costs of power delivery and cooling, as well as the trend toward higher-density computer systems, have created a growing demand for better power management in server environments. Despite the increasing interest in this issue, little work has been done in quantitatively understanding power consumption trends and developing models to predict full-system power and component power breakdown that are not hardware specific. To quantitatively understand power consumption trends in server systems, we study the component-level power breakdown and variation, as well as temporal workload-specific power consumption of two instrumented systems: a power-optimized blade system and a compute-optimized Itanium-2 system. Using this analysis, we examine the validity of prior ad-hoc approaches to understanding power breakdown and quantify several interesting trends important for power modeling and management. We also introduce Mantis, a non-intrusive method for modeling full-system power consumption and providing real-time power prediction. Mantis is not hardware-specific and uses a one-time calibration phase to generate a model correlating AC power measurements with standard user-level system utilization metrics. We experimentally validate Mantis on the two instrumented systems using a variety of workloads (CPU, memory, or I/O intensive) and hardware configurations (number of processors, power supply and frequency settings). Mantis provides power estimates with high accuracy for both overall and temporal power consumption, making it a valuable non-hardware-specific tool for power-aware scheduling and analysis.

  10. u

    Bloom and Van Reenen Management Practices Dataset

    • datacatalogue.ukdataservice.ac.uk
    Updated Mar 11, 2008
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Van Reenen, J., London School of Economics and Political Science, Centre for Economic Performance; Bloom, N., Stanford University (California), Department of Economics (2008). Bloom and Van Reenen Management Practices Dataset [Dataset]. http://doi.org/10.5255/UKDA-SN-5799-1
    Explore at:
    Dataset updated
    Mar 11, 2008
    Dataset provided by
    UK Data Servicehttps://ukdataservice.ac.uk/
    Authors
    Van Reenen, J., London School of Economics and Political Science, Centre for Economic Performance; Bloom, N., Stanford University (California), Department of Economics
    Time period covered
    Jan 1, 1994 - Jan 1, 2004
    Area covered
    United States, United Kingdom, Germany (October 1990-), France
    Description

    This study comprises the data used for the following published academic journal articles:
    Bloom, N. and Van Reenen, J. (2007) 'Measuring and explaining management practices across firms and countries', Quarterly Journal of Economics, 122(4), pp.1351–1408.

    The study used an innovative survey tool to collect management practice data from 732 medium sized firms in the USA, France, Germany and the UK. These measures of managerial practice are strongly associated with firm-level productivity, profitability, 'Tobin’s Q' and firm survival rates. Management practices also display significant cross-country differences, with US firms on average better managed than European firms, and significant within-country differences, with a long tail of extremely badly managed firms. The study found that poor management practices are more prevalent when product market competition is weak and/or when family-owned firms pass management control down to the eldest child (primogeniture). For further details, see documentation.

    A survey paper based on the same research was published in 2006:
    Bloom, N. and Van Reenen, J. (2006) 'Management practice, work-life balance and productivity: a review of some recent evidence', Oxford Review of Economic Policy, 22(4), pp.457-482.

    Users of these data are advised to consult the journal articles mentioned above.

  11. Michigan Longitudinal Study

    • openneuro.org
    Updated Mar 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael I. Demidenko; Ryan Klaus; Mary Soules; Mary M. Heitzeg (2024). Michigan Longitudinal Study [Dataset]. http://doi.org/10.18112/openneuro.ds005027.v1.0.0
    Explore at:
    Dataset updated
    Mar 11, 2024
    Dataset provided by
    OpenNeurohttps://openneuro.org/
    Authors
    Michael I. Demidenko; Ryan Klaus; Mary Soules; Mary M. Heitzeg
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Michigan
    Description

    Michigan Longitudinal Study: Monetary Incentive Delay Task

    Structural: Low and High resolution. Functional: Monetary Incentive Delay task BOLD

    Uploaded onto Openn-Neuro by

    Michael Demidenko

    Provenance

    Collected at the University of Michigan under the grant: R01AA012217. Original Owners and curators: Mary Heitzeg, PhD; Data Manager: Mary Soules. Data transferred from University of Michigan to Stanford University November 2022.

    Relevant info for data: NIHReporter [Neuropsych]: https://reporter.nih.gov/project-details/8896367 pub: https://doi.org/10.1523/JNEUROSCI.1390-11.2012 pub: https://doi.org/10.1111/j.1530-0277.2007.00605.x pub: https://doi.org/10.1093/scan/nsx021

    Access Restrictions

    Any questions regarding other data use should be made to Mary Soules and Mary Heitzeg, PhD.

    Data notes

    Converted with mids2bids by Ryan Klaus and Mary Soules with support from Krisanne Litinas For some subjects MID runs for a session are unavailable/corrupt, so alternative session combo was recommended for use by MLS team.

    Command used for fMRIPrep v23.1.4:

    singularity run --cleanenv \
        -B ${DATA_DIR}:/data:ro -B ${DATA_OUT}:/out 
        ${SIMG_DIR}/fmriprep-23.1.4.simg 
        /data /out participant 
        --skip_bids_validation
        --participant-label sub-${subj} 
        --fs-license-file ${FS_LICENSE} 
        --fd-spike-threshold .9 
        --output-space MNI152NLin2009cAsym:res-2 
        --project-goodvoxels 
        --cifti-output 91k 
        --work-dir ${SCRATCH}
    

    Command used for MRIQC v23.1.0:

    singularity run --cleanenv
        -B $OAK:$OAK 
        -B $OAK/data/templateflow:/opt/templateflow 
        -B ${L_SCRATCH}:/tmp ${GROUP_HOME}/singularity_images/mriqc_23.1.0rc0.simg 
        ${BIDS_DIR} ${OUTPUT_DIR} participant 
        --participant-label ${sub} 
        --ants-nthreads 8 
        --nprocs 12 
        --mem_gb 30 
        -vv --dsname MLS 
        --verbose-reports 
        -w /tmp/work -m bold T1w T2w
    

    Command used for taskdescribe_v1.0:

    For each session and run, there is a derivatives folder summarizing the behavioral data in an JSON or PNG image. These were calculated based the associated events.tsv files. The PNG files are meant to be used as a quick summary of the behavioral performance throughout the task/run/session

  12. s

    Flood Data: Russian River Basin, California, (Q3) 1996

    • searchworks.stanford.edu
    zip
    Updated Nov 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Flood Data: Russian River Basin, California, (Q3) 1996 [Dataset]. https://searchworks.stanford.edu/view/yh376xf2882
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 16, 2024
    Area covered
    Russian River, California
    Description

    This polygon shapefile represents the portion of the Federal Emergenecy Management Q3 Flood Data Management maps within the Russian River basin region of California. Flood Data are derived from the 1:24000 Flood Insurance Rate Maps (FIRMS) published by FEMA.

  13. Alternative Service Delivery Survey

    • redivis.com
    application/jsonl +7
    Updated Aug 22, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford University Libraries (2023). Alternative Service Delivery Survey [Dataset]. http://doi.org/10.57761/fbcn-vc40
    Explore at:
    sas, stata, spss, csv, parquet, arrow, application/jsonl, avroAvailable download formats
    Dataset updated
    Aug 22, 2023
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford University Libraries
    Description

    Abstract

    This ICMA signature survey, conducted every five years, examines the service delivery choices, practices, and policies of local governments. Survey topics covered include adopting, evaluating private service delivery and obstacles in private service delivery.

    Methodology

    ICMA’s database of local governments includes approximately 11,000 U.S. municipalities and 2,900 U.S. counties with populations of 2,500 or greater, as well as a majority of municipalities and counties with populations under 2,500 (https://icma.org/survey-research). The survey was distributed to chief administrative officers of local governments.

    Usage

    Available documentation is contained in zip files labelled by survey year (see

    Supporting Files). Documentation will always include the survey instrument; where available, documentation may also include codebooks and response rates.

    For all years except 2017, each year's survey data was split across multiple files. Consequently, in Redivis, a single year of data is split into multiple tables. Users can JOIN the tables on shared variables (keys) like IMISID, USTATE, etc.

  14. WONDERBREAD: A Benchmark + Dataset for Business Process Management (BPM)...

    • zenodo.org
    csv, json, zip
    Updated Nov 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Wornow; Michael Wornow (2024). WONDERBREAD: A Benchmark + Dataset for Business Process Management (BPM) Tasks [Dataset]. http://doi.org/10.5281/zenodo.14094162
    Explore at:
    zip, csv, jsonAvailable download formats
    Dataset updated
    Nov 12, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Michael Wornow; Michael Wornow
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jul 6, 2024
    Description

    Paper: WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management Tasks

    Background

    The WONDERBREAD dataset contains 2,928 human demonstrations of 598 web navigation workflows across 6 types of BPM tasks. These tasks measure the ability of a model to generate accurate documentation, assist in knowledge transfer, and improve the efficiency of workflows.

    Please see our website for more details: https://wonderbread.stanford.edu/

    Quick Start

    To start, download debug_demos.zip (1 GB). It contains a subset of 24 demonstrations which can give you a sense of how the dataset is structured.

    To reproduce the paper, download gold_demos.zip (33 GB). It contains 724 demonstrations corresponding to the 162 "Gold" tasks which were used for all the evaluations in the original paper.

    To obtain the full dataset, download demos.zip (133 GB). This contains all 2,928 demonstrations and can be used for training, fine-tuning, and evaluating models.

    Dataset Structure

    The dataset contains several files, defined below.

    1. Raw Data (useful for training/fine-tuning/evaluation)
      1. debug_demos.zip (1 GB) -- a subset of only 24 demonstrations taken from the full dataset. Useful to get a sense of the dataset and for debugging.
      2. gold_demos.zip (22 GB) -- a subset of only 724 demonstrations corresopnding to the 162 "Gold" tasks. This is the dataset that was used for all evaluations in the original WONDERBREAD paper.
      3. demos.zip (133 GB) -- all 2,928 demonstrations across 598 tasks. Useful for training your own models.
    2. Modality-Specific Subsets of Raw Data (useful for specific types of training/fine-tuning/evaluation)
      1. All Demos
        1. demos_sop_only.zip (4 MB)-- only the SOP .txt files for all 2,928 demonstrations
        2. demos_sop_and_trace_only.zip (770 MB)-- only the SOP .txt files and action trace .json files for all 2,928 demonstrations
        3. demos_sop_and_trace_and_screenshots_only.zip (22 GB)-- only the SOP .txt files and action trace .json files and screenshot images for all 2,928 demonstrations
      2. "Gold" Demos
        1. gold_demos_sop_only.zip (1 MB)-- only the SOP .txt files for the 724 demonstrations in the "Gold" tasks.
        2. gold_demos_sop_and_trace_only.zip (190 MB) -- only the SOP .txt files and action trace .json files for the 724 demonstrations in the "Gold" tasks.
        3. gold_demos_sop_and_trace_and_screenshots_only.zip (6 GB) -- only the SOP .txt files and action trace .json files and screenshot images for the 724 demonstrations in the "Gold" tasks
    3. Evaluation (useful for evaluation)
      1. qa_dataset.csv -- contains all 120 questions and ground truth answers used in the "Knowlege Transfer" evaluation.
      2. df_rankings.csv -- contains the rankings of all "Gold" tasks used in the "SOP Ranking" evaluation.
    4. Metadata (can be safely ignored)
      1. Process Mining Task Demonstrations.xlsx -- maps human annotators to specific demonstrations; also contains "Gold" task rankings used in the "SOP Ranking" evaluation.
      2. metadata.json -- maps Google Drive URLs to Google Drive Folder IDs to demonstration names
      3. df_valid.csv -- tracks assets associated with each demonstration
  15. w

    Correlation of LinkedIn followers and governance score (ESG) by company in...

    • workwithdata.com
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Correlation of LinkedIn followers and governance score (ESG) by company in Stanford [Dataset]. https://www.workwithdata.com/charts/companies?chart=scatter&f=1&fcol0=city&fop0=%3D&fval0=Stanford&x=wwd_governance_score&y=linkedin_followers
    Explore at:
    Dataset updated
    May 6, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This scatter chart displays LinkedIn followers (followers) against governance score (ESG) (/ 100) in Stanford. The data is about companies.

  16. b

    Theoretical dislodgement data for mussels analyzed at Hopkins Marine Station...

    • bco-dmo.org
    • search.dataone.org
    csv
    Updated Jan 15, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Mark W Denny; Dr Bengt J Allen (2016). Theoretical dislodgement data for mussels analyzed at Hopkins Marine Station in 2014 (Experiments in a Model Ecosystem project) [Dataset]. https://www.bco-dmo.org/dataset/632681
    Explore at:
    csv(4.10 MB)Available download formats
    Dataset updated
    Jan 15, 2016
    Dataset provided by
    Biological and Chemical Data Management Office
    Authors
    Dr Mark W Denny; Dr Bengt J Allen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    amplif, sample, step_hgt, fraction_dislodged, log10_ReturnTime_yr
    Description

    This dataset includes the number and attachment sites of byssal threads produced by individual mussels.

    Related Reference:

    Cole, A. and Denny MW. (2014) United we fail: group versus individual strength in the ribbed mussel Mytilus californianus. Biol. Bull. 227: 61-67.

    These data are also available at the Stanford Digital Repository: http://purl.stanford.edu/ph942zz5524

    Related Datasets:

    mussel size vs. byssal count
    mussel size vs. byssal width
    mussel byssus tenacity

  17. Municipal form of government (FOG)

    • redivis.com
    • stanford.redivis.com
    application/jsonl +7
    Updated Apr 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stanford University Libraries (2023). Municipal form of government (FOG) [Dataset]. http://doi.org/10.57761/0kkg-bc08
    Explore at:
    spss, parquet, avro, application/jsonl, csv, stata, sas, arrowAvailable download formats
    Dataset updated
    Apr 14, 2023
    Dataset provided by
    Redivis Inc.
    Authors
    Stanford University Libraries
    Description

    Abstract

    The International City/County Management Association conducts surveys on local government practices and policies, independently and with partners, including Form of Government (FOG) surveys. Form of government refers to the legal structure under which municipalities and counties in the United States organize (for example, the council-manager vs. the mayor-council form of government). The topic also refers to governance issues and how a local government operates. The Municipal Form of Government (FOG) surveys cover form of government, initiatives for referenda and recall, and the selection and composition of elected officials in cities/municipalities. The survey is conducted once every five years.

    Methodology

    ICMA’s database of local governments includes approximately 11,000 U.S. municipalities and 2,900 U.S. counties with populations of 2,500 or greater, as well as a majority of municipalities and counties with populations under 2,500 (https://icma.org/survey-research). The county or municipal clerk responds to the Form of Government survey.

    Usage

    Available documentation is contained in zip files labelled by survey year (see Supporting Files). Documentation will always include the survey instrument; where available, documentation may also include codebooks and response rates.

  18. E

    Executive Education Program Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jun 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Executive Education Program Report [Dataset]. https://www.datainsightsmarket.com/reports/executive-education-program-1987238
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Jun 10, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global executive education market is a dynamic and rapidly growing sector, driven by the increasing demand for upskilling and reskilling among professionals in a constantly evolving business landscape. While precise market sizing data is unavailable, considering the presence of numerous prestigious universities like Harvard, Stanford, and Wharton, along with a substantial number of other prominent institutions, a reasonable estimate places the 2025 market value at approximately $30 billion. The Compound Annual Growth Rate (CAGR) is expected to remain robust, potentially around 7-8% for the forecast period of 2025-2033, propelled by several key factors. These include the growing adoption of online and blended learning formats, expanding focus on leadership development programs tailored to specific industry needs, and the increasing recognition of executive education as a critical investment for both individual career advancement and organizational success. The market is segmented by program type (e.g., open enrollment, custom programs), industry focus (e.g., finance, technology), and geographic region. Key players are continually innovating their offerings, incorporating new technologies such as virtual reality and gamification to enhance the learning experience and cater to the evolving preferences of participants. The competitive landscape is fiercely competitive, with leading business schools vying for market share through strategic partnerships, curriculum development, and brand recognition. The North American market currently dominates, accounting for a substantial portion of the global revenue, but regions like Asia-Pacific are showing promising growth potential. Key restraints to market growth include the high cost of executive education programs, which may limit accessibility for some participants, and the need for continuous curriculum adaptation to remain relevant in a fast-paced world. However, the long-term outlook for the executive education market remains positive, driven by sustained demand for high-quality leadership and management development across industries globally. Further market penetration is expected in emerging economies, fueled by increased economic activity and a growing middle class.

  19. b

    Experimental results describing the tenacity of byssal threads produced by...

    • bco-dmo.org
    • search.dataone.org
    csv
    Updated Jan 14, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Mark W Denny; Dr Bengt J Allen (2016). Experimental results describing the tenacity of byssal threads produced by mussels analyzed at Hopkins Marine Station during 2014 (Experiments in a Model Ecosystem project) [Dataset]. https://www.bco-dmo.org/dataset/632647
    Explore at:
    csv(2.25 KB)Available download formats
    Dataset updated
    Jan 14, 2016
    Dataset provided by
    Biological and Chemical Data Management Office
    Authors
    Dr Mark W Denny; Dr Bengt J Allen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    comment, num_muss, tenacity
    Description

    This dataset includes the number and attachment sites of byssal threads produced by individual mussels.

    Related Reference:

    Cole, A. and Denny MW. (2014) United we fail: group versus individual strength in the ribbed mussel Mytilus californianus. Biol. Bull. 227: 61-67.

    These data are also available at the Stanford Digital Repository: http://purl.stanford.edu/ph942zz5524

    Related Datasets:

    mussel size vs. byssal count
    mussel size vs. byssal width
    mussel dislodgement data

  20. b

    Data describing individual aquatic respiration rates for limpets in trials...

    • bco-dmo.org
    • search.dataone.org
    csv
    Updated Jan 13, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Mark W Denny; Dr Bengt J Allen (2016). Data describing individual aquatic respiration rates for limpets in trials conducted at the Hopkins Marine Station in 2013 (Experiments in a Model Ecosystem project) [Dataset]. https://www.bco-dmo.org/dataset/630089
    Explore at:
    csv(17.45 KB)Available download formats
    Dataset updated
    Jan 13, 2016
    Dataset provided by
    Biological and Chemical Data Management Office
    Authors
    Dr Mark W Denny; Dr Bengt J Allen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    temp, well, limpet, species, rate_raw, r_squared, rate_corr, resid_var, date_trial, rate_blank, and 2 more
    Measurement technique
    Water Temperature Sensor, Oxygen Sensor
    Description

    Growth data for animals on experimental plates in the field during 2013 was collected monthly via digital photographs and measured using ImageJ software. Respiration of limpets collected from the field during summer 2013 was measured in air or seawater at a range of temperatures for one hour or two hours.

    Related Reference:

    Miller, L.P., B.J. Allen, F.A. King, D.R. Chilin, V.M. Reynoso and M.W. Denny (2015). Warm microhabitats drive both increased respiration and growth rates of intertidal consumers. Marine Ecology Progress Series 522: 127-143 doi: http://dx.doi.org/10.3354/meps11117

    Related Datasets:

    limpet aerial respiration
    limpet mass and body volume
    limpet R-code and images

    These data are also available at the Stanford Digital Repository: https://purl.stanford.edu/mz343tz6255

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
John Borghi; Ana Van Gulick (2020). Data Management and Sharing: Practices and Perceptions of Psychology Researchers [Dataset]. http://doi.org/10.5061/dryad.6wwpzgmw3

Data from: Data Management and Sharing: Practices and Perceptions of Psychology Researchers

Related Article
Explore at:
zipAvailable download formats
Dataset updated
Jun 24, 2020
Dataset provided by
Stanford University
Figshare (United Kingdom)
Authors
John Borghi; Ana Van Gulick
License

https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

Description

Research data is increasingly viewed as an important scholarly output. While a growing body of studies have investigated researcher practices and perceptions related to data sharing, information about data-related practices throughout the research process (including data collection and analysis) remains largely anecdotal. Building on our previous study of data practices in neuroimaging research, we conducted a survey of data management practices in the field of psychology. Our survey included questions about the type(s) of data collected, the tools used for data analysis, practices related to data organization, maintaining documentation, backup procedures, and long-term archiving of research materials. Our results demonstrate the complexity of managing and sharing data in psychology. Data is collected in multifarious forms from human participants, analyzed using a range of software tools, and archived in formats that may become obsolete. As individuals, our participants demonstrated relatively good data management practices, however they also indicated that there was little standardization within their research group. Participants generally indicated that they were willing to change their current practices in light of new technologies, opportunities, or requirements.

Methods To investigate the data-related practices of psychology researchers, we adapted a survey developed as part of our previous study of neuroimaging researchers. The survey was distributed via Qualtrics (http://www.qualtrics.com) from January 25 to March 25, 2019. Before beginning the survey, participants were required to verify that they were at least 18 years old and gave their informed consent to participate. Participants who did not meet these inclusion criteria or who did not complete at least the first section of the survey were not included in the final data analysis. After filtering, 274 psychology researchers from 31 countries participated in our survey.

All code for data collection and visualization is included in the Jupyter notebooks included here.

Search
Clear search
Close search
Google apps
Main menu