3 datasets found
  1. Data from: "A guide to using GitHub for developing and versioning data...

    • osti.gov
    • dataone.org
    • +1more
    Updated Jan 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Agarwal, Deborah A.; Bond-Lamberty, Ben; Boye, Kristin; Burrus, Madison; Cholia, Shreyas; Crow, Michael; Crystal-Ornelas, Robert; Damerow, Joan; Devarakonda, Ranjeet; Ely, Kim S.; Goldman, Amy; Heinz, Susan; Hendrix, Valerie; Kakalia, Zarine; Pennington, Stephanie; Robles, Emily; Rogers, Alistair; Simmonds, Maegen; Varadharajan, Charuleka; Velliquette, Terri; Weierbach, Helen; Weisenhorn, Pamela; Welch, Jessica N. (2021). Data from: "A guide to using GitHub for developing and versioning data standards and reporting formats" [Dataset]. https://www.osti.gov/dataexplorer/biblio/dataset/1780565
    Explore at:
    Dataset updated
    Jan 1, 2021
    Dataset provided by
    United States Department of Energyhttp://energy.gov/
    Environmental System Science Data Infrastructure for a Virtual Ecosystem; Environmental Systems Science Data Infrastructure for a Virtual Ecosystem
    Authors
    Agarwal, Deborah A.; Bond-Lamberty, Ben; Boye, Kristin; Burrus, Madison; Cholia, Shreyas; Crow, Michael; Crystal-Ornelas, Robert; Damerow, Joan; Devarakonda, Ranjeet; Ely, Kim S.; Goldman, Amy; Heinz, Susan; Hendrix, Valerie; Kakalia, Zarine; Pennington, Stephanie; Robles, Emily; Rogers, Alistair; Simmonds, Maegen; Varadharajan, Charuleka; Velliquette, Terri; Weierbach, Helen; Weisenhorn, Pamela; Welch, Jessica N.
    Description

    These data are the results of a systematic review that investigated how data standards and reporting formats are documented on the version control platform GitHub. Our systematic review identified 32 data standards in earth science, environmental science, and ecology that use GitHub for version control of data standard documents. In our analysis, we characterized the documents and content within each of the 32 GitHub repositories to identify common practices for groups that version control their documents on GitHub.In this data package, there are 8 CSV files that contain data that we characterized from each repository, according to the location within the repository. For example, in 'readme_pages.csv' we characterize the content that appears across the 32 GitHub repositories included in our systematic review. Each of the 8 CSV files has an associated data dictionary file (names appended with '_dd.csv' and here we describe each content category within CSV files.There is one file-level metadata file (flmd.csv) that provides a description of each file within the data package.

  2. g

    Data Engineering Services

    • g-atai.com
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Data Engineering Services [Dataset]. https://www.g-atai.com/
    Explore at:
    Dataset updated
    Nov 24, 2025
    Description

    Data collection, cleaning, annotation, augmentation, and version control for machine learning pipelines.

  3. m

    Data for "Best Practices for Your Exploratory Factor Analysis: a Factor...

    • data.mendeley.com
    Updated Aug 17, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pablo Rogers (2021). Data for "Best Practices for Your Exploratory Factor Analysis: a Factor Tutorial" published by RAC-Revista de Administração Contemporânea [Dataset]. http://doi.org/10.17632/rdky78bk8r.2
    Explore at:
    Dataset updated
    Aug 17, 2021
    Authors
    Pablo Rogers
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains material related to the analysis performed in the article "Best Practices for Your Exploratory Factor Analysis: a Factor Tutorial". The material includes the data used in the analyses in .dat format, the labels (.txt) of the variables used in the Factor software, the outputs (.txt) evaluated in the article, and videos (.mp4 with English subtitles) recorded for the purpose of explaining the article. The videos can also be accessed in the following playlist: https://youtube.com/playlist?list=PLln41V0OsLHbSlYcDszn2PoTSiAwV5Oda. Below is a summary of the article:

    "Exploratory Factor Analysis (EFA) is one of the statistical methods most widely used in Administration, however, its current practice coexists with rules of thumb and heuristics given half a century ago. The purpose of this article is to present the best practices and recent recommendations for a typical EFA in Administration through a practical solution accessible to researchers. In this sense, in addition to discussing current practices versus recommended practices, a tutorial with real data on Factor is illustrated, a software that is still little known in the Administration area, but freeware, easy to use (point and click) and powerful. The step-by-step illustrated in the article, in addition to the discussions raised and an additional example, is also available in the format of tutorial videos. Through the proposed didactic methodology (article-tutorial + video-tutorial), we encourage researchers/methodologists who have mastered a particular technique to do the same. Specifically, about EFA, we hope that the presentation of the Factor software, as a first solution, can transcend the current outdated rules of thumb and heuristics, by making best practices accessible to Administration researchers".

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Agarwal, Deborah A.; Bond-Lamberty, Ben; Boye, Kristin; Burrus, Madison; Cholia, Shreyas; Crow, Michael; Crystal-Ornelas, Robert; Damerow, Joan; Devarakonda, Ranjeet; Ely, Kim S.; Goldman, Amy; Heinz, Susan; Hendrix, Valerie; Kakalia, Zarine; Pennington, Stephanie; Robles, Emily; Rogers, Alistair; Simmonds, Maegen; Varadharajan, Charuleka; Velliquette, Terri; Weierbach, Helen; Weisenhorn, Pamela; Welch, Jessica N. (2021). Data from: "A guide to using GitHub for developing and versioning data standards and reporting formats" [Dataset]. https://www.osti.gov/dataexplorer/biblio/dataset/1780565
Organization logo

Data from: "A guide to using GitHub for developing and versioning data standards and reporting formats"

Related Article
Explore at:
Dataset updated
Jan 1, 2021
Dataset provided by
United States Department of Energyhttp://energy.gov/
Environmental System Science Data Infrastructure for a Virtual Ecosystem; Environmental Systems Science Data Infrastructure for a Virtual Ecosystem
Authors
Agarwal, Deborah A.; Bond-Lamberty, Ben; Boye, Kristin; Burrus, Madison; Cholia, Shreyas; Crow, Michael; Crystal-Ornelas, Robert; Damerow, Joan; Devarakonda, Ranjeet; Ely, Kim S.; Goldman, Amy; Heinz, Susan; Hendrix, Valerie; Kakalia, Zarine; Pennington, Stephanie; Robles, Emily; Rogers, Alistair; Simmonds, Maegen; Varadharajan, Charuleka; Velliquette, Terri; Weierbach, Helen; Weisenhorn, Pamela; Welch, Jessica N.
Description

These data are the results of a systematic review that investigated how data standards and reporting formats are documented on the version control platform GitHub. Our systematic review identified 32 data standards in earth science, environmental science, and ecology that use GitHub for version control of data standard documents. In our analysis, we characterized the documents and content within each of the 32 GitHub repositories to identify common practices for groups that version control their documents on GitHub.In this data package, there are 8 CSV files that contain data that we characterized from each repository, according to the location within the repository. For example, in 'readme_pages.csv' we characterize the content that appears across the 32 GitHub repositories included in our systematic review. Each of the 8 CSV files has an associated data dictionary file (names appended with '_dd.csv' and here we describe each content category within CSV files.There is one file-level metadata file (flmd.csv) that provides a description of each file within the data package.

Search
Clear search
Close search
Google apps
Main menu