3 datasets found
  1.

    The Institute for Research on Innovation & Science (IRIS) UMETRICS 2016Q3a...

    • datasearch.gesis.org
    • openicpsr.org
    Updated May 8, 2017
    Cite
    Owen-Smith, Jason; Lane, Julia; Weinberg, Bruce; Jarmin, Ron; McFadden Allen, Barbara; Evans, James (2017). The Institute for Research on Innovation & Science (IRIS) UMETRICS 2016Q3a Data Release [Dataset]. http://doi.org/10.3886/E100605V3
    Dataset provided by
    da|ra (Registration agency for social science and economic data)
    Authors
    Owen-Smith, Jason; Lane, Julia; Weinberg, Bruce; Jarmin, Ron; McFadden Allen, Barbara; Evans, James
    Description

    The UMETRICS 2016Q3a dataset comprises two collections. The first collection includes core files in which researchers will find university financial and personnel administrative data pertaining to sponsored project expenditures at IRIS member universities during a given year. UMETRICS core files are based on administrative data drawn directly from the sponsored projects, procurement, and human resources data systems on each IRIS member university’s campus. Individual campus files are de-identified, cleaned, and aggregated by IRIS to produce these core files. The core files include university data on sponsored project awards, direct cost wage payments from awards to employees, purchases of goods and services from vendors, and subaward transactions to subcontractors. Additional files provide supporting information to characterize and describe IRIS member institutions, identify sub-university units responsible for particular grants, and provide additional detail on object codes included by some data providers.

    In addition to core files, we are releasing crosswalk files linking UMETRICS data to external datasets at the individual and award level. In the 2016Q3a release we include match tables that: (i) link individual UMETRICS research employees to dissertation data (with a focus on dissertation topics) provided by ProQuest, and (ii) link federal awards from the National Institutes of Health (NIH), National Science Foundation (NSF) and U.S. Department of Agriculture (USDA) to detailed information about the content of grants. This documentation includes details about the data as well as the matching process. The data release includes code and original data files to allow replication and improvement of matching procedures by research users.

  2.

    Measurement Using Linked SED and UMETRICS Data

    • explore.openaire.eu
    Updated Apr 15, 2022
    Cite
    Ekaterina Levitskaya; Brian Kim; Maryah Garner; Rukhshan Mian; Benjamin Feder; Allison Nunez (2022). Measurement Using Linked SED and UMETRICS Data [Dataset]. http://doi.org/10.5281/zenodo.6463886
    Authors
    Ekaterina Levitskaya; Brian Kim; Maryah Garner; Rukhshan Mian; Benjamin Feder; Allison Nunez
    Description

    This Jupyter notebook explores linked data from the Survey of Earned Doctorates (SED) and Universities: Measuring the Impacts of Research on Innovation, Competitiveness, and Science (UMETRICS) to get a better sense of how these two data sources might be used together. It also prompts participants to think critically about what exactly is being measured and how missingness in the data should be interpreted. This notebook was developed for the Fall 2021 Applied Data Analytics training facilitated by the National Center for Science and Engineering Statistics (NCSES) and the Coleridge Initiative.

  3.

    Supplemental Notebook for Unsupervised Machine Learning Using Linked SED and...

    • explore.openaire.eu
    Updated Apr 15, 2022
    Cite
    Benjamin Feder; Brian Kim; Ekaterina Levitskaya; Allison Nunez (2022). Supplemental Notebook for Unsupervised Machine Learning Using Linked SED and UMETRICS Data [Dataset]. http://doi.org/10.5281/zenodo.6463918
    Authors
    Benjamin Feder; Brian Kim; Ekaterina Levitskaya; Allison Nunez
    Description

    This Jupyter notebook introduces unsupervised machine learning through the lens of clustering. It demonstrates how k-means clustering can be used to better understand the types of PhD students based on funding history, using linked data from the Survey of Earned Doctorates (SED) and Universities: Measuring the Impacts of Research on Innovation, Competitiveness, and Science (UMETRICS). This supplemental notebook was developed for the Fall 2021 Applied Data Analytics training facilitated by the National Center for Science and Engineering Statistics (NCSES) and the Coleridge Initiative.

