100+ datasets found
  1. NIH Data Sharing Repositories

    • catalog.data.gov
    • data.virginia.gov
    • +1more
    Updated Jul 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH), Department of Health & Human Services (2023). NIH Data Sharing Repositories [Dataset]. https://catalog.data.gov/dataset/nih-data-sharing-repositories
    Explore at:
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    United States Department of Health and Human Serviceshttp://www.hhs.gov/
    Description

    A list of NIH-supported repositories that accept submissions of appropriate scientific research data from biomedical researchers. It includes resources that aggregate information about biomedical data and information sharing systems. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of contact for further information or inquiries can be found on the websites of the individual repositories.

  2. NIH Research Portfolio Online Reporting Tools: Expenditures and Results...

    • catalog.data.gov
    • healthdata.gov
    • +1more
    Updated Jul 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH), Department of Health & Human Services (2023). NIH Research Portfolio Online Reporting Tools: Expenditures and Results (RePORTER) [Dataset]. https://catalog.data.gov/dataset/nih-exported-research-portfolio-online-reporting-tools-expenditures-and-results-exporter-f7455
    Explore at:
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    United States Department of Health and Human Serviceshttp://www.hhs.gov/
    Description

    Research projects funded by the National Institutes of Health (NIH), other DHHS Operating Divisions (ACF, AHRQ, CDC, FDA, HRSA), and the Department of Veterans Affairs. The ExPORTER files provide weekly and/or yearly snapshots of the data publicly accessible through the NIH Research Portfolio Online Reporting Tools, Expenditures and Results (RePORTER) system at https://reporter.nih.gov. The RePORTER database can also be queried using the user interface or the API. The RePORTER database contains information such as project title, abstract, principal investigator, funded organization, total awarded costs, categorization by area of research (NIH only), and project keywords. Also available is information on research publications and patents that have cited support from each project.

  3. d

    NIH Common Data Elements Repository

    • catalog.data.gov
    • datadiscovery.nlm.nih.gov
    • +3more
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (2025). NIH Common Data Elements Repository [Dataset]. https://catalog.data.gov/dataset/nih-common-data-elements-repository-f6b3a
    Explore at:
    Dataset updated
    Jun 4, 2025
    Dataset provided by
    National Library of Medicine
    Description

    The NIH Common Data Elements (CDE) Repository has been designed to provide access to structured human and machine-readable definitions of data elements that have been recommended or required by NIH Institutes and Centers and other organizations for use in research and for other purposes. Visit the NIH CDE Resource Portal for contextual information about the repository.

  4. s

    NIH Data Sharing Repositories

    • scicrunch.org
    • dknet.org
    • +1more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NIH Data Sharing Repositories [Dataset]. http://identifiers.org/RRID:SCR_003551
    Explore at:
    Description

    A listing of NIH supported data sharing repositories that make data accessible for reuse. Most accept submissions of appropriate data from NIH-funded investigators (and others), but some restrict data submission to only those researchers involved in a specific research network. Also included are resources that aggregate information about biomedical data and information sharing systems. The table can be sorted according by name and by NIH Institute or Center and may be searched using keywords so that you can find repositories more relevant to your data. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of-contact for further information or inquiries can be found on the websites of the individual repositories.

  5. V

    The Immunology Database and Analysis Portal (ImmPort)

    • data.virginia.gov
    • healthdata.gov
    • +3more
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). The Immunology Database and Analysis Portal (ImmPort) [Dataset]. https://data.virginia.gov/dataset/the-immunology-database-and-analysis-portal-immport
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    The ImmPort system serves as a long-term, sustainable archive of immunology research data generated by investigators mainly funded through the NIAID/DAIT. The core component of the ImmPort system is an extensive data warehouse containing an integration of experimental data and clinical trial data. The ImmPort system also provides data analysis tools and an immunology-focused ontology. The analytical tools created and integrated as part of the ImmPort system are available to any researcher within ImmPort after registration and approval by DAIT. Additionally, the data provided mainly by NIAID/DAIT funded researchers in ImmPort will be available to all registered users after the appropriate embargo time.

  6. The Zebrafish Model Organism Database (ZFIN)

    • healthdata.gov
    • data.virginia.gov
    • +1more
    application/rssxml +4
    Updated Feb 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). The Zebrafish Model Organism Database (ZFIN) [Dataset]. https://healthdata.gov/w/vjxt-srnu/default?cur=IzBQE6nQN2D&from=YEd3M0A4hJY
    Explore at:
    tsv, json, csv, xml, application/rssxmlAvailable download formats
    Dataset updated
    Feb 13, 2021
    Description

    ZFIN serves as the zebrafish model organism database. It aims to: a) be the community database resource for the laboratory use of zebrafish, b) develop and support integrated zebrafish genetic, genomic and developmental information, c) maintain the definitive reference data sets of zebrafish research information, d) to link this information extensively to corresponding data in other model organism and human databases, e) facilitate the use of zebrafish as a model for human biology, and f) serve the needs of the research community.

  7. h

    the-pile-nih-refined-by-data-juicer

    • huggingface.co
    Updated Sep 15, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data-Juicer (2015). the-pile-nih-refined-by-data-juicer [Dataset]. https://huggingface.co/datasets/datajuicer/the-pile-nih-refined-by-data-juicer
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 15, 2015
    Dataset authored and provided by
    Data-Juicer
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Pile -- NIHExPorter (refined by Data-Juicer)

    A refined version of NIHExPorter dataset in The Pile by Data-Juicer. Removing some "bad" samples from the original dataset to make it higher-quality. This dataset is usually used to pretrain a Large Language Model. Notice: Here is a small subset for previewing. The whole dataset is available here (About 2.0G).

      Dataset Information
    

    Number of samples: 858,492 (Keep ~91.36% from the original dataset)

      Refining… See the full description on the dataset page: https://huggingface.co/datasets/datajuicer/the-pile-nih-refined-by-data-juicer.
    
  8. w

    Database of Interacting Proteins (DIP)

    • data.wu.ac.at
    • data.virginia.gov
    • +2more
    Updated Jul 19, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Health & Human Services (2016). Database of Interacting Proteins (DIP) [Dataset]. https://data.wu.ac.at/schema/data_gov/ZGRmOTE3ODItZjM0MC00ZmE0LWFhYzYtZWVmNDlhZjRmODZm
    Explore at:
    Dataset updated
    Jul 19, 2016
    Dataset provided by
    U.S. Department of Health & Human Services
    Description

    The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent set of protein-protein interactions. The data stored within the DIP database are curated both manually by expert curators and also automatically using computational approaches that utilize the the knowledge about the protein-protein interaction networks extracted from the most reliable, core subset of the DIP data.

  9. Cell Centred Database (CCDB)

    • healthdata.gov
    • data.virginia.gov
    • +3more
    application/rdfxml +5
    Updated Feb 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Cell Centred Database (CCDB) [Dataset]. https://healthdata.gov/dataset/Cell-Centred-Database-CCDB-/wfeu-vbrn
    Explore at:
    xml, csv, application/rssxml, tsv, application/rdfxml, jsonAvailable download formats
    Dataset updated
    Feb 13, 2021
    Description

    The Cell Centered Database (CCDB) is a web accessible database for high resolution 2D, 3D and 4D data from light and electron microscopy, including correlated imaging.

  10. N

    Learning Resources Database

    • datadiscovery.nlm.nih.gov
    • data.virginia.gov
    • +2more
    application/rdfxml +5
    Updated Jun 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Learning Resources Database [Dataset]. https://datadiscovery.nlm.nih.gov/Other/Learning-Resources-Database/khy6-95gu
    Explore at:
    xml, csv, application/rssxml, application/rdfxml, json, tsvAvailable download formats
    Dataset updated
    Jun 8, 2025
    Description

    The Learning Resources Database is a catalog of interactive tutorials, videos, online classes, finding aids, and other instructional resources on National Library of Medicine (NLM) products and services. Resources may be available for immediate use via a browser or downloadable for use in course management systems.

  11. d

    The National Institute on Aging Genetics of Alzheimer s Disease Data Storage...

    • datadiscoverystudio.org
    Updated Jul 15, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). The National Institute on Aging Genetics of Alzheimer s Disease Data Storage Site (NIAGADS) [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/407ca2478c0f4272999613b173e6d17b/html
    Explore at:
    Dataset updated
    Jul 15, 2016
    Description

    The National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) is a national genetics data repository facilitating access to genotypic and phenotypic data for Alzheimer's disease (AD). Data include GWAS, whole genome (WGS) and whole exome (WES), expression, RNA Seq, and CHIP Seq analyses. Data for the Alzheimer s Disease Sequencing Project (ADSP) are available through a partnership with dbGaP (ADSP at dbGaP). Results are integrated and annotated in the searchable genomics database that also provides access to a variety of software packages, analytic pipelines, online resources, and web-based tools to facilitate analysis and interpretation of large-scale genomic data. Data are available as defined by the NIA Genomics of Alzheimer s Disease Sharing Policy and the NIH Genomics Data Sharing Policy. Investigators return secondary analysis data to the database in keeping with the NIAGADS Data Distribution Agreement.

  12. V

    Eukaryotic Pathogen Database Resources (EuPathDB)

    • data.virginia.gov
    • healthdata.gov
    • +2more
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). Eukaryotic Pathogen Database Resources (EuPathDB) [Dataset]. https://data.virginia.gov/dataset/eukaryotic-pathogen-database-resources-eupathdb
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    EuPathDB Bioinformatics Resource Center for Biodefense and Emerging/Re-emerging Infectious Diseases is a portal for accessing genomic-scale datasets associated with the eukaryotic pathogens.

  13. Nonalcoholic Fatty Liver Disease (NAFLD) Adult Database

    • repository.niddk.nih.gov
    Updated Apr 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    James Tonascia (2024). Nonalcoholic Fatty Liver Disease (NAFLD) Adult Database [Dataset]. https://repository.niddk.nih.gov/studies/nafld_adult
    Explore at:
    Dataset updated
    Apr 22, 2024
    Authors
    James Tonascia
    Variables measured
    - Diagnosis of definite NASH - Stage of fibrosis - Grade of inflammation - Presence of hepatocellular ballooning injury
    Dataset funded by
    RFA-DK-18-506
    National Institute of Diabetes and Digestive and Kidney Diseaseshttp://niddk.nih.gov/
    Description

    Nonalcoholic fatty liver disease (NAFLD) affects 10%-30% of the general U.S. population and can progress to significant fibrosis and cirrhosis. When nonalcoholic steatohepatitis (NASH) is present, the 5-year and 10-year survivals are estimated at 67% and 59%, respectively. The presence of NASH and early fibrosis is currently established only by liver biopsy; noninvasively determining who has NASH and who is at risk for progressing to cirrhosis remains challenging.

    The Nonalcoholic Steatohepatitis Clinical Research Network (NASH CRN) was initiated by the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) in 2002 to conduct multicenter, collaborative studies on the etiology, contributing factors, natural history, complications, and treatment of NASH. To meet these goals, patients with the full spectrum of NAFLD or cryptogenic cirrhosis were enrolled in an observational Database study.

    Comprehensive data, including demographics, medical history, symptoms, medication use, diet and exercise habits, and routine laboratory studies were collected on all patients at entry and at annual visits for up to 4 years after enrollment. Study questionnaires administered at enrollment and at selected follow-up visits included AUDIT; Block Food Questionnaire; Skinner Lifetime Drinking History, Physical Activity Questionnaire, Modifiable Activity Questionnaire; and the MOS 36-Item Short-Form Health Survey. Specimens were collected at selected time points during follow-up. If liver biopsies were obtained as part of routine patient care, they were scored using the NASH CRN NAFLD Activity Score (NAS) and fibrosis score.

  14. N

    Database of Short Genetic Variations (dbSNP)

    • datadiscovery.nlm.nih.gov
    • data.virginia.gov
    • +2more
    application/rdfxml +5
    Updated Jun 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Database of Short Genetic Variations (dbSNP) [Dataset]. https://datadiscovery.nlm.nih.gov/Molecular-biology-Genetics/Database-of-Short-Genetic-Variations-dbSNP-/x4yw-gnzq
    Explore at:
    json, tsv, csv, application/rdfxml, xml, application/rssxmlAvailable download formats
    Dataset updated
    Jun 17, 2021
    Description

    Database of Short Genetic Variations (dbSNP) contains human single nucleotide variations, microsatellites, and small-scale insertions and deletions along with publication, population frequency, molecular consequence, and genomic and RefSeq mapping information for both common variations and clinical mutations.

  15. d

    National Institutes of Health Research Portfolio Online Reporting Tool

    • dknet.org
    • neuinfo.org
    • +1more
    Updated Oct 18, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). National Institutes of Health Research Portfolio Online Reporting Tool [Dataset]. http://identifiers.org/RRID:SCR_006874
    Explore at:
    Dataset updated
    Oct 18, 2019
    Description

    A database of federally funded biomedical research projects conducted at universities, hospitals, and other research institutions that provides a central point of access to reports, data, and analyses of NIH research. The RePORTER has replaced the CRISP database. The database, maintained by the Office of Extramural Research at the National Institutes of Health, includes projects funded by the National Institutes of Health (NIH), Substance Abuse and Mental Health Services (SAMHSA), Health Resources and Services Administration (HRSA), Food and Drug Administration (FDA), Centers for Disease Control and Prevention (CDCP), Agency for Health Care Research and Quality (AHRQ), and Office of Assistant Secretary of Health (OASH).

  16. Influenza Research Database (IRD)

    • data.wu.ac.at
    • healthdata.gov
    • +4more
    Updated Jul 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Health & Human Services (2016). Influenza Research Database (IRD) [Dataset]. https://data.wu.ac.at/schema/data_gov/NWVhZGQ2Y2EtZTVlMC00YjQ4LWFkMGYtYTk5YmY1MzQwZGE0
    Explore at:
    Dataset updated
    Jul 19, 2016
    Dataset provided by
    United States Department of Health and Human Serviceshttp://www.hhs.gov/
    Description

    The Influenza Research Database (IRD) serves as a public repository and analysis platform for flu sequence, experiment, surveillance and related data.

  17. V

    Mouse Phenome Database (MPD)

    • data.virginia.gov
    • catalog.data.gov
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). Mouse Phenome Database (MPD) [Dataset]. https://data.virginia.gov/dataset/mouse-phenome-database-mpd
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    The Mouse Phenome Database (MPD) has characterizations of hundreds of strains of laboratory mice to facilitate translational discoveries and to assist in selection of strains for experimental studies.

  18. Z

    Data from: COInr a comprehensive, non-redundant COI database from NCBI-nt...

    • data.niaid.nih.gov
    • explore.openaire.eu
    • +1more
    Updated May 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meglecz, Emese (2024). COInr a comprehensive, non-redundant COI database from NCBI-nt and BOLD [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6555984
    Explore at:
    Dataset updated
    May 6, 2024
    Dataset authored and provided by
    Meglecz, Emese
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    COInr is a non-redundant, comprehensive database of COI sequences extracted from NCBI-nt and BOLD. It is not limited to a taxon, a gene region, or a taxonomic resolution. Sequences are dereplicated between databases and within taxa.

    Each taxon has a unique taxonomic Identifier (taxID), fundamental to avoid ambiguous associations of homonyms and synonyms in the source database. TaxIDs form a coherent hierarchical system fully compatible with the NCBI taxIDs allowing creating their full or ranked linages.

    COInr is a good starting point to create custom databases according to the users’ needs using mkCOInr scripts available at https://github.com/meglecz/mkCOInr
    It is possible to select/eliminate sequences for a list of taxa, select a specific gene region, select for minimum taxonomic resolution, add new custom sequences, and format the database for BLAST, QIIME, RDP classifiers.

  19. H

    Replication Data for: NIH funding and the pursuit of edge science

    • dataverse.harvard.edu
    Updated May 7, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mikko Packalen (2020). Replication Data for: NIH funding and the pursuit of edge science [Dataset]. http://doi.org/10.7910/DVN/QT5OGS
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 7, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    Mikko Packalen
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The zip files contains the data and programs for replicating the statistical analyses in Packalen M and J Bhattacharya (2020) “NIH Funding and the Pursuit of Edge Science”. Earlier version of the paper was circulated as NBER working paper No. 24860, titled “Does the NIH Fund Edge Science?” The file Readme_NIHEdgeScience_StatisticalAnalysis.pdf contains documentation for the data files and programs.

  20. V

    Data from: Gene Expression Omnibus (GEO)

    • data.virginia.gov
    • healthdata.gov
    • +2more
    Updated Jul 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). Gene Expression Omnibus (GEO) [Dataset]. https://data.virginia.gov/dataset/gene-expression-omnibus-geo
    Explore at:
    Dataset updated
    Jul 25, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided to help users query and download experiments and curated gene expression profiles.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Institutes of Health (NIH), Department of Health & Human Services (2023). NIH Data Sharing Repositories [Dataset]. https://catalog.data.gov/dataset/nih-data-sharing-repositories
Organization logo

NIH Data Sharing Repositories

Explore at:
Dataset updated
Jul 26, 2023
Dataset provided by
United States Department of Health and Human Serviceshttp://www.hhs.gov/
Description

A list of NIH-supported repositories that accept submissions of appropriate scientific research data from biomedical researchers. It includes resources that aggregate information about biomedical data and information sharing systems. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of contact for further information or inquiries can be found on the websites of the individual repositories.

Search
Clear search
Close search
Google apps
Main menu