33 datasets found
  1. N

    National COVID Cohort Collaborative Data Enclave

    • datacatalog.med.nyu.edu
    Updated Aug 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States - National Center for Advancing Translational Sciences (NCATS) (2025). National COVID Cohort Collaborative Data Enclave [Dataset]. https://datacatalog.med.nyu.edu/dataset/10384
    Explore at:
    Dataset updated
    Aug 6, 2025
    Dataset authored and provided by
    United States - National Center for Advancing Translational Sciences (NCATS)
    Time period covered
    Jan 1, 2020 - Present
    Area covered
    United States
    Description

    The National Center for Advancing Translational Sciences (NCATS) has systematically compiled clinical, laboratory and diagnostic data from electronic health records to support COVID-19 research efforts via the National COVID Cohort Collaborative (N3C) Data Enclave. As of August 2, 2022, the repository contains information from over 15 million patients (including 5.8 million COVID-19 positive patients) across the United States.

    The N3C Data Enclave is organized into 3 levels of data with varying access restrictions:

    • Synthetic dataset: Contains no protected health information (PHI). This is a statistically-comparable artificial dataset derived from the original dataset.
      • Can be requested by: Researchers from US-based or foreign institutions, and citizen scientists
    • De-identified dataset: Contains no PHI. This dataset consists of real patient data with shifted dates of service and truncated ZIP codes of patients residing in areas with populations above 20,000.
      • Can be requested by: Researchers from US-based or foreign institutions
    • Limited Data Set (LDS): Contains 2 PHI elements (dates of service and patient ZIP code). This dataset consists of real patient data.
      • Can be requested by: Researchers from US-based institutions only

  2. M

    NCATS National COVID Cohort Collaborative (N3C) Data Enclave

    • catalog.midasnetwork.us
    Updated Sep 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Center for Advancing Translational Sciences; National COVID Cohort Collaborative (N3C) (2025). NCATS National COVID Cohort Collaborative (N3C) Data Enclave [Dataset]. https://catalog.midasnetwork.us/collection/337
    Explore at:
    Dataset updated
    Sep 1, 2025
    Dataset provided by
    MIDAS COORDINATION CENTER
    Authors
    National Center for Advancing Translational Sciences; National COVID Cohort Collaborative (N3C)
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Variables measured
    Viruses, disease, COVID-19, pathogen, Homo sapiens, host organism, mortality data, Population count, diagnostic tests, infectious disease, and 6 more
    Dataset funded by
    National Institute of General Medical Sciences
    Description

    The N3C Data Enclave is a secure platform through which harmonized clinical data provided by our contributing members are stored. The Enclave includes demographic and clinical characteristics of patients who have been tested for or diagnosed with COVID-19, and further information about the strategies and outcomes of treatments for those suspected or confirmed to have the virus. Additional data from individuals infected with pathogens such as SARS 1, MERS, and H1N1 are also included to support comparative studies. Data can be accessed only within the N3C Data Enclave and cannot be downloaded or removed. Three tiers of access are available for users depending on the scope and nature of their research; however, all will require verification and approval by the Data Access Committee (DAC) before data can be accessed.

  3. N3C-Formatted OMOP2OBO Mappings

    • zenodo.org
    • data.niaid.nih.gov
    csv, json +2
    Updated Oct 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tiffany J Callahan; Tiffany J Callahan; N3C OMOP to OBO Working Group; N3C OMOP to OBO Working Group (2022). N3C-Formatted OMOP2OBO Mappings [Dataset]. http://doi.org/10.5281/zenodo.7249166
    Explore at:
    csv, zip, json, text/x-pythonAvailable download formats
    Dataset updated
    Oct 27, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Tiffany J Callahan; Tiffany J Callahan; N3C OMOP to OBO Working Group; N3C OMOP to OBO Working Group
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OMOP2OBO Mappings - N3C OMOP to OBO Working group

    This repository stores OMOP2OBO mappings which have been processed for use within the National COVID Cohort Collaborative (N3C) Enclave. The version of the mappings stored in this repository have been specifically formatted for use within the N3C Enclave.

    N3C OMOP to OBO Working Group: https://covid.cd2h.org/ontology

    Accessing the N3C-Formatted Mappings

    You can access the three OMOP2OBO HPO mapping files in the Enclave from the Knowledge store using the following link: https://unite.nih.gov/workspace/compass/view/ri.compass.main.folder.1719efcf-9a87-484f-9a67-be6a29598567.

    The mapping set includes three files, but you only need to merge the following two files with existing data in the Enclave in order to be able to create the concept sets:

    • OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv
    • OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv

    The first file OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv, contains columns for the OMOP concept ids and codes as well as specifies information like whether or not the OMOP concept’s descendants should be included when deriving the concept sets (defaults to FALSE). The other file OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv, contains details on the mapping’s label (i.e., the HPO curie and label in the concept_set_id field) and its provenance/evidence (the specific column to access for this information is called intention).

    Creating Concept Sets

    Merge these files together on the column named codeset_id and then join them with existing Enclave tables like concept and condition_occurrence to populate the actual concept sets. The name of the concept set can be obtained from the OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv file and is stored as a string in the column called concept_set_id. Although not ideal (but is the best way to approach this currently given what fields are available in the Enclave), to get the HPO CURIE and label will require applying a regex to this column.

    An example mapping is shown below (highlighting some of the most useful columns):

    codeset_id: 900000000
    concept_set_id: [OMOP2OBO] hp_0002031-abnormal_esophagus_morphology
    concept: 23868
    code: 69771008
    codeSystem: SNOMED
    includeDescendants: False
    intention:
    
    Mixed - This mapping was created using the OMOP2OBO mapping algorithm (https://github.com/callahantiff/OMOP2OBO).
    
    The Mapping Category and Evidence supporting the mappings are provided below, by OMOP concept:
    
    23868
    *******
    
    Mapping Category: Automatic Exact - Concept
    ------------------------------------------------
    
    Mapping Provenance
    ------------------
    OBO_DbXref-OMOP_ANCESTOR_SOURCE_CODE:snomed_69771008 | OBO_DbXref-OMOP_CONCEPT_SOURCE_CODE:snomed_69771008 | CONCEPT_SIMILARITY:HP_0002031_0.713

    Release Notes - v1.0.0

    Preparation

    In order to import data into the Enclave, the following items are needed:

    • Obtain API Token, which will be included in the authorization header (stored as GitHub Secret)
    • Obtain username hash from the Enclave
    • OMOP2OBO Mappings (v1.0.0)

    Data

    • Concept Set Container (concept_set_container): CreateNewConceptSet
    • Concept Set Version (code_sets): CreateNewDraftOMOPConceptSetVersion
    • Concept Set Expression Items (concept_set_version_item): addCodeAsVersionExpression

    Script

    Generated Output

    Need to have the codeset_id filled from self-generation (ideally, from a conserved range) prior to beginning any of the API steps. The current list of assigned identifiers is stored in the file named omop2obo_enclave_codeset_id_dict_v1.0.0.json.

    To be consistent with OMOP tools, specifically Atlas, we have also created Atlas-formatted json files for each mapping, which are stored in the zipped directory named atlas_json_files_v1.0.0.zip.

    File 1: concept_set_container

    • Generated Data: OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_container.csv
    • Columns:
      • concept_set_id
      • concept_set_name
      • intention
      • assigned_informatician
      • assigned_sme
      • project_id
      • status
      • stage
      • n3c_reviewer
      • alias
      • archived
      • created_by
      • created_at

    File 2: concept_set_expression_items

    • Generated Data: OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv
    • Columns:
      • codeset_id
      • concept_id
      • code
      • codeSystem
      • isExcluded
      • includeDescendants
      • includeMapped
      • item_id
      • annotation
      • created_by
      • created_at

    File 3: concept_set_version

    • Generated Data: OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv
    • Columns:
      • codeset_id
      • concept_set_id
      • concept_set_version_title
      • project
      • source_application
      • source_application_version
      • created_at
      • atlas_json
      • most_recent_version
      • comments
      • intention
      • limitations
      • issues
      • update_message
      • status
      • has_review
      • reviewed_by
      • created_by
      • provenance
      • atlas_json_resource_url
      • parent_version_id
      • is_draft

    Generated Output:

    • OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_container.csv
    • OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv
    • OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv
    • atlas_json_files_v1.0.0.zip
    • omop2obo_enclave_codeset_id_dict_v1.0.0.json
  4. f

    Baseline characteristics of all patients in N3C receiving 2 doses of mRNA...

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alfred Jerrod Anzalone; Jing Sun; Amanda J. Vinson; William H. Beasley; William B. Hillegass; Kimberly Murray; Brian M. Hendricks; Melissa Haendel; Carol Reynolds Geary; Kristina L. Bailey; Corrine K. Hanson; Lucio Miele; Ronald Horswell; Julie A. McMurry; J. Zachary Porterfield; Michael T. Vest; H. Timothy Bunnell; Jeremy R. Harper; Bradley S. Price; Susan L. Santangelo; Clifford J. Rosen; James C. McClay; Sally L. Hodder (2023). Baseline characteristics of all patients in N3C receiving 2 doses of mRNA vaccine between January 1, 2021, and September 21, 2021. [Dataset]. http://doi.org/10.1371/journal.pone.0279968.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Alfred Jerrod Anzalone; Jing Sun; Amanda J. Vinson; William H. Beasley; William B. Hillegass; Kimberly Murray; Brian M. Hendricks; Melissa Haendel; Carol Reynolds Geary; Kristina L. Bailey; Corrine K. Hanson; Lucio Miele; Ronald Horswell; Julie A. McMurry; J. Zachary Porterfield; Michael T. Vest; H. Timothy Bunnell; Jeremy R. Harper; Bradley S. Price; Susan L. Santangelo; Clifford J. Rosen; James C. McClay; Sally L. Hodder
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Baseline characteristics of all patients in N3C receiving 2 doses of mRNA vaccine between January 1, 2021, and September 21, 2021.

  5. d

    Data from: An ordinal severity scale for COVID-19 retrospective studies...

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated May 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maryam Khodaverdi; Bradley Price; Zachary Porterfield; Timothy Bunnell; Michael Vest; Jerrod Anzalone; Jeremy Harper; Wes Kimble; Hamidreza Moradi; Brian Hendricks; Susan Santangelo; Sally Hodder (2025). An ordinal severity scale for COVID-19 retrospective studies using electronic health record data [Dataset]. http://doi.org/10.5061/dryad.dncjsxm2q
    Explore at:
    Dataset updated
    May 17, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Maryam Khodaverdi; Bradley Price; Zachary Porterfield; Timothy Bunnell; Michael Vest; Jerrod Anzalone; Jeremy Harper; Wes Kimble; Hamidreza Moradi; Brian Hendricks; Susan Santangelo; Sally Hodder
    Time period covered
    Jan 1, 2022
    Description

    Objectives: Although the World Health Organization (WHO) Clinical Progression Scale for COVID-19 is useful in prospective clinical trials, it cannot be effectively used with retrospective Electronic Health Record (EHR) datasets. Modifying the existing WHO Clinical Progression Scale, we developed an ordinal severity scale (OS) and assessed its usefulness in the analyses of COVID-19 patient outcomes using retrospective EHR data. Results: The data set used in this analysis consists of 2,880,456 patients. PCA of the day-to-day variation in OS levels over the totality of the 28-day period revealed contrasting patterns of variation in disease severity within the first and second 14 days and illustrated the importance of evaluation over the full 28-day period. Discussion: An OS with well-defined, robust features, based on discrete EHR data elements, is useful for assessments of COVID-19 patient outcomes, providing insights on progression of COVID-19 disease severity over time. Conclusion: The ...

  6. H

    N3C MACE

    • dataverse.harvard.edu
    Updated Mar 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anai Kothari (2023). N3C MACE [Dataset]. http://doi.org/10.7910/DVN/DFSGWU
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 15, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Anai Kothari
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Supplementary data for N3C MACE study

  7. Representative examples of false negatives for positive mentions of “fever”...

    • plos.figshare.com
    xls
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Representative examples of false negatives for positive mentions of “fever” in N3C COVID corpus, “diarrhea” in UMN PASC corpus and “chest pain” in N3C corpus as returned by BioMedICUS and both LLMs along with explanations. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Representative examples of false negatives for positive mentions of “fever” in N3C COVID corpus, “diarrhea” in UMN PASC corpus and “chest pain” in N3C corpus as returned by BioMedICUS and both LLMs along with explanations.

  8. anti-N3C(O)N

    • webbook.nist.gov
    Updated Mar 29, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2018). anti-N3C(O)N [Dataset]. https://webbook.nist.gov/cgi/cbook.cgi?ID=B1003961
    Explore at:
    Dataset updated
    Mar 29, 2018
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    License

    https://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRDhttps://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRD

    Description

    This page, "anti-N3C(O)N", is part of the NIST Chemistry WebBook. This site and its contents are part of the NIST Standard Reference Data Program.

  9. e

    Clostridium sp. N3C

    • ebi.ac.uk
    Updated Aug 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Clostridium sp. N3C [Dataset]. https://www.ebi.ac.uk/interpro/taxonomy/uniprot/1776758
    Explore at:
    Dataset updated
    Aug 16, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The main entity of this document is a taxonomy with accession number 1776758

  10. w

    xn--n3c.com - Historical whois Lookup

    • whoisdatacenter.com
    csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, xn--n3c.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/xn--n3c.com/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Aug 15, 2025
    Description

    Explore the historical Whois records related to xn--n3c.com (Domain). Get insights into ownership history and changes over time.

  11. w

    xn--reisercktritts-versicherung-n3c.com - Historical whois Lookup

    • whoisdatacenter.com
    csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, xn--reisercktritts-versicherung-n3c.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/xn--reisercktritts-versicherung-n3c.com/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Oct 8, 2025
    Description

    Explore the historical Whois records related to xn--reisercktritts-versicherung-n3c.com (Domain). Get insights into ownership history and changes over time.

  12. f

    Demographics of various corpora.

    • plos.figshare.com
    xls
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Demographics of various corpora. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundPatient symptoms, crucial for disease progression and diagnosis, are often captured in unstructured clinical notes. Large language models (LLMs) offer potential advantages in extracting patient symptoms compared to traditional rule-based information extraction (IE) systems.MethodsThis study compared fine-tuned LLMs (LLaMA2-13B and LLaMA3-8B) against BioMedICUS, a rule-based IE system, for extracting symptoms related to acute and post-acute sequelae of SARS-CoV-2 from clinical notes. The study utilized three corpora: UMN-COVID, UMN-PASC, and N3C-COVID. Prevalence, keyword and fairness analyses were conducted to assess symptom distribution and model equity across demographics.ResultsBioMedICUS outperformed fine-tuned LLMs in most cases. On the UMN PASC dataset, BioMedICUS achieved a macro-averaged F1-score of 0.70 for positive mention detection, compared to 0.66 for LLaMA2-13B and 0.62 for LLaMA3-8B. For the N3C COVID dataset, BioMedICUS scored 0.75, while LLaMA2-13B and LLaMA3-8B scored 0.53 and 0.68, respectively for positive mention detection. However, LLMs performed better in specific instances, such as detecting positive mentions of change in sleep in the UMN PASC dataset, where LLaMA2-13B (0.79) and LLaMA3-8B (0.65) outperformed BioMedICUS (0.60). For fairness analysis, BioMedICUS generally showed stronger performance across patient demographics. Keyword analysis using ANOVA on symptom distributions across all three corpora showed that both corpus (df = 2, p 

  13. w

    xn--warnemnde-zimmervermittlung-n3c.info - Historical whois Lookup

    • whoisdatacenter.com
    csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, xn--warnemnde-zimmervermittlung-n3c.info - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/xn--warnemnde-zimmervermittlung-n3c.info/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Oct 9, 2025
    Description

    Explore the historical Whois records related to xn--warnemnde-zimmervermittlung-n3c.info (Domain). Get insights into ownership history and changes over time.

  14. Macro-averaged metrics with 95% confidence intervals for evaluation of...

    • plos.figshare.com
    xls
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Macro-averaged metrics with 95% confidence intervals for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B extraction performance in positive and negative symptom mentions for N3C COVID. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Macro-averaged metrics with 95% confidence intervals for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B extraction performance in positive and negative symptom mentions for N3C COVID.

  15. f

    COVID-19 severity indicator codes used in N3C.

    • plos.figshare.com
    xlsx
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shannon Wuller; Nora G. Singer; Colby Lewis; Elizabeth W. Karlson; Grant S. Schulert; Jason D. Goldman; Jennifer Hadlock; Jonathan Arnold; Kathryn Hirabayashi; Lauren E. Stiles; Lawrence C. Kleinman; Lindsay G. Cowell; Mady Hornig; Margaret A. Hall; Mark G. Weiner; Michael Koropsak; Michelle F. Lamendola-Essel; Rachel Kenney; Richard A. Moffitt; Sajjad Abedian; Shari Esquenazi-Karonika; Steven G. Johnson; Stephenson Stroebel; Zachary S. Wallace; Karen H. Costenbader (2025). COVID-19 severity indicator codes used in N3C. [Dataset]. http://doi.org/10.1371/journal.pone.0324513.s007
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 4, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Shannon Wuller; Nora G. Singer; Colby Lewis; Elizabeth W. Karlson; Grant S. Schulert; Jason D. Goldman; Jennifer Hadlock; Jonathan Arnold; Kathryn Hirabayashi; Lauren E. Stiles; Lawrence C. Kleinman; Lindsay G. Cowell; Mady Hornig; Margaret A. Hall; Mark G. Weiner; Michael Koropsak; Michelle F. Lamendola-Essel; Rachel Kenney; Richard A. Moffitt; Sajjad Abedian; Shari Esquenazi-Karonika; Steven G. Johnson; Stephenson Stroebel; Zachary S. Wallace; Karen H. Costenbader
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SARS-CoV-2 infection has been associated with increased autoimmune disease risk. Past studies have not aligned regarding the most prevalent autoimmune diseases after infection, however. Furthermore, the relationship between infection severity and new autoimmune disease risk has not been well examined. We used RECOVER’s electronic health record (EHR) networks, N3C, PCORnet, and PEDSnet, to estimate types and frequency of autoimmune diseases arising after SARS-CoV-2 infection and assessed how infection severity related to autoimmune disease risk. We identified patients of any age with SARS-CoV-2 infection between April 1, 2020 and April 1, 2021, and assigned them to a World Health Organization COVID-19 severity category for adults or the PEDSnet acute COVID-19 illness severity classification system for children (30 days after SARS-CoV-2 infection index date and occurring ≥1 day apart. We calculated overall and infection severity-stratified incidence ratesper 1000 person-years for all autoimmune diseases. With least severe COVID-19 severity as reference, survival analyses examined incident autoimmune disease risk. The most common new-onset autoimmune diseases in all networks were thyroid disease, psoriasis/psoriatic arthritis, and inflammatory bowel disease. Among adults, inflammatory arthritis was the most common, and Sjögren’s disease also had high incidence. Incident type 1 diabetes and hematological autoimmune diseases were specifically found in children. Across networks, after adjustment, patients with highest COVID-19 severity had highest risk for new autoimmune disease vs. those with least severe disease (N3C: adjusted Hazard Ratio, (aHR) 1.47 (95%CI 1.33–1.66); PCORnet aHR 1.14 (95%CI 1.02–1.26); PEDSnet: aHR 3.14 (95%CI 2.42–4.07)]. Overall, severe acute COVID-19 was most strongly associated with autoimmune disease risk in three EHR networks.

  16. Characteristics of cohorts.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated May 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asif Rahman; Michael Russell; Wanhong Zheng; Daniel Eckrich; Imtiaz Ahmed (2024). Characteristics of cohorts. [Dataset]. http://doi.org/10.1371/journal.pone.0295891.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Asif Rahman; Michael Russell; Wanhong Zheng; Daniel Eckrich; Imtiaz Ahmed
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Amid the ongoing global repercussions of SARS-CoV-2, it is crucial to comprehend its potential long-term psychiatric effects. Several recent studies have suggested a link between COVID-19 and subsequent mental health disorders. Our investigation joins this exploration, concentrating on Schizophrenia Spectrum and Psychotic Disorders (SSPD). Different from other studies, we took acute respiratory distress syndrome (ARDS) and COVID-19 lab-negative cohorts as control groups to accurately gauge the impact of COVID-19 on SSPD. Data from 19,344,698 patients, sourced from the N3C Data Enclave platform, were methodically filtered to create propensity matched cohorts: ARDS (n = 222,337), COVID-19 positive (n = 219,264), and COVID-19 negative (n = 213,183). We systematically analyzed the hazard rate of new-onset SSPD across three distinct time intervals: 0-21 days, 22-90 days, and beyond 90 days post-infection. COVID-19 positive patients consistently exhibited a heightened hazard ratio (HR) across all intervals [0-21 days (HR: 4.6; CI: 3.7-5.7), 22-90 days (HR: 2.9; CI: 2.3 -3.8), beyond 90 days (HR: 1.7; CI: 1.5-1.)]. These are notably higher than both ARDS and COVID-19 lab-negative patients. Validations using various tests, including the Cochran Mantel Haenszel Test, Wald Test, and Log-rank Test confirmed these associations. Intriguingly, our data indicated that younger individuals face a heightened risk of SSPD after contracting COVID-19, a trend not observed in the ARDS and COVID-19 negative groups. These results, aligned with the known neurotropism of SARS-CoV-2 and earlier studies, accentuate the need for vigilant psychiatric assessment and support in the era of Long-COVID, especially among younger populations.

  17. f

    Concepts categorization by accuracy level.

    • figshare.com
    xls
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li (2023). Concepts categorization by accuracy level. [Dataset]. http://doi.org/10.1371/journal.pone.0276923.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Concepts categorization by accuracy level.

  18. The Mitigating Effects of Telehealth Uptake on Disparities in Maternal Care...

    • icpsr.umich.edu
    Updated May 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hung, Peiyin; Li, Xiaoming (2025). The Mitigating Effects of Telehealth Uptake on Disparities in Maternal Care Access, Quality, Outcomes, and Expenditures, United States, 2018-2022 [Dataset]. http://doi.org/10.3886/ICPSR39023.v3
    Explore at:
    Dataset updated
    May 12, 2025
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    Hung, Peiyin; Li, Xiaoming
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/39023/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/39023/terms

    Time period covered
    2018 - 2022
    Area covered
    South Carolina, United States
    Description

    This study explores whether perinatal telehealth uptake has mitigated the pandemic's effects on disparities in maternal care access, quality, and outcomes by race, ethnicity, and rural or urban residence. Research to date has approached this question in several ways. First, researchers have utilized census data to assess whether community-wide broadband infrastructure exists to support the use of telehealth services in areas with high travel times to maternal care units. Findings suggest that socioeconomically disadvantaged communities face significant barriers to maternity care access, both with substantial travel burdens and inadequate digital access to facilitate telehealth services. Second, to examine maternal care quality, researchers have employed South Carolina hospital-based claims data and vital statistics to identify racial, ethnic, and urban/rural disparities in rates of cesarean delivery before and during the COVID-19 pandemic period. Results indicate that cesarean rates differed by rural vs. urban facility locations and racial and ethnic groups but observed disparities were not significantly exacerbated by the pandemic. Third, using South Carolina hospital-based claims data and COVID-19 testing data, researchers found significant racial, ethnic, and rural disparities in postpartum readmissions involving mental health and substance use disorders from childbirth discharge through one year postpartum during the COVID-19 pandemic. Finally, drawing on data from the National COVID Cohort Collaborative (N3C), research has shown that hybrid care increased substantially during the COVID-19 public health emergency, but pregnant people living in rural areas had lower levels of hybrid care than urban people, and individuals who belonged to racial and ethnic minority groups were more likely to have hybrid care than White individuals. Future research will investigate the impact of the COVID-19 pandemic and perinatal telehealth uptake on additional maternity care and birth outcomes by race, ethnicity, and urbanicity. The study also aims to assess how state-level telehealth policies relate to perinatal telehealth uptake by race, ethnicity, and urbanicity, and to develop a model to predict long-term changes in maternal care access, quality, outcomes, and expenditures, with and without state telehealth policies. The ICPSR provides variable-level metadata for the data associated with this study. The actual data may only be available from the Principal Investigator directly. The variable descriptions available through ICPSR also include information regarding the source of each variable listed, as does the Data Source field of these metadata.

  19. f

    Confusion matrix of the accuracy rating for the performance of the GA...

    • plos.figshare.com
    xls
    Updated Jun 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li (2023). Confusion matrix of the accuracy rating for the performance of the GA algorithm. [Dataset]. http://doi.org/10.1371/journal.pone.0276923.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 11, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Confusion matrix of the accuracy rating for the performance of the GA algorithm.

  20. f

    Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and...

    • plos.figshare.com
    xls
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B equity for race and gender in positive (+) and negative (-) symptom mentions for UMN PASC. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B equity for race and gender in positive (+) and negative (-) symptom mentions for UMN PASC.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
United States - National Center for Advancing Translational Sciences (NCATS) (2025). National COVID Cohort Collaborative Data Enclave [Dataset]. https://datacatalog.med.nyu.edu/dataset/10384

National COVID Cohort Collaborative Data Enclave

N3C Data Enclave

Explore at:
59 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Aug 6, 2025
Dataset authored and provided by
United States - National Center for Advancing Translational Sciences (NCATS)
Time period covered
Jan 1, 2020 - Present
Area covered
United States
Description

The National Center for Advancing Translational Sciences (NCATS) has systematically compiled clinical, laboratory and diagnostic data from electronic health records to support COVID-19 research efforts via the National COVID Cohort Collaborative (N3C) Data Enclave. As of August 2, 2022, the repository contains information from over 15 million patients (including 5.8 million COVID-19 positive patients) across the United States.

The N3C Data Enclave is organized into 3 levels of data with varying access restrictions:

  • Synthetic dataset: Contains no protected health information (PHI). This is a statistically-comparable artificial dataset derived from the original dataset.
    • Can be requested by: Researchers from US-based or foreign institutions, and citizen scientists
  • De-identified dataset: Contains no PHI. This dataset consists of real patient data with shifted dates of service and truncated ZIP codes of patients residing in areas with populations above 20,000.
    • Can be requested by: Researchers from US-based or foreign institutions
  • Limited Data Set (LDS): Contains 2 PHI elements (dates of service and patient ZIP code). This dataset consists of real patient data.
    • Can be requested by: Researchers from US-based institutions only

Search
Clear search
Close search
Google apps
Main menu