33 datasets found

N
National COVID Cohort Collaborative Data Enclave
datacatalog.med.nyu.edu
Updated Aug 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United States - National Center for Advancing Translational Sciences (NCATS) (2025). National COVID Cohort Collaborative Data Enclave [Dataset]. https://datacatalog.med.nyu.edu/dataset/10384
Explore at:
Dataset updated
Aug 6, 2025
Dataset authored and provided by
United States - National Center for Advancing Translational Sciences (NCATS)
Time period covered
Jan 1, 2020 - Present
Area covered
United States
Description
The National Center for Advancing Translational Sciences (NCATS) has systematically compiled clinical, laboratory and diagnostic data from electronic health records to support COVID-19 research efforts via the National COVID Cohort Collaborative (N3C) Data Enclave. As of August 2, 2022, the repository contains information from over 15 million patients (including 5.8 million COVID-19 positive patients) across the United States.

The N3C Data Enclave is organized into 3 levels of data with varying access restrictions:
Synthetic dataset: Contains no protected health information (PHI). This is a statistically-comparable artificial dataset derived from the original dataset.
Can be requested by: Researchers from US-based or foreign institutions, and citizen scientists

De-identified dataset: Contains no PHI. This dataset consists of real patient data with shifted dates of service and truncated ZIP codes of patients residing in areas with populations above 20,000.
Can be requested by: Researchers from US-based or foreign institutions

Limited Data Set (LDS): Contains 2 PHI elements (dates of service and patient ZIP code). This dataset consists of real patient data.
Can be requested by: Researchers from US-based institutions only
M
NCATS National COVID Cohort Collaborative (N3C) Data Enclave
catalog.midasnetwork.us
Updated Sep 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Center for Advancing Translational Sciences; National COVID Cohort Collaborative (N3C) (2025). NCATS National COVID Cohort Collaborative (N3C) Data Enclave [Dataset]. https://catalog.midasnetwork.us/collection/337
Explore at:
Dataset updated
Sep 1, 2025
Dataset provided by
MIDAS COORDINATION CENTER
Authors
National Center for Advancing Translational Sciences; National COVID Cohort Collaborative (N3C)
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Variables measured
Viruses, disease, COVID-19, pathogen, Homo sapiens, host organism, mortality data, Population count, diagnostic tests, infectious disease, and 6 more
Dataset funded by
National Institute of General Medical Sciences
Description
The N3C Data Enclave is a secure platform through which harmonized clinical data provided by our contributing members are stored. The Enclave includes demographic and clinical characteristics of patients who have been tested for or diagnosed with COVID-19, and further information about the strategies and outcomes of treatments for those suspected or confirmed to have the virus. Additional data from individuals infected with pathogens such as SARS 1, MERS, and H1N1 are also included to support comparative studies. Data can be accessed only within the N3C Data Enclave and cannot be downloaded or removed. Three tiers of access are available for users depending on the scope and nature of their research; however, all will require verification and approval by the Data Access Committee (DAC) before data can be accessed.
N3C-Formatted OMOP2OBO Mappings
zenodo.org
data.niaid.nih.gov
csv, json +2
Updated Oct 27, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tiffany J Callahan; Tiffany J Callahan; N3C OMOP to OBO Working Group; N3C OMOP to OBO Working Group (2022). N3C-Formatted OMOP2OBO Mappings [Dataset]. http://doi.org/10.5281/zenodo.7249166
Explore at:
csv, zip, json, text/x-pythonAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7249166
Dataset updated
Oct 27, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Tiffany J Callahan; Tiffany J Callahan; N3C OMOP to OBO Working Group; N3C OMOP to OBO Working Group
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
OMOP2OBO Mappings - N3C OMOP to OBO Working group

This repository stores OMOP2OBO mappings which have been processed for use within the National COVID Cohort Collaborative (N3C) Enclave. The version of the mappings stored in this repository have been specifically formatted for use within the N3C Enclave.

N3C OMOP to OBO Working Group: https://covid.cd2h.org/ontology

Accessing the N3C-Formatted Mappings

You can access the three OMOP2OBO HPO mapping files in the Enclave from the Knowledge store using the following link: https://unite.nih.gov/workspace/compass/view/ri.compass.main.folder.1719efcf-9a87-484f-9a67-be6a29598567.

The mapping set includes three files, but you only need to merge the following two files with existing data in the Enclave in order to be able to create the concept sets:

OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv

OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv

The first file OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv, contains columns for the OMOP concept ids and codes as well as specifies information like whether or not the OMOP concept’s descendants should be included when deriving the concept sets (defaults to FALSE). The other file OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv, contains details on the mapping’s label (i.e., the HPO curie and label in the concept_set_id field) and its provenance/evidence (the specific column to access for this information is called intention).

Creating Concept Sets

Merge these files together on the column named codeset_id and then join them with existing Enclave tables like concept and condition_occurrence to populate the actual concept sets. The name of the concept set can be obtained from the OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv file and is stored as a string in the column called concept_set_id. Although not ideal (but is the best way to approach this currently given what fields are available in the Enclave), to get the HPO CURIE and label will require applying a regex to this column.

An example mapping is shown below (highlighting some of the most useful columns):

codeset_id: 900000000 concept_set_id: [OMOP2OBO] hp_0002031-abnormal_esophagus_morphology concept: 23868 code: 69771008 codeSystem: SNOMED includeDescendants: False intention: Mixed - This mapping was created using the OMOP2OBO mapping algorithm (https://github.com/callahantiff/OMOP2OBO). The Mapping Category and Evidence supporting the mappings are provided below, by OMOP concept: 23868 ******* Mapping Category: Automatic Exact - Concept ------------------------------------------------ Mapping Provenance ------------------ OBO_DbXref-OMOP_ANCESTOR_SOURCE_CODE:snomed_69771008 | OBO_DbXref-OMOP_CONCEPT_SOURCE_CODE:snomed_69771008 | CONCEPT_SIMILARITY:HP_0002031_0.713

Release Notes - v1.0.0

Preparation

In order to import data into the Enclave, the following items are needed:

Obtain API Token, which will be included in the authorization header (stored as GitHub Secret)

Obtain username hash from the Enclave

OMOP2OBO Mappings (v1.0.0)

Data

Concept Set Container (concept_set_container): CreateNewConceptSet

Concept Set Version (code_sets): CreateNewDraftOMOPConceptSetVersion

Concept Set Expression Items (concept_set_version_item): addCodeAsVersionExpression

Script

n3c_mapping_conversion.py: https://github.com/callahantiff/OMOP2OBO/tree/master/applications/N3C

Generated Output

Need to have the codeset_id filled from self-generation (ideally, from a conserved range) prior to beginning any of the API steps. The current list of assigned identifiers is stored in the file named omop2obo_enclave_codeset_id_dict_v1.0.0.json.

To be consistent with OMOP tools, specifically Atlas, we have also created Atlas-formatted json files for each mapping, which are stored in the zipped directory named atlas_json_files_v1.0.0.zip.

File 1: concept_set_container

Generated Data: OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_container.csv

Columns:

concept_set_id

concept_set_name

intention

assigned_informatician

assigned_sme

project_id

status

stage

n3c_reviewer

alias

archived

created_by

created_at

File 2: concept_set_expression_items

Generated Data: OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv

Columns:

codeset_id

concept_id

code

codeSystem

isExcluded

includeDescendants

includeMapped

item_id

annotation

created_by

created_at

File 3: concept_set_version

Generated Data: OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv

Columns:

codeset_id

concept_set_id

concept_set_version_title

project

source_application

source_application_version

created_at

atlas_json

most_recent_version

comments

intention

limitations

issues

update_message

status

has_review

reviewed_by

created_by

provenance

atlas_json_resource_url

parent_version_id

is_draft

Generated Output:

OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_container.csv

OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_expression_items.csv

OMOP2OBO_v1.0.0_N3C_Enclave_CSV_concept_set_version.csv

atlas_json_files_v1.0.0.zip

omop2obo_enclave_codeset_id_dict_v1.0.0.json
f
Baseline characteristics of all patients in N3C receiving 2 doses of mRNA...
figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alfred Jerrod Anzalone; Jing Sun; Amanda J. Vinson; William H. Beasley; William B. Hillegass; Kimberly Murray; Brian M. Hendricks; Melissa Haendel; Carol Reynolds Geary; Kristina L. Bailey; Corrine K. Hanson; Lucio Miele; Ronald Horswell; Julie A. McMurry; J. Zachary Porterfield; Michael T. Vest; H. Timothy Bunnell; Jeremy R. Harper; Bradley S. Price; Susan L. Santangelo; Clifford J. Rosen; James C. McClay; Sally L. Hodder (2023). Baseline characteristics of all patients in N3C receiving 2 doses of mRNA vaccine between January 1, 2021, and September 21, 2021. [Dataset]. http://doi.org/10.1371/journal.pone.0279968.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0279968.t001
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Alfred Jerrod Anzalone; Jing Sun; Amanda J. Vinson; William H. Beasley; William B. Hillegass; Kimberly Murray; Brian M. Hendricks; Melissa Haendel; Carol Reynolds Geary; Kristina L. Bailey; Corrine K. Hanson; Lucio Miele; Ronald Horswell; Julie A. McMurry; J. Zachary Porterfield; Michael T. Vest; H. Timothy Bunnell; Jeremy R. Harper; Bradley S. Price; Susan L. Santangelo; Clifford J. Rosen; James C. McClay; Sally L. Hodder
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Baseline characteristics of all patients in N3C receiving 2 doses of mRNA vaccine between January 1, 2021, and September 21, 2021.
d
Data from: An ordinal severity scale for COVID-19 retrospective studies...
search.dataone.org
data.niaid.nih.gov
+1more
Updated May 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maryam Khodaverdi; Bradley Price; Zachary Porterfield; Timothy Bunnell; Michael Vest; Jerrod Anzalone; Jeremy Harper; Wes Kimble; Hamidreza Moradi; Brian Hendricks; Susan Santangelo; Sally Hodder (2025). An ordinal severity scale for COVID-19 retrospective studies using electronic health record data [Dataset]. http://doi.org/10.5061/dryad.dncjsxm2q
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.dncjsxm2q
Dataset updated
May 17, 2025
Dataset provided by
Dryad Digital Repository
Authors
Maryam Khodaverdi; Bradley Price; Zachary Porterfield; Timothy Bunnell; Michael Vest; Jerrod Anzalone; Jeremy Harper; Wes Kimble; Hamidreza Moradi; Brian Hendricks; Susan Santangelo; Sally Hodder
Time period covered
Jan 1, 2022
Description
Objectives: Although the World Health Organization (WHO) Clinical Progression Scale for COVID-19 is useful in prospective clinical trials, it cannot be effectively used with retrospective Electronic Health Record (EHR) datasets. Modifying the existing WHO Clinical Progression Scale, we developed an ordinal severity scale (OS) and assessed its usefulness in the analyses of COVID-19 patient outcomes using retrospective EHR data. Results: The data set used in this analysis consists of 2,880,456 patients. PCA of the day-to-day variation in OS levels over the totality of the 28-day period revealed contrasting patterns of variation in disease severity within the first and second 14 days and illustrated the importance of evaluation over the full 28-day period. Discussion: An OS with well-defined, robust features, based on discrete EHR data elements, is useful for assessments of COVID-19 patient outcomes, providing insights on progression of COVID-19 disease severity over time. Conclusion: The ...
H
N3C MACE
dataverse.harvard.edu
Updated Mar 15, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anai Kothari (2023). N3C MACE [Dataset]. http://doi.org/10.7910/DVN/DFSGWU
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/DFSGWU
Dataset updated
Mar 15, 2023
Dataset provided by
Harvard Dataverse
Authors
Anai Kothari
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Supplementary data for N3C MACE study
Representative examples of false negatives for positive mentions of “fever”...
plos.figshare.com
xls
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Representative examples of false negatives for positive mentions of “fever” in N3C COVID corpus, “diarrhea” in UMN PASC corpus and “chest pain” in N3C corpus as returned by BioMedICUS and both LLMs along with explanations. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t007
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0323535.t007
Dataset updated
May 15, 2025
Dataset provided by
PLOShttp://plos.org/
Authors
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Representative examples of false negatives for positive mentions of “fever” in N3C COVID corpus, “diarrhea” in UMN PASC corpus and “chest pain” in N3C corpus as returned by BioMedICUS and both LLMs along with explanations.
anti-N3C(O)N
webbook.nist.gov
Updated Mar 29, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Standards and Technology (2018). anti-N3C(O)N [Dataset]. https://webbook.nist.gov/cgi/cbook.cgi?ID=B1003961
Explore at:
Dataset updated
Mar 29, 2018
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
License
https://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRDhttps://www.nist.gov/open/copyright-fair-use-and-licensing-statements-srd-data-software-and-technical-series-publications#SRD
Description
This page, "anti-N3C(O)N", is part of the NIST Chemistry WebBook. This site and its contents are part of the NIST Standard Reference Data Program.
e
Clostridium sp. N3C
ebi.ac.uk
Updated Aug 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Clostridium sp. N3C [Dataset]. https://www.ebi.ac.uk/interpro/taxonomy/uniprot/1776758
Explore at:
Dataset updated
Aug 16, 2025
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The main entity of this document is a taxonomy with accession number 1776758
w
xn--n3c.com - Historical whois Lookup
whoisdatacenter.com
csv
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AllHeart Web Inc, xn--n3c.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/xn--n3c.com/
Explore at:
csvAvailable download formats
Dataset authored and provided by
AllHeart Web Inc
License
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Time period covered
Mar 15, 1985 - Aug 15, 2025
Description
Explore the historical Whois records related to xn--n3c.com (Domain). Get insights into ownership history and changes over time.
w
xn--reisercktritts-versicherung-n3c.com - Historical whois Lookup
whoisdatacenter.com
csv
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AllHeart Web Inc, xn--reisercktritts-versicherung-n3c.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/xn--reisercktritts-versicherung-n3c.com/
Explore at:
csvAvailable download formats
Dataset authored and provided by
AllHeart Web Inc
License
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Time period covered
Mar 15, 1985 - Oct 8, 2025
Description
Explore the historical Whois records related to xn--reisercktritts-versicherung-n3c.com (Domain). Get insights into ownership history and changes over time.
f
Demographics of various corpora.
plos.figshare.com
xls
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Demographics of various corpora. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0323535.t002
Dataset updated
May 15, 2025
Dataset provided by
PLOS ONE
Authors
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BackgroundPatient symptoms, crucial for disease progression and diagnosis, are often captured in unstructured clinical notes. Large language models (LLMs) offer potential advantages in extracting patient symptoms compared to traditional rule-based information extraction (IE) systems.MethodsThis study compared fine-tuned LLMs (LLaMA2-13B and LLaMA3-8B) against BioMedICUS, a rule-based IE system, for extracting symptoms related to acute and post-acute sequelae of SARS-CoV-2 from clinical notes. The study utilized three corpora: UMN-COVID, UMN-PASC, and N3C-COVID. Prevalence, keyword and fairness analyses were conducted to assess symptom distribution and model equity across demographics.ResultsBioMedICUS outperformed fine-tuned LLMs in most cases. On the UMN PASC dataset, BioMedICUS achieved a macro-averaged F1-score of 0.70 for positive mention detection, compared to 0.66 for LLaMA2-13B and 0.62 for LLaMA3-8B. For the N3C COVID dataset, BioMedICUS scored 0.75, while LLaMA2-13B and LLaMA3-8B scored 0.53 and 0.68, respectively for positive mention detection. However, LLMs performed better in specific instances, such as detecting positive mentions of change in sleep in the UMN PASC dataset, where LLaMA2-13B (0.79) and LLaMA3-8B (0.65) outperformed BioMedICUS (0.60). For fairness analysis, BioMedICUS generally showed stronger performance across patient demographics. Keyword analysis using ANOVA on symptom distributions across all three corpora showed that both corpus (df = 2, p
w
xn--warnemnde-zimmervermittlung-n3c.info - Historical whois Lookup
whoisdatacenter.com
csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AllHeart Web Inc, xn--warnemnde-zimmervermittlung-n3c.info - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/xn--warnemnde-zimmervermittlung-n3c.info/
Explore at:
csvAvailable download formats
Dataset authored and provided by
AllHeart Web Inc
License
https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/
Time period covered
Mar 15, 1985 - Oct 9, 2025
Description
Explore the historical Whois records related to xn--warnemnde-zimmervermittlung-n3c.info (Domain). Get insights into ownership history and changes over time.
Macro-averaged metrics with 95% confidence intervals for evaluation of...
plos.figshare.com
xls
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Macro-averaged metrics with 95% confidence intervals for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B extraction performance in positive and negative symptom mentions for N3C COVID. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t005
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0323535.t005
Dataset updated
May 15, 2025
Dataset provided by
PLOShttp://plos.org/
Authors
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Macro-averaged metrics with 95% confidence intervals for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B extraction performance in positive and negative symptom mentions for N3C COVID.
f
COVID-19 severity indicator codes used in N3C.
plos.figshare.com
xlsx
Updated Jun 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shannon Wuller; Nora G. Singer; Colby Lewis; Elizabeth W. Karlson; Grant S. Schulert; Jason D. Goldman; Jennifer Hadlock; Jonathan Arnold; Kathryn Hirabayashi; Lauren E. Stiles; Lawrence C. Kleinman; Lindsay G. Cowell; Mady Hornig; Margaret A. Hall; Mark G. Weiner; Michael Koropsak; Michelle F. Lamendola-Essel; Rachel Kenney; Richard A. Moffitt; Sajjad Abedian; Shari Esquenazi-Karonika; Steven G. Johnson; Stephenson Stroebel; Zachary S. Wallace; Karen H. Costenbader (2025). COVID-19 severity indicator codes used in N3C. [Dataset]. http://doi.org/10.1371/journal.pone.0324513.s007
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0324513.s007
Dataset updated
Jun 4, 2025
Dataset provided by
PLOS ONE
Authors
Shannon Wuller; Nora G. Singer; Colby Lewis; Elizabeth W. Karlson; Grant S. Schulert; Jason D. Goldman; Jennifer Hadlock; Jonathan Arnold; Kathryn Hirabayashi; Lauren E. Stiles; Lawrence C. Kleinman; Lindsay G. Cowell; Mady Hornig; Margaret A. Hall; Mark G. Weiner; Michael Koropsak; Michelle F. Lamendola-Essel; Rachel Kenney; Richard A. Moffitt; Sajjad Abedian; Shari Esquenazi-Karonika; Steven G. Johnson; Stephenson Stroebel; Zachary S. Wallace; Karen H. Costenbader
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
SARS-CoV-2 infection has been associated with increased autoimmune disease risk. Past studies have not aligned regarding the most prevalent autoimmune diseases after infection, however. Furthermore, the relationship between infection severity and new autoimmune disease risk has not been well examined. We used RECOVER’s electronic health record (EHR) networks, N3C, PCORnet, and PEDSnet, to estimate types and frequency of autoimmune diseases arising after SARS-CoV-2 infection and assessed how infection severity related to autoimmune disease risk. We identified patients of any age with SARS-CoV-2 infection between April 1, 2020 and April 1, 2021, and assigned them to a World Health Organization COVID-19 severity category for adults or the PEDSnet acute COVID-19 illness severity classification system for children (30 days after SARS-CoV-2 infection index date and occurring ≥1 day apart. We calculated overall and infection severity-stratified incidence ratesper 1000 person-years for all autoimmune diseases. With least severe COVID-19 severity as reference, survival analyses examined incident autoimmune disease risk. The most common new-onset autoimmune diseases in all networks were thyroid disease, psoriasis/psoriatic arthritis, and inflammatory bowel disease. Among adults, inflammatory arthritis was the most common, and Sjögren’s disease also had high incidence. Incident type 1 diabetes and hematological autoimmune diseases were specifically found in children. Across networks, after adjustment, patients with highest COVID-19 severity had highest risk for new autoimmune disease vs. those with least severe disease (N3C: adjusted Hazard Ratio, (aHR) 1.47 (95%CI 1.33–1.66); PCORnet aHR 1.14 (95%CI 1.02–1.26); PEDSnet: aHR 3.14 (95%CI 2.42–4.07)]. Overall, severe acute COVID-19 was most strongly associated with autoimmune disease risk in three EHR networks.
Characteristics of cohorts.
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated May 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Asif Rahman; Michael Russell; Wanhong Zheng; Daniel Eckrich; Imtiaz Ahmed (2024). Characteristics of cohorts. [Dataset]. http://doi.org/10.1371/journal.pone.0295891.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0295891.t001
Dataset updated
May 30, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Asif Rahman; Michael Russell; Wanhong Zheng; Daniel Eckrich; Imtiaz Ahmed
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Amid the ongoing global repercussions of SARS-CoV-2, it is crucial to comprehend its potential long-term psychiatric effects. Several recent studies have suggested a link between COVID-19 and subsequent mental health disorders. Our investigation joins this exploration, concentrating on Schizophrenia Spectrum and Psychotic Disorders (SSPD). Different from other studies, we took acute respiratory distress syndrome (ARDS) and COVID-19 lab-negative cohorts as control groups to accurately gauge the impact of COVID-19 on SSPD. Data from 19,344,698 patients, sourced from the N3C Data Enclave platform, were methodically filtered to create propensity matched cohorts: ARDS (n = 222,337), COVID-19 positive (n = 219,264), and COVID-19 negative (n = 213,183). We systematically analyzed the hazard rate of new-onset SSPD across three distinct time intervals: 0-21 days, 22-90 days, and beyond 90 days post-infection. COVID-19 positive patients consistently exhibited a heightened hazard ratio (HR) across all intervals [0-21 days (HR: 4.6; CI: 3.7-5.7), 22-90 days (HR: 2.9; CI: 2.3 -3.8), beyond 90 days (HR: 1.7; CI: 1.5-1.)]. These are notably higher than both ARDS and COVID-19 lab-negative patients. Validations using various tests, including the Cochran Mantel Haenszel Test, Wald Test, and Log-rank Test confirmed these associations. Intriguingly, our data indicated that younger individuals face a heightened risk of SSPD after contracting COVID-19, a trend not observed in the ARDS and COVID-19 negative groups. These results, aligned with the known neurotropism of SARS-CoV-2 and earlier studies, accentuate the need for vigilant psychiatric assessment and support in the era of Long-COVID, especially among younger populations.
f
Concepts categorization by accuracy level.
figshare.com
xls
Updated Jun 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li (2023). Concepts categorization by accuracy level. [Dataset]. http://doi.org/10.1371/journal.pone.0276923.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0276923.t001
Dataset updated
Jun 8, 2023
Dataset provided by
PLOS ONE
Authors
Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Concepts categorization by accuracy level.
The Mitigating Effects of Telehealth Uptake on Disparities in Maternal Care...
icpsr.umich.edu
Updated May 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hung, Peiyin; Li, Xiaoming (2025). The Mitigating Effects of Telehealth Uptake on Disparities in Maternal Care Access, Quality, Outcomes, and Expenditures, United States, 2018-2022 [Dataset]. http://doi.org/10.3886/ICPSR39023.v3
Explore at:
Unique identifier
https://doi.org/10.3886/ICPSR39023.v3
Dataset updated
May 12, 2025
Dataset provided by
Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
Authors
Hung, Peiyin; Li, Xiaoming
License
https://www.icpsr.umich.edu/web/ICPSR/studies/39023/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/39023/terms
Time period covered
2018 - 2022
Area covered
South Carolina, United States
Description
This study explores whether perinatal telehealth uptake has mitigated the pandemic's effects on disparities in maternal care access, quality, and outcomes by race, ethnicity, and rural or urban residence. Research to date has approached this question in several ways. First, researchers have utilized census data to assess whether community-wide broadband infrastructure exists to support the use of telehealth services in areas with high travel times to maternal care units. Findings suggest that socioeconomically disadvantaged communities face significant barriers to maternity care access, both with substantial travel burdens and inadequate digital access to facilitate telehealth services. Second, to examine maternal care quality, researchers have employed South Carolina hospital-based claims data and vital statistics to identify racial, ethnic, and urban/rural disparities in rates of cesarean delivery before and during the COVID-19 pandemic period. Results indicate that cesarean rates differed by rural vs. urban facility locations and racial and ethnic groups but observed disparities were not significantly exacerbated by the pandemic. Third, using South Carolina hospital-based claims data and COVID-19 testing data, researchers found significant racial, ethnic, and rural disparities in postpartum readmissions involving mental health and substance use disorders from childbirth discharge through one year postpartum during the COVID-19 pandemic. Finally, drawing on data from the National COVID Cohort Collaborative (N3C), research has shown that hybrid care increased substantially during the COVID-19 public health emergency, but pregnant people living in rural areas had lower levels of hybrid care than urban people, and individuals who belonged to racial and ethnic minority groups were more likely to have hybrid care than White individuals. Future research will investigate the impact of the COVID-19 pandemic and perinatal telehealth uptake on additional maternity care and birth outcomes by race, ethnicity, and urbanicity. The study also aims to assess how state-level telehealth policies relate to perinatal telehealth uptake by race, ethnicity, and urbanicity, and to develop a model to predict long-term changes in maternal care access, quality, outcomes, and expenditures, with and without state telehealth policies. The ICPSR provides variable-level metadata for the data associated with this study. The actual data may only be available from the Principal Investigator directly. The variable descriptions available through ICPSR also include information regarding the source of each variable listed, as does the Data Source field of these metadata.
f
Confusion matrix of the accuracy rating for the performance of the GA...
plos.figshare.com
xls
Updated Jun 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li (2023). Confusion matrix of the accuracy rating for the performance of the GA algorithm. [Dataset]. http://doi.org/10.1371/journal.pone.0276923.t004
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0276923.t004
Dataset updated
Jun 11, 2023
Dataset provided by
PLOS ONE
Authors
Tianchu Lyu; Chen Liang; Jihong Liu; Berry Campbell; Peiyin Hung; Yi-Wen Shih; Nadia Ghumman; Xiaoming Li
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Confusion matrix of the accuracy rating for the performance of the GA algorithm.
f
Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and...
plos.figshare.com
xls
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli (2025). Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B equity for race and gender in positive (+) and negative (-) symptom mentions for UMN PASC. [Dataset]. http://doi.org/10.1371/journal.pone.0323535.t006
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0323535.t006
Dataset updated
May 15, 2025
Dataset provided by
PLOS ONE
Authors
Vedansh Thakkar; Greg M. Silverman; Abhinab Kc; Nicholas E. Ingraham; Emma K. Jones; Samantha King; Genevieve B. Melton; Rui Zhang; Christopher J. Tignanelli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and LLaMA3-8B equity for race and gender in positive (+) and negative (-) symptom mentions for UMN PASC.

Facebook

Twitter

Click to copy link

Link copied

Cite

United States - National Center for Advancing Translational Sciences (NCATS) (2025). National COVID Cohort Collaborative Data Enclave [Dataset]. https://datacatalog.med.nyu.edu/dataset/10384

National COVID Cohort Collaborative Data Enclave

N3C Data Enclave

Explore at:

59 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

Aug 6, 2025

Dataset authored and provided by

United States - National Center for Advancing Translational Sciences (NCATS)

Time period covered

Jan 1, 2020 - Present

Area covered

United States

Description

The National Center for Advancing Translational Sciences (NCATS) has systematically compiled clinical, laboratory and diagnostic data from electronic health records to support COVID-19 research efforts via the National COVID Cohort Collaborative (N3C) Data Enclave. As of August 2, 2022, the repository contains information from over 15 million patients (including 5.8 million COVID-19 positive patients) across the United States.

The N3C Data Enclave is organized into 3 levels of data with varying access restrictions:

Synthetic dataset: Contains no protected health information (PHI). This is a statistically-comparable artificial dataset derived from the original dataset.
- Can be requested by: Researchers from US-based or foreign institutions, and citizen scientists
De-identified dataset: Contains no PHI. This dataset consists of real patient data with shifted dates of service and truncated ZIP codes of patients residing in areas with populations above 20,000.
- Can be requested by: Researchers from US-based or foreign institutions
Limited Data Set (LDS): Contains 2 PHI elements (dates of service and patient ZIP code). This dataset consists of real patient data.
- Can be requested by: Researchers from US-based institutions only

Clear search

Close search

Google apps

Main menu

National COVID Cohort Collaborative Data Enclave

NCATS National COVID Cohort Collaborative (N3C) Data Enclave

N3C-Formatted OMOP2OBO Mappings

Baseline characteristics of all patients in N3C receiving 2 doses of mRNA...

Data from: An ordinal severity scale for COVID-19 retrospective studies...

N3C MACE

Representative examples of false negatives for positive mentions of “fever”...

anti-N3C(O)N

Clostridium sp. N3C

xn--n3c.com - Historical whois Lookup

xn--reisercktritts-versicherung-n3c.com - Historical whois Lookup

Demographics of various corpora.

xn--warnemnde-zimmervermittlung-n3c.info - Historical whois Lookup

Macro-averaged metrics with 95% confidence intervals for evaluation of...

COVID-19 severity indicator codes used in N3C.

Characteristics of cohorts.

Concepts categorization by accuracy level.

The Mitigating Effects of Telehealth Uptake on Disparities in Maternal Care...

Confusion matrix of the accuracy rating for the performance of the GA...

Macro-averaged metrics for evaluation of BioMedICUS’, LLaMA2-13B, and...

National COVID Cohort Collaborative Data Enclave

N3C Data Enclave