13 datasets found
  1. d

    Database of Genotype and Phenotype (dbGaP)

    • catalog.data.gov
    • data.virginia.gov
    • +1more
    Updated Jun 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (2025). Database of Genotype and Phenotype (dbGaP) [Dataset]. https://catalog.data.gov/dataset/database-of-genotype-and-phenotype-dbgap
    Explore at:
    Dataset updated
    Jun 19, 2025
    Dataset provided by
    National Library of Medicine
    Description

    Database of Genotype and Phenotype (dbGaP) was developed to archive and distribute the data and results from studies that have investigated the interaction of genotype and phenotype in Humans.

  2. r

    NCBI database of Genotypes and Phenotypes (dbGap)

    • rrid.site
    • scicrunch.org
    Updated Jul 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). NCBI database of Genotypes and Phenotypes (dbGap) [Dataset]. http://identifiers.org/RRID:SCR_002709
    Explore at:
    Dataset updated
    Jul 13, 2025
    Description

    Database developed to archive and distribute clinical data and results from studies that have investigated interaction of genotype and phenotype in humans. Database to archive and distribute results of studies including genome-wide association studies, medical sequencing, molecular diagnostic assays, and association between genotype and non-clinical traits.

  3. d

    NCBI dbGaP

    • datadiscoverystudio.org
    resource url
    Updated 2008
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2008). NCBI dbGaP [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/af8685d4ea5b41f1b9a6c8a5fe7473c1/html
    Explore at:
    resource urlAvailable download formats
    Dataset updated
    2008
    Description

    Link Function: information

  4. Database of Genotype and Phenotype (dbGaP) - th78-z3aq - Archive Repository

    • healthdata.gov
    application/rdfxml +5
    Updated Jun 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Database of Genotype and Phenotype (dbGaP) - th78-z3aq - Archive Repository [Dataset]. https://healthdata.gov/dataset/Database-of-Genotype-and-Phenotype-dbGaP-th78-z3aq/7f2m-hztq
    Explore at:
    application/rssxml, tsv, csv, xml, json, application/rdfxmlAvailable download formats
    Dataset updated
    Jun 28, 2025
    Description

    This dataset tracks the updates made on the dataset "Database of Genotype and Phenotype (dbGaP)" as a repository for previous versions of the data and metadata.

  5. f

    Quickly identifying identical and closely related subjects in large...

    • plos.figshare.com
    xlsx
    Updated May 31, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yumi Jin; Alejandro A. Schäffer; Stephen T. Sherry; Michael Feolo (2023). Quickly identifying identical and closely related subjects in large databases using genotype data [Dataset]. http://doi.org/10.1371/journal.pone.0179106
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yumi Jin; Alejandro A. Schäffer; Stephen T. Sherry; Michael Feolo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Genome-wide association studies (GWAS) usually rely on the assumption that different samples are not from closely related individuals. Detection of duplicates and close relatives becomes more difficult both statistically and computationally when one wants to combine datasets that may have been genotyped on different platforms. The dbGaP repository at the National Center of Biotechnology Information (NCBI) contains datasets from hundreds of studies with over one million samples. There are many duplicates and closely related individuals both within and across studies from different submitters. Relationships between studies cannot always be identified by the submitters of individual datasets. To aid in curation of dbGaP, we developed a rapid statistical method called Genetic Relationship and Fingerprinting (GRAF) to detect duplicates and closely related samples, even when the sets of genotyped markers differ and the DNA strand orientations are unknown. GRAF extracts genotypes of 10,000 informative and independent SNPs from genotype datasets obtained using different methods, and implements quick algorithms that enable it to find all of the duplicate pairs from more than 880,000 samples within and across dbGaP studies in less than two hours. In addition, GRAF uses two statistical metrics called All Genotype Mismatch Rate (AGMR) and Homozygous Genotype Mismatch Rate (HGMR) to determine subject relationships directly from the observed genotypes, without estimating probabilities of identity by descent (IBD), or kinship coefficients, and compares the predicted relationships with those reported in the pedigree files. We implemented GRAF in a freely available C++ program of the same name. In this paper, we describe the methods in GRAF and validate the usage of GRAF on samples from the dbGaP repository. Other scientists can use GRAF on their own samples and in combination with samples downloaded from dbGaP.

  6. f

    Mean HGMR and AGMR values and correlation coefficients between HGMR and AGMR...

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yumi Jin; Alejandro A. Schäffer; Stephen T. Sherry; Michael Feolo (2023). Mean HGMR and AGMR values and correlation coefficients between HGMR and AGMR of all related subjects reported in the data files submitted to dbGaP. [Dataset]. http://doi.org/10.1371/journal.pone.0179106.t008
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yumi Jin; Alejandro A. Schäffer; Stephen T. Sherry; Michael Feolo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mean HGMR and AGMR values and correlation coefficients between HGMR and AGMR of all related subjects reported in the data files submitted to dbGaP.

  7. f

    Predicted HGMR values and standard deviations for different types of...

    • plos.figshare.com
    xls
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yumi Jin; Alejandro A. Schäffer; Stephen T. Sherry; Michael Feolo (2023). Predicted HGMR values and standard deviations for different types of relationships assuming allele frequencies are evenly distributed between 0.1 and 0.9. [Dataset]. http://doi.org/10.1371/journal.pone.0179106.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yumi Jin; Alejandro A. Schäffer; Stephen T. Sherry; Michael Feolo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Predicted HGMR values and standard deviations for different types of relationships assuming allele frequencies are evenly distributed between 0.1 and 0.9.

  8. o

    Center for Common Disease Genomics (CCDG) Neuropsychiatric: Autism Center of...

    • explore.openaire.eu
    • omicsdi.org
    Updated Oct 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Center for Common Disease Genomics (CCDG) Neuropsychiatric: Autism Center of Excellence (ACE II) [Dataset]. https://explore.openaire.eu/search/dataset?datasetId=_OmicsDI::2c407f364a0b8e595b6c2b9472e03611
    Explore at:
    Dataset updated
    Oct 20, 2024
    Description

    In this study, we address the enormous challenges common complex diseases pose for genomic analysis and the enormous opportunities surmounting them offers for advancing healthcare. The common genetic disorders proposed for study here are believed to have extreme locus heterogeneity, requiring the analysis of large numbers of samples to comprehensively identify the genomic variants underlying them. We propose that a combination of deep population studies and joint analysis of SNPs, indels, and structural variants, both in coding and noncoding regions, will provide the next level of understanding of common genetic disorders. Whole genome sequencing (WGS) will be critical to this next-generation approach to the genomics of complex disease. WGS will need to be accompanied by the technical ability to generate and handle very large data sets, a particular focus and strength of NYGC. WGS will also need to be accompanied by new statistical tools and algorithms... (for more see dbGaP study page.)

  9. d

    NIMH Repository and Genomics Resources (RGR)

    • catalog.data.gov
    • healthdata.gov
    • +3more
    Updated Jul 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). NIMH Repository and Genomics Resources (RGR) [Dataset]. https://catalog.data.gov/dataset/nimh-repository-and-genomics-resources-rgr
    Explore at:
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    The NIMH Repository and Genomics Resource (RGR) stores biosamples, genetic, pedigree and clinical data collected in designated NIMH-funded human subject studies. The RGR database likewise links to other repositories holding data from the same subjects, including dbGAP, GEO and NDAR. The NIMH RGR allows the broader research community to access these data and biospecimens (e.g., lymphoblastoid cell lines, induced pluripotent cell lines, fibroblasts) and further expand the genetic and molecular characterization of patient populations with severe mental illness.

  10. Subtypes of Stage 1 GWAS samples.

    • plos.figshare.com
    txt
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Dapas; Frederick T. J. Lin; Girish N. Nadkarni; Ryan Sisk; Richard S. Legro; Margrit Urbanek; M. Geoffrey Hayes; Andrea Dunaif (2023). Subtypes of Stage 1 GWAS samples. [Dataset]. http://doi.org/10.1371/journal.pmed.1003132.s006
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Matthew Dapas; Frederick T. J. Lin; Girish N. Nadkarni; Ryan Sisk; Richard S. Legro; Margrit Urbanek; M. Geoffrey Hayes; Andrea Dunaif
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Subtypes are provided for each of the 555 Stage 1 GWAS samples included in the clustering analysis according to their dbGaP SUBJIDs. dbGaP, database of Genotypes and Phenotypes; GWAS, genome-wide association study; SUBJID, subject ID. (TXT)

  11. V

    Phenotype-Genotype Integrator (PheGenI)

    • data.virginia.gov
    • datadiscovery.nlm.nih.gov
    • +3more
    html
    Updated Jun 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (2025). Phenotype-Genotype Integrator (PheGenI) [Dataset]. https://data.virginia.gov/dataset/phenotype-genotype-integrator-phegeni
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jun 18, 2025
    Dataset provided by
    National Library of Medicine
    Description

    Supports finding human phenotype/genotype relationships with queries by phenotype, chromosome location, gene, and SNP identifiers. Currently includes information from dbGaP, the National Human Genome Research Institute (NHGRI) genome-wide association study (GWAS) Catalog, and Genotype - Tissue Expression (GTeX).

  12. o

    Implementation, Adoption, and Utility of Family History in Diverse Care...

    • explore.openaire.eu
    Updated Oct 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Implementation, Adoption, and Utility of Family History in Diverse Care Settings [Dataset]. https://explore.openaire.eu/search/dataset?datasetId=_OmicsDI::4faa4ed3887035b687fc8ddcfe296b6a
    Explore at:
    Dataset updated
    Oct 12, 2024
    Description

    The purpose of this study is to address the key question of whether and how family health history (FHH) is adopted as a tool to more efficiently manage patients at risk for breast, colon, ovarian, and hereditary cancer syndromes as well as thrombophilia and coronary heart disease (CHD) and to provide evidence supporting clinical utility -- improved health behaviors in patients and physician screening recommendations. Five health care delivery organizations will participate in this demonstration project: Duke University, the Medical College of Wisconsin, the Air Force, Essentia Health, University of North Texas. Duke will serve as a coordinating center for this project (Pro00043372) as well as a site. Healthcare Effectiveness Data and Information Set (HEDIS) measures as intermediate clinical effectiveness measures for Coronary Heart Disease (CHD) and selected cancers as well as survey/formative data and electronic medical record (EMR) data will be used... (for more see dbGaP study page.)

  13. f

    LD matrices associated with publication "GWAS meta-analysis of psoriasis...

    • kcl.figshare.com
    bin
    Updated Dec 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nick Dand; Philip E. Stuart; Lam C. Tsoi; Michael A. Simpson; James T. Elder (2024). LD matrices associated with publication "GWAS meta-analysis of psoriasis identifies new susceptibility alleles impacting disease mechanisms and therapeutic targets" [Dataset]. http://doi.org/10.18742/27982057.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Dec 11, 2024
    Dataset provided by
    King's College London
    Authors
    Nick Dand; Philip E. Stuart; Lam C. Tsoi; Michael A. Simpson; James T. Elder
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Preprint: Dand N, Stuart PE, et al., GWAS meta-analysis of psoriasis identifies new susceptibility alleles impacting disease mechanisms and therapeutic targets, medRxiv. 2023 Oct 5:2023.10.04.23296543. doi: 10.1101/2023.10.04.23296543. PMID: 37873414Abstract: Psoriasis is a common, debilitating immune-mediated skin disease. Genetic studies have identified biological mechanisms of psoriasis risk, including those targeted by effective therapies. However, the genetic liability to psoriasis is not fully explained by variation at robustly identified risk loci. To refine the genetic map of psoriasis susceptibility we meta-analysed 18 GWAS comprising 36,466 cases and 458,078 controls and identified 109 distinct psoriasis susceptibility loci, including 46 that have not been previously reported. These include susceptibility variants at loci in which the therapeutic targets IL17RA and AHR are encoded, and deleterious coding variants supporting potential new drug targets (including in STAP2, CPVL and POU2F3). We conducted a transcriptome-wide association study to identify regulatory effects of psoriasis susceptibility variants and cross-referenced these against single cell expression profiles in psoriasis-affected skin, highlighting roles for the transcriptional regulation of haematopoietic cell development and epigenetic modulation of interferon signalling in psoriasis pathobiology.This dataset: This study used a custom LD reference panel comprising six GWAS datasets. Individual level genotype data for the CASP GWAS, PsA GWAS, and Exomechip case-control studies are available on dbGaP (dbGaP: phs000019.v1.p1 [https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000019.v1.p1], phs000982.v1.p1 [http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000982.v1.p1], and phs001306.v1.p1 [http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001306.v1.p1]), and WTCCC2 genotype data are archived at the European Genome-Phenome Archive (study ID EGAS00000000108 [https://ega-archive.org/studies/EGAS00000000108]). Data sharing restrictions do not allow making genotype data publicly available for the remaining two case-control cohorts. However, LD matrices based on the full reference panel for all 109 susceptibility loci have been deposited in the King’s Open Research Data System.

  14. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Library of Medicine (2025). Database of Genotype and Phenotype (dbGaP) [Dataset]. https://catalog.data.gov/dataset/database-of-genotype-and-phenotype-dbgap

Database of Genotype and Phenotype (dbGaP)

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jun 19, 2025
Dataset provided by
National Library of Medicine
Description

Database of Genotype and Phenotype (dbGaP) was developed to archive and distribute the data and results from studies that have investigated the interaction of genotype and phenotype in Humans.

Search
Clear search
Close search
Google apps
Main menu