29 datasets found
  1. f

    ukbtools: An R package to manage and query UK Biobank data

    • plos.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ken B. Hanscombe; Jonathan R. I. Coleman; Matthew Traylor; Cathryn M. Lewis (2023). ukbtools: An R package to manage and query UK Biobank data [Dataset]. http://doi.org/10.1371/journal.pone.0214311
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ken B. Hanscombe; Jonathan R. I. Coleman; Matthew Traylor; Cathryn M. Lewis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionThe UK Biobank (UKB) is a resource that includes detailed health-related data on about 500,000 individuals and is available to the research community. However, several obstacles limit immediate analysis of the data: data files vary in format, may be very large, and have numerical codes for column names.Resultsukbtools removes all the upfront data wrangling required to get a single dataset for statistical analysis. All associated data files are merged into a single dataset with descriptive column names. The package also provides tools to assist in quality control by exploring the primary demographics of subsets of participants; query of disease diagnoses for one or more individuals, and estimating disease frequency relative to a reference variable; and to retrieve genetic metadata.ConclusionHaving a dataset with meaningful variable names, a set of UKB-specific exploratory data analysis tools, disease query functions, and a set of helper functions to explore and write genetic metadata to file, will rapidly enable UKB users to undertake their research.

  2. o

    Data from: Summary-Level Data From Meta-Analysis Of Fat Distribution...

    • explore.openaire.eu
    • zenodo.org
    Updated May 23, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sara L Pulit (2018). Summary-Level Data From Meta-Analysis Of Fat Distribution Phenotypes In Uk Biobank And Giant [Dataset]. http://doi.org/10.5281/zenodo.1251812
    Explore at:
    Dataset updated
    May 23, 2018
    Authors
    Sara L Pulit
    Description

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Summary-level data as presented in: "Meta-analysis of genome-wide association studies for body fat distribution in 694,649 individuals of European ancestry." Pulit, SL et al. bioRxiv, 2018. https://www.biorxiv.org/content/early/2018/04/18/304030 **If you use these data, please cite the above preprint. If you have any questions or comments regarding these files, please contact me: Sara L Pulit spulit@well.ox.ac.uk or s.l.pulit@umcutrecht.nl ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ (1) Data files i. whradjbmi.giant-ukbb.meta-analysis.combined.23May2018.txt Meta-analysis of waist-to-hip ratio adjusted for body mass index (whradjbmi) in UK Biobank and GIANT data. Combined set of samples, max N = 694,649. ii. whradjbmi.giant-ukbb.meta-analysis.females.23May2018.txt Meta-analysis of whradjbmi in UK Biobank and GIANT data. Female samples only, max N = 379,501. iii. whradjbmi.giant-ukbb.meta-analysis.males.23May2018.txt Meta-analysis of whradjbmi in UK Biobank and GIANT data. Male samples only, max N = 315,284. iv. whr.giant-ukbb.meta-analysis.combined.23May2018.txt Meta-analysis of waist-to-hip ratio (whr) in UK Biobank and GIANT data. Combined set of samples, max N = 697,734. v. whr.giant-ukbb.meta-analysis.females.23May2018.txt Meta-analysis of whr in UK Biobank and GIANT data. Female samples only, max N = 381,152. vi. whr.giant-ukbb.meta-analysis.males.23May2018.txt Meta-analysis of whr in UK Biobank and GIANT data. Male samples only, max N = 316,772. vii. bmi.giant-ukbb.meta-analysis.combined.23May2018.txt Meta-analysis of body mass index (bmi) in UK Biobank and GIANT data. Combined set of samples, max N = 806,834. viii. bmi.giant-ukbb.meta-analysis.females.23May2018.txt Meta-analysis of bmi in UK Biobank and GIANT data. Female samples only, max N = 434,794. ix. bmi.giant-ukbb.meta-analysis.males.23May2018.txt Meta-analysis of bmi in UK Biobank and GIANT data. Male samples only, max N = 374,756. (2) Data file format CHR: Chromosome POS: Chromosomal position of the SNP, build hg19 SNP: the dbSNP151 identifier of the SNP, followed by the first allele and second allele of the SNP, delimited with a colon. A small number of SNPs (<9,000) from the GIANT data had no dbSNP151 identifier, and are left as just an rsID. Note that these SNPs are also missing chromosome and position information (not provided in the GIANT data). Tested_Allele: the allele for which all association statistics are reported Other_Allele: the other allele at the SNP Freq_Tested_Allele: frequency of the tested allele BETA: the effect size of the tested allele SE: the standard error of the beta P: the p-value of the SNP, as reported from the inverse variance-weighted fixed effects meta-analysis N: the total sample size for this SNP INFO: the imputation quality (info score) of the SNP, as reported by UK Biobank. A number between 0 and 1 indicating quality of imputation (0, poor quality; 1, high quality or genotyped). Note that the summary-level GIANT data does not report info score, so SNPs appearing only in the GIANT analysis do not have info scores.

  3. E

    Data from: Genome-wide meta-analysis of depression identifies 102...

    • dtechtive.com
    • find.data.gov.scot
    txt
    Updated Oct 24, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Edinburgh (2018). Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions [Dataset]. http://doi.org/10.7488/ds/2458
    Explore at:
    txt(0.0166 MB), txt(0.0023 MB), txt(347.4 MB), txt(0.4338 MB)Available download formats
    Dataset updated
    Oct 24, 2018
    Dataset provided by
    University of Edinburgh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Major depression is a debilitating psychiatric illness that is typically associated with low mood, anhedonia and a range of comorbidities. Depression has a heritable component that has remained difficult to elucidate with current sample sizes due to the polygenic nature of the disorder. To maximise sample size, we meta-analysed data on 807,553 individuals (246,363 cases and 561,190 controls) from the three largest genome-wide association studies of depression. We identified 102 independent variants, 269 genes, and 15 gene-sets associated with depression, including both genes and gene-pathways associated with synaptic structure and neurotransmission. Further evidence of the importance of prefrontal brain regions in depression was provided by an enrichment analysis. In an independent replication sample of 1,306,354 individuals (414,055 cases and 892,299 controls), 87 of the 102 associated variants were significant following multiple testing correction. Based on the putative genes associated with depression this work also highlights several potential drug repositioning opportunities. These findings advance our understanding of the complex genetic architecture of depression and provide several future avenues for understanding aetiology and developing new treatment approaches. The data contained in this item is described in a published manuscript located at https://doi.org/10.1038/s41593-018-0326-7.

  4. f

    Aggregated UK Biobank clinical assessments and neuroimaging biomarkers.

    • figshare.com
    xls
    Updated Jun 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Simeone Marino; Yi Zhao; Nina Zhou; Yiwang Zhou; Arthur W. Toga; Lu Zhao; Yingsi Jian; Yichen Yang; Yehu Chen; Qiucheng Wu; Jessica Wild; Brandon Cummings; Ivo D. Dinov (2023). Aggregated UK Biobank clinical assessments and neuroimaging biomarkers. [Dataset]. http://doi.org/10.1371/journal.pone.0228520.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 14, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Simeone Marino; Yi Zhao; Nina Zhou; Yiwang Zhou; Arthur W. Toga; Lu Zhao; Yingsi Jian; Yichen Yang; Yehu Chen; Qiucheng Wu; Jessica Wild; Brandon Cummings; Ivo D. Dinov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Aggregated UK Biobank clinical assessments and neuroimaging biomarkers.

  5. Summary statistics of the UK Biobank + Estonian Biobank sex-combined GWAS...

    • zenodo.org
    tar
    Updated May 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dennis van der Meer; Dennis van der Meer; Alexey Shadrin; Alexey Shadrin (2025). Summary statistics of the UK Biobank + Estonian Biobank sex-combined GWAS meta-analysis for the Nightingale panel of 249 circulating plasma metabolic markers from "Pleiotropic and sex-specific genetic architecture of circulating metabolic markers" [https://doi.org/10.1101/2024.07.30.24311254]. [Dataset]. http://doi.org/10.5281/zenodo.15420219
    Explore at:
    tarAvailable download formats
    Dataset updated
    May 14, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Dennis van der Meer; Dennis van der Meer; Alexey Shadrin; Alexey Shadrin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    "meta.tar" archive contains 249 files representing UK Biobank + Estonian Biobank sex-combined GWAS meta-analysis summary statistics for the Nightingale panel of 249 circulating plasma metabolic markers presented in "Pleiotropic and sex-specific genetic architecture of circulating metabolic markers" [https://doi.org/10.1101/2024.07.30.24311254].

    Each file contains nine columns:

    SNP: ID of the genetic marker;

    CHR: chromosome code (GRCh37 genomic build);

    BP: base-pair coordinate (GRCh37 genomic build);

    PVAL: regression p-value;

    A1: effect allele;

    A2: other allele;

    N: sample size;

    BETA: regression coefficient for effect allele (A1);

    SE: standard error of regression coefficient (BETA).

  6. E

    MM GWAS dataset

    • ega-archive.org
    Updated Jul 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). MM GWAS dataset [Dataset]. https://ega-archive.org/datasets/EGAD50000000422
    Explore at:
    Dataset updated
    Jul 22, 2024
    License

    https://ega-archive.org/dacs/EGAC50000000050https://ega-archive.org/dacs/EGAC50000000050

    Description

    This is a meta-analysis of myeloma datasets, both with and without the UK Biobank cohort included.

  7. f

    Library of the 55 different classification and regression machine-learning...

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Simeone Marino; Yi Zhao; Nina Zhou; Yiwang Zhou; Arthur W. Toga; Lu Zhao; Yingsi Jian; Yichen Yang; Yehu Chen; Qiucheng Wu; Jessica Wild; Brandon Cummings; Ivo D. Dinov (2023). Library of the 55 different classification and regression machine-learning algorithms used by the ensemble predictor SuperLearner (SL.library) in the CBDA 2.0 implementation. [Dataset]. http://doi.org/10.1371/journal.pone.0228520.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Simeone Marino; Yi Zhao; Nina Zhou; Yiwang Zhou; Arthur W. Toga; Lu Zhao; Yingsi Jian; Yichen Yang; Yehu Chen; Qiucheng Wu; Jessica Wild; Brandon Cummings; Ivo D. Dinov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Library of the 55 different classification and regression machine-learning algorithms used by the ensemble predictor SuperLearner (SL.library) in the CBDA 2.0 implementation.

  8. f

    Subsampling specifications for different Big Data sizes.

    • plos.figshare.com
    xls
    Updated Aug 28, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Simeone Marino; Yi Zhao; Nina Zhou; Yiwang Zhou; Arthur W. Toga; Lu Zhao; Yingsi Jian; Yichen Yang; Yehu Chen; Qiucheng Wu; Jessica Wild; Brandon Cummings; Ivo D. Dinov (2020). Subsampling specifications for different Big Data sizes. [Dataset]. http://doi.org/10.1371/journal.pone.0228520.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Aug 28, 2020
    Dataset provided by
    PLOS ONE
    Authors
    Simeone Marino; Yi Zhao; Nina Zhou; Yiwang Zhou; Arthur W. Toga; Lu Zhao; Yingsi Jian; Yichen Yang; Yehu Chen; Qiucheng Wu; Jessica Wild; Brandon Cummings; Ivo D. Dinov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The total number of subsamples M = 5,000.

  9. Genome-wide association summary statistics for back pain

    • zenodo.org
    txt
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maxim B Freidin; Yakov A Tsepilov; Yakov A Tsepilov; Melody Palmer; Lennart Karssen; Lennart Karssen; CHARGE Musculoskeletal Working Group; Pradeep Suri; Yurii S Aulchenko; Yurii S Aulchenko; Frances MK Williams; Maxim B Freidin; Melody Palmer; CHARGE Musculoskeletal Working Group; Pradeep Suri; Frances MK Williams (2020). Genome-wide association summary statistics for back pain [Dataset]. http://doi.org/10.5281/zenodo.1319332
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Maxim B Freidin; Yakov A Tsepilov; Yakov A Tsepilov; Melody Palmer; Lennart Karssen; Lennart Karssen; CHARGE Musculoskeletal Working Group; Pradeep Suri; Yurii S Aulchenko; Yurii S Aulchenko; Frances MK Williams; Maxim B Freidin; Melody Palmer; CHARGE Musculoskeletal Working Group; Pradeep Suri; Frances MK Williams
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains results of a genome-wide association study of back pain. Two files contain association summary statistics for discovery GWAS based on the analysis of 350,000 white British individuals from the UK Biobank and meta-analysis GWAS based on the meta-analysis of the same 350,000 individuals and additional 103,862 individuals of European Ancestry from the UK biobank (total N = 453,862). The phenotype of back pain was defined by the answer provided by the UK biobank participants to the following question: "Pain type(s) experienced in last month". Those who reported “Back pain”, were considered as cases, all the rest were considered as controls. Individuals who did not reply or replied: "Prefer not to answer" or "Pain all over the body" were excluded. This dataset is also available for graphical exploration in the genomic context at http://gwasarchive.org.

    The data are provided on an "AS-IS" basis, without warranty of any type, expressed or implied, including but not limited to any warranty as to their performance, merchantability, or fitness for any particular purpose. If investigators use these data, any and all consequences are entirely their responsibility. By downloading and using these data, you agree that you will cite the appropriate publication in any communications or publications arising directly or indirectly from these data; for utilisation of data available prior to publication, you agree to respect the requested responsibilities of resource users under 2003 Fort Lauderdale principles; you agree that you will never attempt to identify any participant. This research has been conducted using the UK Biobank Resource and the use of the data is guided by the principles formulated by the UK Biobank.

    When using downloaded data, please cite corresponding paper and this repository:

    1. Insight into the genetic architecture of back pain and its risk factors from a study of 509,000 individuals. Freidin, Maxim; Tsepilov, Yakov; Palmer, Melody; Karssen, Lennart; Suri, Pradeep; Aulchenko, Yurii; Williams, Frances MK,# CHARGE Musculoskeletal Working Group. PAIN: February 06, 2019 - Volume Articles in Press - Issue - p
      doi: 10.1097/j.pain.0000000000001514
    2. Maxim B Freidin, Yakov A Tsepilov, Melody Palmer, Lennart Karssen, CHARGE Musculoskeletal Working Group, Pradeep Suri, … Frances MK Williams. (2018). Genome-wide association summary statistics for back pain (Version 1) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.1319332

    Funding:

    This study was supported by the European Community’s Seventh Framework Programme funded project PainOmics (Grant agreement # 602736).
    The research has been conducted using the UK Biobank Resource (project # 18219).

    The development of software implementing SMR/HEIDI test and database for GWAS results was supported by the Russian Ministry of Science and Education under the 5-100 Excellence Program”.

    Dr. Suri’s time for this work was supported by VA Career Development Award # 1IK2RX001515 from the United States (U.S.) Department of Veterans Affairs Rehabilitation Research and Development Service. The contents of this work do not represent the views of the U.S. Department of Veterans Affairs or the United States Government.

    Dr. Tsepilov’s time for this work was supported in part by the Russian Ministry of Science and Education under the 5-100 Excellence Program.

    Column headers - discovery (350K)

    1. CHR: chromosome
    2. POS: position (GRCh37 build)
    3. ID: SNP rsID
    4. REF: reference allele (coded as "0")
    5. ALT: effect allele (coded as "1")
    6. CASE_ALLELE_CT: allele observation count in cases
    7. CTRL_ALLELE_CT: allele observation count in controls
    8. ALT_FREQ: effect allele frequency
    9. MACH_R2: imputation quality
    10. TEST: model of association test (additive)
    11. OBS_CT: sample size
    12. BETA: effect size of effect allele
    13. SE: standard error of effect size
    14. T_STAT: Z-value of effect allele
    15. P: P-value of association (without GC correction)
    16. MAF: minor allele frequency

    Column headers - meta-analysis (450K)

    1. MarkerName: SNP rsID
    2. Allele1: effect allele (coded as "1")
    3. Allele2: reference allele (coded as "0")
    4. Freq1: effect allele frequency
    5. FreqSE: standard error of effect allele frequency
    6. Effect: effect size of effect allele
    7. StdErr: standard error of effect size
    8. P-value: P-value of association (without GC correction)
    9. Direction: sign of effect in discovery and replication samples
    10. n_total: Total sample size
    11. CHR: chromosome
    12. POS: position (GRCh37 build)
    13. MACH_R2_discovery: imputation quality in discovery sample
  10. f

    Anxiety-Disorders-plink.meta_P_Sorted.gz

    • figshare.com
    application/gzip
    Updated Oct 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xinglun Dang (2023). Anxiety-Disorders-plink.meta_P_Sorted.gz [Dataset]. http://doi.org/10.6084/m9.figshare.23659653.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Oct 27, 2023
    Dataset provided by
    figshare
    Authors
    Xinglun Dang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    GWAS summary statistics from anxiety disorders meta-analysis (combining genome-wide associations from UK Biobank, iPSYCH, ANGST, MVP, and FINNGEN)

  11. d

    Data from: Multi-ancestry meta-analysis of host genetic susceptibility to...

    • datadryad.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Nov 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haiko Schurz; Vivek Naranbhai; Tom A. Yates; James J. Gilchrist; Tom Parks; Peter J. Dodd; Marlo Möller; Eileen G. Hoal; Andrew P. Morris; Adrian V. S. Hill (2022). Multi-ancestry meta-analysis of host genetic susceptibility to tuberculosis identifies shared genetic architecture [Dataset]. http://doi.org/10.5061/dryad.6wwpzgn2s
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 21, 2022
    Dataset provided by
    Dryad
    Authors
    Haiko Schurz; Vivek Naranbhai; Tom A. Yates; James J. Gilchrist; Tom Parks; Peter J. Dodd; Marlo Möller; Eileen G. Hoal; Andrew P. Morris; Adrian V. S. Hill
    Time period covered
    2022
    Description

    Summary statistics are stored as compressed .zip files. Unzipped files can be opened with any text editor, Excel, or read into R programming environment (or equivalent data analysis software)' note the files are space-separated and have a header row. Each row in the files contains information for a single SNP. All authors have approved sharing the summary statistics of the analysis. The original individual datasets are only available from request from the original authors.

  12. A total of 12 novel testosterone-associated loci with leading bi-allelic...

    • plos.figshare.com
    xls
    Updated May 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Boxi Lin; Andrew D. Paterson; Lei Sun (2024). A total of 12 novel testosterone-associated loci with leading bi-allelic SNPs uniquely identified by the recommended T2,metaQ but missed by any other methods in the UK Biobank data application. [Dataset]. http://doi.org/10.1371/journal.pgen.1011221.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 6, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Boxi Lin; Andrew D. Paterson; Lei Sun
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A total of 12 novel testosterone-associated loci with leading bi-allelic SNPs uniquely identified by the recommended T2,metaQ but missed by any other methods in the UK Biobank data application.

  13. m

    Data from: Multi-Ancestry Whole Genome Sequencing Analysis of Lean Body Mass...

    • data.mendeley.com
    Updated Feb 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaoyu Zhang (2025). Multi-Ancestry Whole Genome Sequencing Analysis of Lean Body Mass [Dataset]. http://doi.org/10.17632/mk32tnmwwt.1
    Explore at:
    Dataset updated
    Feb 3, 2025
    Authors
    Xiaoyu Zhang
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Description

    Background Lean body mass is a crucial physiological component of body composition. Although lean body mass has a high heritability, studies evaluating the genetic determinants of lean mass (LM) have to date been limited largely to genome-wide association studies (GWAS) and common variants. Using whole genome sequencing (WGS)-based studies, we aimed to discover novel genetic variants associated with LM in population-based cohorts with multiple ancestries. Results We describe the largest WGS-based meta-analysis of lean body mass to date, encompassing 10,729 WGS samples from six TOPMed cohorts and the Louisiana Osteoporosis Study (LOS) cohort, measured with dual-energy X-ray absorptiometry. We identify seven genome-wide loci significantly associated with LM not reported by previous GWAS. We partially replicate these associations in UK Biobank samples. In rare variant analysis, we discover one novel protein-coding gene, DMAC1, associated with both whole-body LM and appendicular LM in females, and a long non-coding RNA gene linked to appendicular LM in males. Both genes exhibit notably high expression levels in skeletal muscle tissue. We investigate the functional roles of two novel lean-mass-related genes, EMP2 and SSUH2, in animal models. EMP2 deficiency in Drosophila leads to significantly reduced mobility without altering muscle tissue or body fat morphology, whereas an SSUH2 gene mutation in zebrafish stimulates muscle fiber growth. Conclusions Our comprehensive analysis, encompassing a large-scale WGS meta-analysis and functional investigations, reveals novel genomic loci and genes associated with lean mass traits, shedding new insights into pathways influencing muscle metabolism and muscle mass regulation.

  14. d

    Data from: Genetic and functional analysis of Raynaud’s syndrome implicates...

    • datadryad.org
    • data.niaid.nih.gov
    zip
    Updated Aug 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anniina Tervi; Markus Ramste; Erik Abner; Paul Cheng; Jacqueline M. Lane; Matthew Maher; Jesse Valliere; Vilma Lammi; Satu Strausz; Juha Riikonen; Trieu Nguyen; Gabriella E. Martyn; Maya U. Sheth; Fan Xia; Mauro Lago Docampo; Wenduo Gu; Tõnu Esko; Richa Saxena; Matti Pirinen; Aarno Palotie; Samuli Ripatti; Nasa Sinnott-Armstrong; Mark Daly; Jesse M. Engreitz; Marlene Rabinovitch; Caroline A. Heckman; Thomas Quertermous; Samuel E. Jones; Hanna M. Ollila (2024). Genetic and functional analysis of Raynaud’s syndrome implicates loci in vasculature and immunity [Dataset]. http://doi.org/10.5061/dryad.1g1jwsv53
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 21, 2024
    Dataset provided by
    Dryad
    Authors
    Anniina Tervi; Markus Ramste; Erik Abner; Paul Cheng; Jacqueline M. Lane; Matthew Maher; Jesse Valliere; Vilma Lammi; Satu Strausz; Juha Riikonen; Trieu Nguyen; Gabriella E. Martyn; Maya U. Sheth; Fan Xia; Mauro Lago Docampo; Wenduo Gu; Tõnu Esko; Richa Saxena; Matti Pirinen; Aarno Palotie; Samuli Ripatti; Nasa Sinnott-Armstrong; Mark Daly; Jesse M. Engreitz; Marlene Rabinovitch; Caroline A. Heckman; Thomas Quertermous; Samuel E. Jones; Hanna M. Ollila
    Description

    This dataset includes summary statistics from Raynaud's syndrome meta-analysis conducted from GWAS summary statistics of four independent population cohorts: The UK Biobank, FinnGen data freeze 10, The Estonian Biobank and The Mass-General Brigham Biobank. Details of the phenotype used and the GWAS performed in the individual cohorts can be found from the corresponding publication. The meta-analysis was conducted using METAL (https://genome.sph.umich.edu/wiki/METAL_Documentation) with the standard settings and the final summary statistics are displayed in GRCh38 (for further details, see the corresponding publication).

  15. f

    S2 File -

    • plos.figshare.com
    xlsx
    Updated Jul 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Changlong Yang; Zhenglin Chang; Youguo Dai; Jinzhao Mo; Qitai Zhang; Mingming Zhu; Likun Luan; Jinhu Zhang; Baoqing Sun; Junyi Jia (2024). S2 File - [Dataset]. http://doi.org/10.1371/journal.pone.0301811.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 18, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Changlong Yang; Zhenglin Chang; Youguo Dai; Jinzhao Mo; Qitai Zhang; Mingming Zhu; Likun Luan; Jinhu Zhang; Baoqing Sun; Junyi Jia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundRecent studies have demonstrated the relevance of circulating factors in the occurrence and development of colorectal cancer (CRC); however, the causal relationship remains unclear.MethodsSummary-level data for CRC were obtained from the UK Biobank (5,657 cases and 372,016 controls), FinnGen cohort (3,022 cases and 215,770 controls), and BioBank Japan Project (BBJ, 7,062 cases and 195,745 controls). Thirty-two peripheral markers with consistent definitions were collected from the three biobanks. Mendelian randomization (MR) was used to evaluate the causal effect of circulating factors on CRC. The effects from the three consortiums were combined using trans-ancestry meta-analysis methods.ResultsOur analysis provided compelling evidence for the causal association of higher genetically predicted eosinophil cell count (EOS, odds ratio [OR], 0.8639; 95% confidence interval [CI] 0.7922–0.9421) and red cell distribution width (RDW, OR, 0.9981; 95% CI, 0.9972–0.9989) levels with a decreased risk of CRC. Additionally, we found suggestive evidence indicating that higher levels of total cholesterol (TC, OR, 1.0022; 95% CI, 1.0002–1.0042) may increase the risk of CRC. Conversely, higher levels of platelet count (PLT, OR, 0.9984; 95% CI, 0.9972–0.9996), total protein (TP, OR, 0.9445; 95% CI, 0.9037–0.9872), and C-reactive protein (CRP, OR, 0.9991; 95% CI, 0.9983–0.9999) may confer a protective effect against CRC. Moreover, we identified six ancestry-specific causal factors, indicating the necessity of considering patients’ ancestry backgrounds before formulating prevention strategies.ConclusionsMR findings support the independent causal roles of circulating factors in CRC, which might provide a deeper insight into early detection of CRC and supply potential preventative strategies.

  16. d

    Data from: X-linked multi-ancestry meta-analysis reveals tuberculosis...

    • datadryad.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Jun 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haiko Schurz (2024). X-linked multi-ancestry meta-analysis reveals tuberculosis susceptibility variants [Dataset]. http://doi.org/10.5061/dryad.2z34tmpv5
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 5, 2024
    Dataset provided by
    Dryad
    Authors
    Haiko Schurz
    Description

    Globally, tuberculosis (TB) presents with a clear male bias that cannot be completely accounted for by environment, behaviour, socioeconomic factors, or the impact of sex hormones on the immune system. This suggests that genetic and biological differences, which may be mediated by the X chromosome, further influence the observed male sex bias. The X chromosome is heavily implicated in immune function and yet has largely been ignored in previous association studies. Here we report the first multi-ancestry X chromosome specific meta-analysis on TB susceptibility. We identified X-linked TB susceptibility variants using seven genotyping data sets and 20,255 individuals from diverse genetic ancestries. Sex-specific effects were also identified in polygenic heritability between males and females along with enhanced concordance in direction of genetic effects for males but not females. These sex-specific genetic effects were supported by a sex-stratified and combined meta-analysis conducted us...

  17. A total of 19 bi-allelic SNPs exhibiting genome-wide significant effects in...

    • plos.figshare.com
    xls
    Updated May 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Boxi Lin; Andrew D. Paterson; Lei Sun (2024). A total of 19 bi-allelic SNPs exhibiting genome-wide significant effects in opposite directions between sexes, all associated with urate levels in the UK Biobank data application. [Dataset]. http://doi.org/10.1371/journal.pgen.1011221.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 6, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Boxi Lin; Andrew D. Paterson; Lei Sun
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A total of 19 bi-allelic SNPs exhibiting genome-wide significant effects in opposite directions between sexes, all associated with urate levels in the UK Biobank data application.

  18. Meta-GWAS for age-related hearing impairement

    • zenodo.org
    application/gzip
    Updated May 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natalia Trpchevska; Natalia Trpchevska; Maxim B. Freidin; Maxim B. Freidin; Linda Broer; Linda Broer; Christopher Cederroth; Christopher Cederroth; Frances M.K. Williams; Frances M.K. Williams; Paul Nagtegaal; Paul Nagtegaal (2022). Meta-GWAS for age-related hearing impairement [Dataset]. http://doi.org/10.5281/zenodo.5769707
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 16, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Natalia Trpchevska; Natalia Trpchevska; Maxim B. Freidin; Maxim B. Freidin; Linda Broer; Linda Broer; Christopher Cederroth; Christopher Cederroth; Frances M.K. Williams; Frances M.K. Williams; Paul Nagtegaal; Paul Nagtegaal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset comprises summary statistics from the meta-GWAS of 17 studies on age-related hearing impairement. The dataset accompanies the following paper:

    Genome-wide association meta-analysis identifies 48 risk variants and highlights the role of the stria vascularis in age-related hearing impairment

    Please cite the paper if using this dataset.

    Phenotype of ARHI was established using ICD diagnoses and self-reported hearing loss. The study comprised 148,152 cases and 575,472 controls or European ancestry. Adult male and female participants were included from the following 17 population-based cohort studies: Age, Genes/Environment Susceptibility - Reykjavik (AGES), the Danish Twin Registry (DTR), the Estonian Genome Center at the University of Tartu (EGCUT), FinnGen, Framingham Heart Study (FHS), Health Aging and Body Composition (HABC), Italian Network of Genetic Isolates - Friuli Venezia Giulia (INGI-FVG), the Rotterdam Study (RS, cohorts 1 - 3), the Salus in Apulia study (SA; formerly known as Great Age study), Screening Across the Lifespan Twin (SALT and SALTY - young), Screening Twin Adults: Genes and Environment (STAGE), TwinsUK, UK Biobank (UKBB), and the Women’s Genome Health Study (WGHS).

    UK Biobank data have been used under project #11516.

    Individual GWASs have been QC'd and harmonyzed using EasyQC followed by fixed-effects IVW meta-analysis using METAL. The dataset includes the results of meta-analysis for n = 8,244,938 SNV with MAF >0.001 and present in at least 9 cohorts.

    Dataset columns:

    SNP, rsID

    CHR, chromosome

    BP, genomic position (hg19)

    Allele1, effect allele

    Allele2, other allele

    Freq1, mean frequency of Allele1

    FreqSE, standard error of Freq1

    MinFreq, minimal frequency of Allele1 in the study cohorts

    MaxFreq, maximal frequency of Allele1 in the study cohorts

    Effect, effect size from the meta-analysis for Allele1

    StdErr, standard error of Effect

    P.value, corresponding p-value for meta-analysis

    Direction, direction of effects in individual studies

    HetISq, I2 statistic for heterogeneity between studies

    HetChiSq, chi2 statistic for heterogeneity between studies

    HetDf, degrees of freedom for the chi2 statistic

    HetPval, p-value for heterogeneity between studies

    N, summary sample size

    Dataset columns description:

    Seventeen studies included:

  19. A Large Genome-Wide Association Study of Age-Related Hearing Impairment...

    • plos.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas J. Hoffmann; Bronya J. Keats; Noriko Yoshikawa; Catherine Schaefer; Neil Risch; Lawrence R. Lustig (2023). A Large Genome-Wide Association Study of Age-Related Hearing Impairment Using Electronic Health Records [Dataset]. http://doi.org/10.1371/journal.pgen.1006371
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Thomas J. Hoffmann; Bronya J. Keats; Noriko Yoshikawa; Catherine Schaefer; Neil Risch; Lawrence R. Lustig
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Age-related hearing impairment (ARHI), one of the most common sensory disorders, can be mitigated, but not cured or eliminated. To identify genetic influences underlying ARHI, we conducted a genome-wide association study of ARHI in 6,527 cases and 45,882 controls among the non-Hispanic whites from the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. We identified two novel genome-wide significant SNPs: rs4932196 (odds ratio = 1.185, p = 4.0x10-11), 52Kb 3’ of ISG20, which replicated in a meta-analysis of the other GERA race/ethnicity groups (1,025 cases, 12,388 controls, p = 0.00094) and in a UK Biobank case-control analysis (30,802 self-reported cases, 78,586 controls, p = 0.015); and rs58389158 (odds ratio = 1.132, p = 1.8x10-9), which replicated in the UK Biobank (p = 0.00021). The latter SNP lies just outside exon 8 and is highly correlated (r2 = 0.96) with the missense SNP rs5756795 in exon 7 of TRIOBP, a gene previously associated with prelingual nonsyndromic hearing loss. We further tested these SNPs in phenotypes from audiologist notes available on a subset of GERA (4,903 individuals), stratified by case/control status, to construct an independent replication test, and found a significant effect of rs58389158 on speech reception threshold (SRT; overall GERA meta-analysis p = 1.9x10-6). We also tested variants within exons of 132 other previously-identified hearing loss genes, and identified two common additional significant SNPs: rs2877561 (synonymous change in ILDR1, p = 6.2x10-5), which replicated in the UK Biobank (p = 0.00057), and had a significant GERA SRT (p = 0.00019) and speech discrimination score (SDS; p = 0.0019); and rs9493627 (missense change in EYA4, p = 0.00011) which replicated in the UK Biobank (p = 0.0095), other GERA groups (p = 0.0080), and had a consistent significant result for SRT (p = 0.041) and suggestive result for SDS (p = 0.081). Large cohorts with GWAS data and electronic health records may be a useful method to characterize the genetic architecture of ARHI.

  20. A list of association methods considered in this analysis.

    • plos.figshare.com
    xls
    Updated May 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Boxi Lin; Andrew D. Paterson; Lei Sun (2024). A list of association methods considered in this analysis. [Dataset]. http://doi.org/10.1371/journal.pgen.1011221.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 6, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Boxi Lin; Andrew D. Paterson; Lei Sun
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A list of association methods considered in this analysis.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ken B. Hanscombe; Jonathan R. I. Coleman; Matthew Traylor; Cathryn M. Lewis (2023). ukbtools: An R package to manage and query UK Biobank data [Dataset]. http://doi.org/10.1371/journal.pone.0214311

ukbtools: An R package to manage and query UK Biobank data

Explore at:
16 scholarly articles cite this dataset (View in Google Scholar)
pdfAvailable download formats
Dataset updated
May 31, 2023
Dataset provided by
PLOS ONE
Authors
Ken B. Hanscombe; Jonathan R. I. Coleman; Matthew Traylor; Cathryn M. Lewis
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

IntroductionThe UK Biobank (UKB) is a resource that includes detailed health-related data on about 500,000 individuals and is available to the research community. However, several obstacles limit immediate analysis of the data: data files vary in format, may be very large, and have numerical codes for column names.Resultsukbtools removes all the upfront data wrangling required to get a single dataset for statistical analysis. All associated data files are merged into a single dataset with descriptive column names. The package also provides tools to assist in quality control by exploring the primary demographics of subsets of participants; query of disease diagnoses for one or more individuals, and estimating disease frequency relative to a reference variable; and to retrieve genetic metadata.ConclusionHaving a dataset with meaningful variable names, a set of UKB-specific exploratory data analysis tools, disease query functions, and a set of helper functions to explore and write genetic metadata to file, will rapidly enable UKB users to undertake their research.

Search
Clear search
Close search
Google apps
Main menu