19 datasets found
  1. n

    Sociability GWAS in a population-based sample : summary statistics of a...

    • narcis.nl
    • lifesciences.datastations.nl
    pdf
    Updated Mar 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bralten, J.B. (Radboud University); Roth Mota, N. (Radboud University); Klemann, C.J.H.M. (Radboud University); Witte, W. de (2021). Sociability GWAS in a population-based sample : summary statistics of a genome-wide association study of an aggregated sociability score in the UK Biobank [Dataset]. http://doi.org/10.17026/dans-ztj-zga6
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Mar 12, 2021
    Dataset provided by
    Data Archiving and Networked Services (DANS)
    Authors
    Bralten, J.B. (Radboud University); Roth Mota, N. (Radboud University); Klemann, C.J.H.M. (Radboud University); Witte, W. de
    Area covered
    northlimit=59.62358300012501; eastlimit=2.374666058072581; southlimit=49.568413008749225; westlimit=-8.205608345022652United Kingdom
    Description

    Levels of sociability are continuously distributed in the general population, and decreased sociability represents an early manifestation of several brain disorders. Here, we investigated the genetic underpinnings of sociability in the population.

    Main question of our research: 1. Are there common genetic variants that are associated with sociability in the general population? 2. Are genetic variants that are associated with sociability also associated with neuropsychiatric disorders?

    Type of data uploaded in this repository: The UK Biobank project (see https://www.ukbiobank.ac.uk/) is a large-scale biomedical database and research resource, containing in-depth genetic and health information from half a million UK participants. The database is globally accessible to approved researchers undertaking vital research into the most common and life-threatening diseases. The raw data that this project is based on comes from the publically available UK Biobank set, which is very large and is therefore not provided here. Here we only provide the results from our analysis, that is also described here: https://www.biorxiv.org/content/10.1101/781195v2 and currently in revision in a scientific journal. In the dataset you will find the association of 9327396 genetic variants with the phenotype sociability. This dataset is not applicable to be opened with Excel, and can best be opened on a cluster computer or using specfic software.

    Subjects The UK Biobank (UKBB) is a major population-based cohort from the United Kingdom that includes individuals aged between 37 and 73 years. We constructed a sociability measure based on the the aggregation of scores per participant on four questions from the UKBB database that link to sociability, including (1) a question about the frequency of friend/family visits, (2) a question on the number and type of social venues that are visited, (3) a question about worrying after social embarrassment and (4) a question about feeling lonely, leading to a sociability score ranging from 0-4. Participants were excluded if they had somatic problems that could be related to social withdrawal (BMI < 15 or BMI > 40, narcolepsy (all the time), stroke, severe tinnitus, deafness or brain-related cancers) or if they answered that they had “No friends/family outside household” or “Do not know” or “Prefer not to answer” to any of the questions.

    SNP genotyping and quality control Details about the available genome-wide genotyping data for UKBB participants have been reported previously (PMID: 30305743). We used third-release genotyping data (see https://biobank.ctsu.ox.ac.uk/crystal/label.cgi?id=100319). Briefly, 49,950 participants were genotyped using the UK BiLEVE Axiom Array and 438,427 participants were genotyped using UK Biobank Axiom Array. Genotypes were imputed into the dataset using the Haplotype Reference Consortium (HRC), and the UK10K haplotype resource. To account for ethnicity, we included only those individuals that identified themselves as "white" by self-report and plotted the Principal Components (PC) provided by the UKBB, excluding individuals considered to be outliers according to PCs 1 and 2. Genetic relatedness calculated with KING kinship and provided by the UKBB (https://kenhanscombe.github.io/ukbtools/articles/explore-ukb-data.html ; http://www.ukbiobank.ac.uk/wp-content/uploads/2014/04/UKBiobank_genotyping_QC_documentation-web.pdf) was used to identify first and second-degree relatives. Subsequently ´families´ (i.e. clusters of related individuals above an IBD>0.125 threshold) were created and only one individual from each of these created ‘families’ was included in the analysis. If self-reported sex and SNP-based sex differed, individuals were excluded from further analysis. Single nucleotide polymorphisms (SNPs) with minor allele frequency <0.005, Hardy-Weinberg equilibrium test P value<1e−6, missing genotype rate >0.05, and imputation quality of INFO <0.8 were excluded. In the current study, all analyses are based on 342,461 participants of European ancestry for which both genotype data and sociability scores were available.

    Genome-wide association analysis Genome-wide association analysis with the imputed marker dosages was performed in PLINK1.9, using a linear regression model with the sociability measure as the dependent variable and including sex, age, 10 first PCs, assessment center, and genotype batch as covariates. SNPs were considered significantly associated if they had p-value < 5e-8. Associated loci were considered independent of each other at r2 0.6 and lead SNPs were classified as the SNP with the smallest association p-value and at r2 0.1, using a 250kb window. The summary statistics come from the plink2 linear regression analysis.

  2. h

    UK BiLEVE Consortium Dataset

    • web.prod.hdruk.cloud
    unknown
    Updated Oct 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UK BiLEVE Consortium. Please cite Wain et al 2015, doi: 10.1016/S2213-2600(15)00283-0 (2024). UK BiLEVE Consortium Dataset [Dataset]. https://web.prod.hdruk.cloud/dataset/281
    Explore at:
    unknownAvailable download formats
    Dataset updated
    Oct 8, 2024
    Dataset authored and provided by
    UK BiLEVE Consortium. Please cite Wain et al 2015, doi: 10.1016/S2213-2600(15)00283-0
    License

    https://www.ebi.ac.uk/gwas/publications/30804560https://www.ebi.ac.uk/gwas/publications/30804560

    Description

    https://www.nature.com/articles/s41588-018-0321-7

    Lung function is an important indicator of respiratory health and mortality. Measures of lung function show irreversible airway obstruction in chronic obstructive pulmonary disease (COPD), a progressive condition affecting 900,000 people in the UK. Smoking is a strong risk factor for COPD but not all smokers are equally susceptible. Genetic approaches to understanding the mechanisms underlying the maintenance of good lungs aim to reveal previously unknown molecular targets for drug development and to facilitate stratified approaches to treatment and care. This project aims to detect rare genetic variants associated with lung function. Once discovered, such variants tend to exert a large effect on disease risk and provide a means to translate findings from genetic studies of lung function to clinical relevant research and development. The proposed study leverages the power of Uk Biobank and respiratory genomics to advance understanding of lung function and COPD.

  3. Data from: Brain Ages Derived from Different MRI Modalities are Associated...

    • zenodo.org
    csv
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrei-Claudiu Roibu; Andrei-Claudiu Roibu; Stanislaw Adaszewski; Torsten Schindler; Stephen M. Smith; Stephen M. Smith; Ana I.L. Namburete; Ana I.L. Namburete; Frederik J. Lange; Frederik J. Lange; Stanislaw Adaszewski; Torsten Schindler (2025). Brain Ages Derived from Different MRI Modalities are Associated with Distinct Biological Phenotypes [Dataset]. http://doi.org/10.5281/zenodo.8110876
    Explore at:
    csvAvailable download formats
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Andrei-Claudiu Roibu; Andrei-Claudiu Roibu; Stanislaw Adaszewski; Torsten Schindler; Stephen M. Smith; Stephen M. Smith; Ana I.L. Namburete; Ana I.L. Namburete; Frederik J. Lange; Frederik J. Lange; Stanislaw Adaszewski; Torsten Schindler
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    Brain ageing is a highly variable, spatially and temporally heterogeneous process, marked by numerous structural and functional changes. These can cause discrepancies between individuals’ chronological age and the apparent age of their brain, as inferred from neuroimaging data. Machine learning models, and particularly Convolutional Neural Networks (CNNs), have proven adept in capturing patterns relating to ageing induced changes in the brain. The differences between the predicted and chronological ages, referred to as brain age deltas, have emerged as useful biomarkers for exploring those factors which promote accelerated ageing or resilience, such as pathologies or lifestyle factors. However, previous studies rely only on structural neuroimaging for predictions, overlooking potentially informative functional and microstructural changes. Here we show that multiple contrasts derived from different MRI modalities can predict brain age, each encoding bespoke brain ageing information. By using 3D CNNs and UK Biobank data, we found that 57 contrasts derived from structural, susceptibility-weighted, diffusion, and functional MRI can successfully predict brain age. For each contrast, different patterns of association with non-imaging phenotypes were found, resulting in a total of 191 unique, statistically significant associations. Furthermore, we found that ensembling data from multiple contrasts results in both higher prediction accuracies and stronger correlations to non-imaging measurements. Our results demonstrate that other 3D contrasts and modalities, which have not been considered so far for the task of brain age prediction, encode different information about the ageing brain. We envision our work as being the starting point for future investigations into the causal links underpinning the observed brain age deltas and non-imaging measurement associations. For instance, drug effects can be monitored, given that certain medications correlated with accelerated brain ageing. Furthermore, continued development of brain age models could facilitate their deployment in clinical trials for recruitment and monitoring, and hospitals for diagnostic and screening tasks.

    Data Description

    This dataset contains the full correlation results with all nIDPs in the UK Biobank. These are presented in datasets split by sex in Female and Male subjects. For easier data manipulation, two smaller datasets have also been made available, containing just those correlation which pass the False Discovery Rate (FDR) threshold.

    As experiments were also conducted for ensembles using multiple contrasts, similar datasets are provided for those.

    Finally, global datasets are also provided. These are the concatenation of the associations contained in the Male and Female datasets.

    Paper & Code

    The original paper for this article can be accessed here:

    To access the codes relevant for this project, please access the project GitHub Repos:

    If using this work, please cite it based on the above paper, or using the following BibTex:

    @inproceedings{roibu2023brain,
     title={Brain Ages Derived from Different MRI Modalities are Associated with Distinct Biological Phenotypes},
     author={Roibu, Andrei-Claudiu and Adaszewski, Stanislaw and Schindler, Torsten and Smith, Stephen M and Namburete, Ana IL and Lange, Frederik J},
     booktitle={2023 10th IEEE Swiss Conference on Data Science (SDS)},
     pages={17--25},
     year={2023},
     organization={IEEE},
     doi={10.1109/SDS57534.2023.00010}
    }

    Data Access

    The data for this project is freely available upon application at the UK Biobank. For more information regarding the individual nIDPs, please access the UK Biobank Showcase website at: https://biobank.ctsu.ox.ac.uk/showcase/search.cgi

    Funding

    ACR is supported by EPSRC Grant EP/S024093/1, F. Hoffmann-La Roche AG and a 2021 Industrial Fellowship offered by the Royal Commission for the Exhibition of 1851. SMS is supported by a Wellcome Trust Collaborative Award 215573/Z/19/Z. AILN is grateful for support from the Academy of Medical Sciences under the Springboard Awards scheme (SBF005/1136), and the Bill and Melinda Gates Foundation. FJL is supported by a Wellcome Trust Collaborative Award (215573/Z/19/Z). The WIN is supported by core funding from the Wellcome Trust (203139/Z/16/Z). The computational aspects were supported by the Wellcome Trust (203141/Z/16/Z) and the NIHR Oxford BRC. Corresponding authors: ACR (andreiroibu@icloud.com), SA (stanislaw.adaszewski@roche.com) and AILN (ana.namburete@cs.ox.ac.uk).

  4. Association between coding load scores restricted to sites where the human...

    • plos.figshare.com
    xlsx
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ha My T. Vy; Daniel M. Jordan; Daniel J. Balick; Ron Do (2023). Association between coding load scores restricted to sites where the human genome reference allele is the ancestral allele and 27 phenotypes. [Dataset]. http://doi.org/10.1371/journal.pgen.1009337.s011
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Ha My T. Vy; Daniel M. Jordan; Daniel J. Balick; Ron Do
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This table shows the associations between the 27 phenotypes (Table 1) and coding load score computed from sites where the human genome reference allele is same as the annotated ancestral allele. (XLSX)

  5. E

    'Hill_CB_2016' - Data supporting paper 'Molecular genetic contributions to...

    • find.data.gov.scot
    • dtechtive.com
    txt
    Updated Jun 4, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Edinburgh. Centre for Cognitive Ageing and Cognitive Epidemiology (2019). 'Hill_CB_2016' - Data supporting paper 'Molecular genetic contributions to social deprivation and household income in UK Biobank'. Current Biology (2016). [Dataset]. http://doi.org/10.7488/ds/2562
    Explore at:
    txt(792.3 MB), txt(0.0166 MB), txt(792.7 MB), txt(0.0005 MB), txt(793.1 MB)Available download formats
    Dataset updated
    Jun 4, 2019
    Dataset provided by
    University of Edinburgh. Centre for Cognitive Ageing and Cognitive Epidemiology
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data supporting paper 'Molecular genetic contributions to social deprivation and household income in UK Biobank'. Current Biology (2016). doi: 10.1016/j.cub.2016.09.035 ## Note re working with data ## Each of the three data files contains over seventeen million rows. Users will encounter difficulties if they attempt to view the content using Notepad++ or Microsoft Notepad. Microsoft Excel 2016 will not display all rows. These space-delimited text files contains seven columns, with a header row, which are listed in the readme file. ## Note re other copy ## The data files are identical to the files of the same name previously made available on the website of the Centre for Cognitive Ageing and Cognitive Epidemiology (CCACE) http://www.ccace.ed.ac.uk/node/335 as the zip archive 'Hill_CB_2016.zip'.

  6. DataSheet1_Associations of serum cystatin C concentrations with total...

    • frontiersin.figshare.com
    docx
    Updated Apr 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Changzhi Huang; Jiayi Lu; Jing Yang; Zhenling Wang; Dong Hang; Zan Fu (2024). DataSheet1_Associations of serum cystatin C concentrations with total mortality and mortality of 12 site-specific cancers.docx [Dataset]. http://doi.org/10.3389/fmolb.2024.1209349.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Apr 25, 2024
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Changzhi Huang; Jiayi Lu; Jing Yang; Zhenling Wang; Dong Hang; Zan Fu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Purpose:Cystatin C (CysC), beyond its biomarker role of renal function, has been implicated in various physical and pathological activities. However, the impact of serum CysC on cancer mortality in a general population remains unknown. We aimed to examine the associations of serum CysC concentrations with total mortality and mortality of 12 site-specific cancers.Methods:We included 241,008 participants of the UK Biobank cohort with CysC measurements who had normal creatinine-based estimated glomerular filtration rates and were free of cancer and renal diseases at baseline (2006–2010). Death information was obtained from the National Health Service death records through 28 February 2021. Multivariable Cox proportional hazards models were used to compute hazard ratios (HR) per one standard deviation increase in log-transformed CysC concentrations and 95% confidence intervals (95% CI) for mortality.Results:Over a median follow-up of 12.1 (interquartile range, 11.3–12.8) years, 5,744 cancer deaths occurred. We observed a positive association between serum CysC concentrations and total cancer mortality (HR = 1.16, 95% CI: 1.12–1.20). Specifically, participants with higher serum CysC concentrations had increased mortality due to lung cancer (HR = 1.12, 95% CI: 1.05–1.20), blood cancer (HR = 1.29, 95% CI: 1.16–1.44), brain cancer (HR = 1.19, 95% CI: 1.04–1.36), esophageal cancer (HR = 1.20, 95% CI: 1.05–1.37), breast cancer (HR = 1.18, 95% CI: 1.03–1.36), and liver cancer (HR = 1.49, 95% CI: 1.31–1.69).Conclusion:Our findings indicate that higher CysC concentrations are associated with increased mortality due to lung, blood, brain, esophageal, breast, and liver cancers. Future studies are necessary to clarify underlying mechanisms.

  7. Varicose veins GWAS

    • kaggle.com
    zip
    Updated Jul 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MariusC (2023). Varicose veins GWAS [Dataset]. https://www.kaggle.com/datasets/mariusc/varicose-veins-gwas
    Explore at:
    zip(238177722 bytes)Available download formats
    Dataset updated
    Jul 5, 2023
    Authors
    MariusC
    Description

    The Dataset is a summary statistics for discovery GWAS of 22,473 varicose veins cases and 379,183 non-varicose veins controls in UK Biobank (Ahmed W. et al., Nature Communications, 2022). Since it's under OAI, this dataset can be freely used as long it respects the ORA terms of use:

    The University of Oxford ('the University'), through the Bodleian Libraries, has developed Oxford University Research Archive ('ORA') so that users may access research outputs of the University.

    Copyright and other rights in the items held in ORA are retained by the individual authors, the University, or other third parties. The extent of the rights reserved and the identity of the parties retaining those rights will vary on an item-by-item basis.

    Users are required to comply with the permissions notice that applies to each item in ORA. Unless otherwise indicated in the relevant permissions notice on the item record, users may download and/or print one copy of any item in ORA to facilitate their private study or research for non-commercial purposes. Whilst items held in ORA may be subject to multiple intellectual property rights, depositors only licence the copyright through ORA, and no licence under any patents, design rights, trade marks or any other right that may exist in relation to such items is granted or implied. Some items in ORA are distributed under Creative Commons or other licences which may allow for further commercial or non-commercial use. Subject to such licences, users may not use material obtained from ORA for any profit-making activities or any commercial gain.

    You may freely distribute the URL (https://ora.ox.ac.uk) of the ORA website but if you wish to link to any item in ORA you must do so through the item record page.

    ORA supports and participates in the Open Archives Initiative (OAI). You can find details of the ORA API at https://ora.ox.ac.uk/api.

    These terms are governed by English law, and the Courts of England have exclusive jurisdiction in relation to them.

  8. Associations of genetic predisposition to smoking initiation with...

    • plos.figshare.com
    xlsx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Susanna C. Larsson; Paul Carter; Siddhartha Kar; Mathew Vithayathil; Amy M. Mason; Karl Michaëlsson; Stephen Burgess (2023). Associations of genetic predisposition to smoking initiation with site-specific cancers in the primary inverse-variance weighted analysis and in sensitivity analyses using other MR methods. [Dataset]. http://doi.org/10.1371/journal.pmed.1003178.s004
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Susanna C. Larsson; Paul Carter; Siddhartha Kar; Mathew Vithayathil; Amy M. Mason; Karl Michaëlsson; Stephen Burgess
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MR, mendelian randomisation. (XLSX)

  9. Derived allele frequency stratification analysis.

    • figshare.com
    xlsx
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ha My T. Vy; Daniel M. Jordan; Daniel J. Balick; Ron Do (2023). Derived allele frequency stratification analysis. [Dataset]. http://doi.org/10.1371/journal.pgen.1009337.s007
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Ha My T. Vy; Daniel M. Jordan; Daniel J. Balick; Ron Do
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Associations between the 27 phenotypes (Table 1) and four load scores computed for four different groups of coding variants stratified by derived allele frequency: 76,185 variants with DAF∈(0,0.05); 10,552 variants with DAF∈[0.05,0.3); 5,530 variants with DAF∈[0.3,0.7); and 3,587 variants with DAF∈[0.7,1). (XLSX)

  10. phyloP score stratification analysis.

    • plos.figshare.com
    xlsx
    Updated Jun 5, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ha My T. Vy; Daniel M. Jordan; Daniel J. Balick; Ron Do (2023). phyloP score stratification analysis. [Dataset]. http://doi.org/10.1371/journal.pgen.1009337.s008
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Ha My T. Vy; Daniel M. Jordan; Daniel J. Balick; Ron Do
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Associations between the 27 phenotypes (Table 1) and five load scores computed for five different groups of coding variants stratified by phyloP score: 54,840 variants with phyloP∈(0,2]; 19,414 variants with phyloP∈(2,4]; 10,865 variants with phyloP∈(4,6]; 7,582 variants with phyloP∈(6,8]; and 3,315 variants with phyloP∈(8,10]. (XLSX)

  11. f

    ‘Alarm’ symptom-cancer pairs in the UK Biobank and CPRD comparison cohorts.

    • plos.figshare.com
    xls
    Updated Dec 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Barclay; Cristina Renzi; Antonis Antoniou; Spiros Denaxas; Hannah Harrison; Samantha Ip; Nora Pashayan; Ana Torralbo; Juliet Usher-Smith; Angela Wood; Georgios Lyratzopoulos (2023). ‘Alarm’ symptom-cancer pairs in the UK Biobank and CPRD comparison cohorts. [Dataset]. http://doi.org/10.1371/journal.pdig.0000383.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 15, 2023
    Dataset provided by
    PLOS Digital Health
    Authors
    Matthew Barclay; Cristina Renzi; Antonis Antoniou; Spiros Denaxas; Hannah Harrison; Samantha Ip; Nora Pashayan; Ana Torralbo; Juliet Usher-Smith; Angela Wood; Georgios Lyratzopoulos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ‘Alarm’ symptom-cancer pairs in the UK Biobank and CPRD comparison cohorts.

  12. Baseline characteristics of the UKBB participants included in this study.

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mathew Vithayathil; Paul Carter; Siddhartha Kar; Amy M. Mason; Stephen Burgess; Susanna C. Larsson (2023). Baseline characteristics of the UKBB participants included in this study. [Dataset]. http://doi.org/10.1371/journal.pmed.1003706.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Mathew Vithayathil; Paul Carter; Siddhartha Kar; Amy M. Mason; Stephen Burgess; Susanna C. Larsson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Baseline characteristics of the UKBB participants included in this study.

  13. Multimodal MRI protocols for COVID-related neuroimaging with Siemens and GE...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eugene Duff; Fernando Zelaya; Fidel Alfaro Almagro; Karla L. Miller; Naomi Martin; Thomas E. Nichols; Bernd Taschler; Ludovica Griffanti; Christoph Arthofer; Gwenaëlle Douaud; Chaoyue Wang; Thomas W. Okell; Richard A. I. Bethlehem; Klaus Eickel; Matthias Günther; David K. Menon; Guy Williams; Bethany Facer; David J. Lythgoe; Flavio Dell’Acqua; Greta K. Wood; Steven C. R. Williams; Gavin Houston; Simon S. Keller; Catherine Holden; Monika Hartmann; Lily George; Gerome Breen; Benedict D. Michael; Peter Jezzard; Stephen M. Smith; Edward T. Bullmore (2023). Multimodal MRI protocols for COVID-related neuroimaging with Siemens and GE 3T scanners. [Dataset]. http://doi.org/10.1371/journal.pone.0273704.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Eugene Duff; Fernando Zelaya; Fidel Alfaro Almagro; Karla L. Miller; Naomi Martin; Thomas E. Nichols; Bernd Taschler; Ludovica Griffanti; Christoph Arthofer; Gwenaëlle Douaud; Chaoyue Wang; Thomas W. Okell; Richard A. I. Bethlehem; Klaus Eickel; Matthias Günther; David K. Menon; Guy Williams; Bethany Facer; David J. Lythgoe; Flavio Dell’Acqua; Greta K. Wood; Steven C. R. Williams; Gavin Houston; Simon S. Keller; Catherine Holden; Monika Hartmann; Lily George; Gerome Breen; Benedict D. Michael; Peter Jezzard; Stephen M. Smith; Edward T. Bullmore
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Multimodal MRI protocols for COVID-related neuroimaging with Siemens and GE 3T scanners.

  14. Expanded table of outlier SNPs.

    • figshare.com
    xlsx
    Updated Oct 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuval B. Simons; Hakhamanesh Mostafavi; Huisheng Zhu; Courtney J. Smith; Jonathan K. Pritchard; Guy Sella (2025). Expanded table of outlier SNPs. [Dataset]. http://doi.org/10.1371/journal.pbio.3003402.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 13, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yuval B. Simons; Hakhamanesh Mostafavi; Huisheng Zhu; Courtney J. Smith; Jonathan K. Pritchard; Guy Sella
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Genome-wide association studies have revealed that the genetic architectures of complex traits vary widely, including in terms of the numbers, effect sizes, and allele frequencies of significant hits. However, at present we lack a principled way of understanding the similarities and differences among traits. Here, we describe a probabilistic model that combines the effects of mutation, drift, and stabilizing selection at individual sites with a genome-scale model of phenotypic variation. In this model, the architecture of a trait arises from the distribution of selection coefficients of mutations and from two scaling parameters. We fit this model for 95 highly polygenic quantitative traits of different kinds from the UK Biobank. Notably, we infer that all these traits have fairly similar, though not identical, distributions of selection coefficients. This similarity suggests that differences in architectures of highly polygenic traits arise mainly from the two scaling parameters: the mutational target size and heritability per site, which vary by orders of magnitude among traits. When these two scale factors are accounted for, we find that the architectures of all 95 traits are very similar.

  15. f

    Details of the different outcome measures considered.

    • plos.figshare.com
    xls
    Updated Dec 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew Barclay; Cristina Renzi; Antonis Antoniou; Spiros Denaxas; Hannah Harrison; Samantha Ip; Nora Pashayan; Ana Torralbo; Juliet Usher-Smith; Angela Wood; Georgios Lyratzopoulos (2023). Details of the different outcome measures considered. [Dataset]. http://doi.org/10.1371/journal.pdig.0000383.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 15, 2023
    Dataset provided by
    PLOS Digital Health
    Authors
    Matthew Barclay; Cristina Renzi; Antonis Antoniou; Spiros Denaxas; Hannah Harrison; Samantha Ip; Nora Pashayan; Ana Torralbo; Juliet Usher-Smith; Angela Wood; Georgios Lyratzopoulos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Details of the different outcome measures considered.

  16. DataSheet_1_Causal associations between site-specific cancer and diabetes...

    • frontiersin.figshare.com
    docx
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rong Xu; Tingjin Zheng; Chaoqun Ouyang; Xiaoming Ding; Chenjin Ge (2023). DataSheet_1_Causal associations between site-specific cancer and diabetes risk: A two-sample Mendelian randomization study.docx [Dataset]. http://doi.org/10.3389/fendo.2023.1110523.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Rong Xu; Tingjin Zheng; Chaoqun Ouyang; Xiaoming Ding; Chenjin Ge
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundBoth cancer and diabetes are complex chronic diseases that have high economic costs for society. The co-occurrence of these two diseases in people is already well known. The causal effects of diabetes on the development of several malignancies have been established, but the reverse causation of these two diseases (e.g., what type of cancer can cause T2D) has been less investigated.MethodsMultiple Mendelian randomization (MR) methods, such as the inverse-variance weighted (IVW) method, weighted median method, MR-Egger, and MR pleiotropy residual sum and outlier test, were performed to evaluate the causal association of overall and eight site-specific cancers with diabetes risk using genome-wide association study summary data from different consortia, such as Finngen and UK biobank.ResultsA suggestive level of evidence was observed for the causal association between lymphoid leukaemia and diabetes by using the IVW method in MR analyses (P = 0.033), indicating that lymphoid leukaemia increased diabetes risk with an odds ratio of 1.008 (95% confidence interval, 1.001-1.014). Sensitivity analyses using MR-Egger and weighted median methods showed consistent direction of the association compared with the IVW method. Overall and seven other site-specific cancers under investigation (i.e., multiple myeloma, non-Hodgkin lymphoma, and cancer of bladder, brain, stomach, lung, and pancreas) were not causally associated with diabetes risk.ConclusionsThe causal relationship between lymphoid leukaemia and diabetes risk points to the necessity of diabetes prevention amongst leukaemia survivors as a strategy for ameliorating the associated disease burden.

  17. S1 File -

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Oct 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qinyao Huang; Jianglong Guo; Hongjun Zhao; Yi Zheng; Yuying Zhang (2023). S1 File - [Dataset]. http://doi.org/10.1371/journal.pone.0292881.s015
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 19, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Qinyao Huang; Jianglong Guo; Hongjun Zhao; Yi Zheng; Yuying Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundReduced bone mineral density (BMD) and osteoporosis are common in chronic liver diseases. However, the causal effect of alcoholic liver disease (ALD) and non-alcoholic fatty liver disease (NAFLD) on BMD remains uncertain.ObjectivesThis study uses a two-sample Mendelian randomization (MR) design to evaluate the genetically predicted effect of ALD and NAFLD on BMDs using summary data from publically available genome-wide association studies (GWASs).MethodsThe GWAS summary statistics of ALD (1416 cases and 213,592 controls) and NAFLD (894 cases and 217,898 controls) were obtained from the FinnGen consortium. BMDs of four sites (total body, n = 56,284; femoral neck, n = 32,735; lumbar spine, n = 28,498; forearm, n = 8143) were from the GEnetic Factors for OSteoporosis Consortium. Data for alcohol consumption (n = 112,117) and smoking (n = 33,299) and serum 25-Hydroxyvitamin D (25-OHD) level (n = 417,580) were from UK-biobank. We first performed univariate MR analysis with the Inverse Variance Weighted (IVW) method as the primary analysis to investigate the genetically predicted effect of ALD or NAFLD on BMD. Then, multivariate MR and mediation analysis were performed to identify whether the effect was mediated by alcohol consumption, smoking, or serum 25-OHD level.ResultsThe MR results suggested a robust genetically predicted effect of ALD on reduced BMD in the femoral neck (FN-BMD) (IVW beta = -0.0288; 95% CI: -0.0488, -0.00871; P = 0.00494) but not the other three sites. Serum 25-OHD level exhibited a significant mediating effect on the association between ALD and reduced FN-BMD albeit the proportion of mediation was mild (2.21%). No significant effects of NAFLD, alcohol consumption, or smoking on BMD in four sites, or reverse effect of BMD on ALD or NAFLD were detected.ConclusionOur findings confirm the genetically predicted effect of ALD on reduced FN-BMD, and highlight the importance of periodic BMD and serum 25-OHD monitoring and vitamin D supplementation as needed in patients with ALD. Future research is required to validate our results and investigate the probable underlying mechanisms.

  18. f

    DataSheet_1_Fasting Insulin and Risk of Overall and 14 Site-Specific...

    • figshare.com
    docx
    Updated May 31, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Han Zhang; Doudou Li; Xiaozhuan Liu; Zhongxiao Wan; Zengli Yu; Yuming Wang; Xue Li (2023). DataSheet_1_Fasting Insulin and Risk of Overall and 14 Site-Specific Cancers: Evidence From Genetic Data.docx [Dataset]. http://doi.org/10.3389/fonc.2022.863340.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Frontiers
    Authors
    Han Zhang; Doudou Li; Xiaozhuan Liu; Zhongxiao Wan; Zengli Yu; Yuming Wang; Xue Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ObjectiveWhether fasting insulin (FI) plays a role in cancer risk remains unclear. This study aimed to investigate the association between FI and cancer risk and to explore its potential mediator role in the association between type 2 diabetes mellitus (T2DM) and cancer.MethodsTwo-sample Mendelian randomization (TSMR) analysis was performed to evaluate the effect of FI on overall and 14 site-specific cancers using genome-wide association study (GWAS) summary-level data from Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC) and consortia of 14 site-specific cancers. The primary MR approach was conducted by using the random-effect inverse-variance weighted (IVW) method, and sensitivity analyses were implemented by adopting weighted-median, weighted-mode, MR-Egger, and MR-PRESSO tests. Polygenic risk score analysis was executed by using individual-level data from UK Biobank to validate the findings from TSMR analyses. Multivariable Mendelian randomization (MVMR) was carried out to estimate the mediation effect of FI on the association between T2DM and cancer.ResultsTSMR study suggested that genetically determined high FI levels were associated with increased risk of colorectal cancer (odds ratio (OR) = 1.87, 95% CI: 1.23–2.84, p = 0.003) and endometrial cancer (OR = 1.89, 95% CI: 1.08–3.01, p = 0.008), but not associated with overall cancer risk or the other 12 studied cancer sites. Polygenic risk score analysis successfully replicated the association between genetic liability to high FI levels and the increased risk of colorectal and endometrial cancers. MVMR and MR mediation analyses detected an intermediary effect of FI and quantified that FI mediated 21.3% of the association between T2DM and endometrial cancer.ConclusionsThis study demonstrated that FI levels are associated with the risk of colorectal and endometrial cancers, and FI was found to play an intermediary role in the association between T2DM and endometrial cancer. The associations between FI and other cancers need to be further studied.

  19. STROBE-MR checklist.

    • plos.figshare.com
    xlsx
    Updated Sep 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu Wang; Haiyue Zhang; Yuanyuan Zhan; Zhuoran Li; Sujing Li; Yingchao Zhang; Shubin Guo (2025). STROBE-MR checklist. [Dataset]. http://doi.org/10.1371/journal.pone.0331023.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Sep 11, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Yu Wang; Haiyue Zhang; Yuanyuan Zhan; Zhuoran Li; Sujing Li; Yingchao Zhang; Shubin Guo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundSepsis is a severe systemic infection that can result in organ dysfunction and mortality. Dyslipidemia emerges as a key player in the intricate web of sepsis pathogenesis. Yet, the causal relationship between blood lipid profiles and sepsis risk remains uncertain. This study aims to investigate the association between genetically predicted lipid traits, drug targets, and sepsis.MethodsThe UK Biobank’s Genome-wide association studies (GWAS) produced data on lipid and apolipoprotein characteristics. Four independent GWAS datasets were used to generate the sepsis statistics. The study utilized the two-sample Mendelian randomization (MR) approach, which incorporates multivariable (MVMR) models, to assess the correlations between sepsis risk and lipid-related parameters. To gain further insight, expression quantitative trait loci (eQTL) data were used to investigate the significant drug targets for lipid-lowering.ResultsIncreasing ApoA-1 levels was associated with a diminished risk of sepsis (under 75) (OR 0.927, 95% CI 0.861–0.999; p = 0.047). This inverse correlation persevered even after performing multivariable MR. Elevated levels of HDL-C were associated with a decreased risk of sepsis (under 75) (OR 0.897, 95% CI 0.838–0.960; P = 0.002) and incidence of sepsis (OR 0.883, 95% CI 0.820–0.951; P = 0.001), which was consistent across sensitivity analyses. Furthermore, a decrease in total cholesterol exhibited a causal effect on sepsis in multivariable MR (OR 0.779, 95% CI 0.642–0.944; P = 0.01). The genetic variants related to lowering LDL-C, located near the HMGCR and LDLR genes, were predicted to elevate the risk of sepsis. Moreover, genetic mimicry near the ANGPTL3 and LPL gene suggested that reducing the activity of ANGPTL3 and LPL (mimicking antisense anti-ANGPTL3 and LPL agents) was forecasted to decrease sepsis risk.ConclusionGenetically inferred elevated ApoA-1, total cholesterol, and HDL-C manifest a protective effect against sepsis. Within the 9 lipid-lowering drug targets investigated ANGPTL3 and LPL exhibit potential as candidate drug targets for sepsis.

  20. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Bralten, J.B. (Radboud University); Roth Mota, N. (Radboud University); Klemann, C.J.H.M. (Radboud University); Witte, W. de (2021). Sociability GWAS in a population-based sample : summary statistics of a genome-wide association study of an aggregated sociability score in the UK Biobank [Dataset]. http://doi.org/10.17026/dans-ztj-zga6

Sociability GWAS in a population-based sample : summary statistics of a genome-wide association study of an aggregated sociability score in the UK Biobank

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
pdfAvailable download formats
Dataset updated
Mar 12, 2021
Dataset provided by
Data Archiving and Networked Services (DANS)
Authors
Bralten, J.B. (Radboud University); Roth Mota, N. (Radboud University); Klemann, C.J.H.M. (Radboud University); Witte, W. de
Area covered
northlimit=59.62358300012501; eastlimit=2.374666058072581; southlimit=49.568413008749225; westlimit=-8.205608345022652United Kingdom
Description

Levels of sociability are continuously distributed in the general population, and decreased sociability represents an early manifestation of several brain disorders. Here, we investigated the genetic underpinnings of sociability in the population.

Main question of our research: 1. Are there common genetic variants that are associated with sociability in the general population? 2. Are genetic variants that are associated with sociability also associated with neuropsychiatric disorders?

Type of data uploaded in this repository: The UK Biobank project (see https://www.ukbiobank.ac.uk/) is a large-scale biomedical database and research resource, containing in-depth genetic and health information from half a million UK participants. The database is globally accessible to approved researchers undertaking vital research into the most common and life-threatening diseases. The raw data that this project is based on comes from the publically available UK Biobank set, which is very large and is therefore not provided here. Here we only provide the results from our analysis, that is also described here: https://www.biorxiv.org/content/10.1101/781195v2 and currently in revision in a scientific journal. In the dataset you will find the association of 9327396 genetic variants with the phenotype sociability. This dataset is not applicable to be opened with Excel, and can best be opened on a cluster computer or using specfic software.

Subjects The UK Biobank (UKBB) is a major population-based cohort from the United Kingdom that includes individuals aged between 37 and 73 years. We constructed a sociability measure based on the the aggregation of scores per participant on four questions from the UKBB database that link to sociability, including (1) a question about the frequency of friend/family visits, (2) a question on the number and type of social venues that are visited, (3) a question about worrying after social embarrassment and (4) a question about feeling lonely, leading to a sociability score ranging from 0-4. Participants were excluded if they had somatic problems that could be related to social withdrawal (BMI < 15 or BMI > 40, narcolepsy (all the time), stroke, severe tinnitus, deafness or brain-related cancers) or if they answered that they had “No friends/family outside household” or “Do not know” or “Prefer not to answer” to any of the questions.

SNP genotyping and quality control Details about the available genome-wide genotyping data for UKBB participants have been reported previously (PMID: 30305743). We used third-release genotyping data (see https://biobank.ctsu.ox.ac.uk/crystal/label.cgi?id=100319). Briefly, 49,950 participants were genotyped using the UK BiLEVE Axiom Array and 438,427 participants were genotyped using UK Biobank Axiom Array. Genotypes were imputed into the dataset using the Haplotype Reference Consortium (HRC), and the UK10K haplotype resource. To account for ethnicity, we included only those individuals that identified themselves as "white" by self-report and plotted the Principal Components (PC) provided by the UKBB, excluding individuals considered to be outliers according to PCs 1 and 2. Genetic relatedness calculated with KING kinship and provided by the UKBB (https://kenhanscombe.github.io/ukbtools/articles/explore-ukb-data.html ; http://www.ukbiobank.ac.uk/wp-content/uploads/2014/04/UKBiobank_genotyping_QC_documentation-web.pdf) was used to identify first and second-degree relatives. Subsequently ´families´ (i.e. clusters of related individuals above an IBD>0.125 threshold) were created and only one individual from each of these created ‘families’ was included in the analysis. If self-reported sex and SNP-based sex differed, individuals were excluded from further analysis. Single nucleotide polymorphisms (SNPs) with minor allele frequency <0.005, Hardy-Weinberg equilibrium test P value<1e−6, missing genotype rate >0.05, and imputation quality of INFO <0.8 were excluded. In the current study, all analyses are based on 342,461 participants of European ancestry for which both genotype data and sociability scores were available.

Genome-wide association analysis Genome-wide association analysis with the imputed marker dosages was performed in PLINK1.9, using a linear regression model with the sociability measure as the dependent variable and including sex, age, 10 first PCs, assessment center, and genotype batch as covariates. SNPs were considered significantly associated if they had p-value < 5e-8. Associated loci were considered independent of each other at r2 0.6 and lead SNPs were classified as the SNP with the smallest association p-value and at r2 0.1, using a 250kb window. The summary statistics come from the plink2 linear regression analysis.

Search
Clear search
Close search
Google apps
Main menu