74 datasets found
  1. f

    Supplementary Data from Identification of Causal Plasma Proteins in...

    • aacr.figshare.com
    xlsx
    Updated May 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weihao Tang; Xiaoke Ma (2025). Supplementary Data from Identification of Causal Plasma Proteins in Hepatocellular Carcinoma via Two-Sample Mendelian Randomization and Integrative Transcriptomic‒Proteomic Analysis [Dataset]. http://doi.org/10.1158/2767-9764.29050894
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 13, 2025
    Dataset provided by
    American Association for Cancer Research
    Authors
    Weihao Tang; Xiaoke Ma
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary Data

  2. f

    Table 2 from Identification of Causal Plasma Proteins in Hepatocellular...

    • aacr.figshare.com
    xls
    Updated May 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weihao Tang; Xiaoke Ma (2025). Table 2 from Identification of Causal Plasma Proteins in Hepatocellular Carcinoma via Two-Sample Mendelian Randomization and Integrative Transcriptomic‒Proteomic Analysis [Dataset]. http://doi.org/10.1158/2767-9764.28581209
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 13, 2025
    Dataset provided by
    American Association for Cancer Research
    Authors
    Weihao Tang; Xiaoke Ma
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Results of protein-drug molecular docking

  3. The performance of different models on the common independent proteins.

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). The performance of different models on the common independent proteins. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The performance of different models on the common independent proteins.

  4. f

    Demographics of patients (SGH cohort).

    • figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Who-Whong Wang; Soo Fan Ang; Rajneesh Kumar; Charmain Heah; Andi Utama; Navessa Padma Tania; Huihua Li; Sze Huey Tan; Desmond Poo; Su Pin Choo; Wan Cheng Chow; Chee Kiat Tan; Han Chong Toh (2023). Demographics of patients (SGH cohort). [Dataset]. http://doi.org/10.1371/journal.pone.0068904.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Who-Whong Wang; Soo Fan Ang; Rajneesh Kumar; Charmain Heah; Andi Utama; Navessa Padma Tania; Huihua Li; Sze Huey Tan; Desmond Poo; Su Pin Choo; Wan Cheng Chow; Chee Kiat Tan; Han Chong Toh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Demographics of patients (SGH cohort).

  5. Integrative membrane proteome analysis of High and Low-metastatic Lung...

    • data.niaid.nih.gov
    • ebi.ac.uk
    xml
    Updated Aug 18, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yan Kong; Hui Lu (2020). Integrative membrane proteome analysis of High and Low-metastatic Lung Cancers [Dataset]. https://data.niaid.nih.gov/resources?id=pxd016912
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Aug 18, 2020
    Dataset provided by
    Shanghai Jiao Tong University
    SJTU-Yale Joint Center for Biostatistics, School of Life Sciences and Biotechnology Shanghai Jiao Tong University
    Authors
    Yan Kong; Hui Lu
    Variables measured
    Proteomics
    Description

    Membrane proteins play critical roles between the tumor cells and the extra-cellular matrix (ECM) during metastasis. In this study, we performed quantitative proteomic analysis of membrane proteins from two human giant-cell lung carcinoma cell strains, low- (95C) and high-(95D) metastatistic cell lines, and combining with microRNA analysis, we identified a multi-omics regulation module.

  6. f

    Statistic for the 62 optimal features selected by random forest.

    • plos.figshare.com
    xls
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). Statistic for the 62 optimal features selected by random forest. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Statistic for the 62 optimal features selected by random forest.

  7. f

    Differentially Expressed Proteins Identified by MALDI/TOF MS.

    • figshare.com
    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ge Gao; Chao Xuan; Qin Yang; Xiao-Cheng Liu; Zhi-Gang Liu; Guo-Wei He (2023). Differentially Expressed Proteins Identified by MALDI/TOF MS. [Dataset]. http://doi.org/10.1371/journal.pone.0072111.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ge Gao; Chao Xuan; Qin Yang; Xiao-Cheng Liu; Zhi-Gang Liu; Guo-Wei He
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    RVD = rheumatic valvular disease; DVD = degenerative valvular disease; MW = protein molecular weight; pI = isoelectric point.*Candidate proteins for validation;#UniProt Knowledgebase, http://expasy.org/uniprot.

  8. f

    Table S1 - Multivariate Modeling of Proteins Related to Trapezius Myalgia, a...

    • figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jenny Hadrevi; Bijar Ghafouri; Britt Larsson; Björn Gerdle; Fredrik Hellström (2023). Table S1 - Multivariate Modeling of Proteins Related to Trapezius Myalgia, a Comparative Study of Female Cleaners with or without Pain [Dataset]. http://doi.org/10.1371/journal.pone.0073285.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jenny Hadrevi; Bijar Ghafouri; Britt Larsson; Björn Gerdle; Fredrik Hellström
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Identified proteins by LC-MS/MS or MALDI-TOF-MS/MS. (DOCX)

  9. r

    Swedish Twin Registry

    • researchdata.se
    • demo.researchdata.se
    • +1more
    Updated Mar 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrik Magnusson (2025). Swedish Twin Registry [Dataset]. https://researchdata.se/en/catalogue/dataset/ext0163-1
    Explore at:
    Dataset updated
    Mar 21, 2025
    Dataset provided by
    Karolinska Institutet
    Authors
    Patrik Magnusson
    Time period covered
    1961
    Description

    STR has been based at the Karolinska Institutet since 1959, first at the Institution of Hygien and thereafter at Medical Epidemiology and Biostatistics, MEB. STR was originally created primarily to study the importance of environmental factors for the development of cardiovascular/respiratory diseases and cancer, but has since then evolved to a resource for all epidemiological and genetic aspects of ill health. The research that is based on STR is financed externally through grants that the users apply for individually. STR is in this way making up the basis for a lot of research; during the past decade over 50 articles have been published annually, where of several in high impact journals. STR has during the past decade transformed from being primarily an epidemiological resource to forming a biobank of samples (DNA, blood and serum) for a large number of twins. Genome-wide genotyping of close to 30 000 participants have been undertaken and the plan is that all DNA samples shall become genotyped on a genome-wide platform the coming few years. Serum from 12 600 twins have so far been used for measurements of classical blood biomarkers. Generated genotypes and biomarker measurements builds in an effective manner up the value of STR as an molecular epidemiological resource.

    Purpose:

    The goal of the Swedish Twin Registry (STR) is to provide a longitudinal research infrastructure in the form of a population-based twin cohort of adequate size and content to enable powerful epidemiological and molecular medical studies. The study designs used are classical epidemiological investigations of risk-factors for disease and death (providing within twin pair designs), genetic association studies, heritability studies (both twin model based and molecular based), epigenetics, proteomics as well as other types of "-omics" approaches. STR is open for Swedish researchers and international researchers that have a Swedish collaborator.

  10. f

    The performance of the RF model by 10-fold cross validation test.

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). The performance of the RF model by 10-fold cross validation test. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The performance of the RF model by 10-fold cross validation test.

  11. f

    The performance of the RF models based on the six residue sets.

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). The performance of the RF models based on the six residue sets. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The performance of the RF models based on the six residue sets.

  12. Comparison of ISHDSF rank and hypothesis based gene selection results to...

    • plos.figshare.com
    xls
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhongming Zhao; Bradley T. Webb; Peilin Jia; T. Bernard Bigdeli; Brion S. Maher; Edwin van den Oord; Sarah E. Bergen; Richard L. Amdur; Francis A. O'Neill; Dermot Walsh; Dawn L. Thiselton; Xiangning Chen; Carlos N. Pato; Brien P. Riley; Kenneth S. Kendler; Ayman H. Fanous (2023). Comparison of ISHDSF rank and hypothesis based gene selection results to random gene selection in schizophrenia CATIE and GAIN GWAS datasets. [Dataset]. http://doi.org/10.1371/journal.pone.0067776.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Zhongming Zhao; Bradley T. Webb; Peilin Jia; T. Bernard Bigdeli; Brion S. Maher; Edwin van den Oord; Sarah E. Bergen; Richard L. Amdur; Francis A. O'Neill; Dermot Walsh; Dawn L. Thiselton; Xiangning Chen; Carlos N. Pato; Brien P. Riley; Kenneth S. Kendler; Ayman H. Fanous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    aCategory 1: genes are both highly ranked and involved in neurodevelopment. Category 2: genes are exclusive to neurodevelopment. Category 3: genes are exclusively highly ranked (see details in text).bWe performed simulations by four methods: 1) based on the count of SNPs, 2) based on the minimum p-value, 3) based on the number of SNPs with p

  13. Comparison of the Adequacy Index and Log Likelihood ratios of diagnostic...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Who-Whong Wang; Soo Fan Ang; Rajneesh Kumar; Charmain Heah; Andi Utama; Navessa Padma Tania; Huihua Li; Sze Huey Tan; Desmond Poo; Su Pin Choo; Wan Cheng Chow; Chee Kiat Tan; Han Chong Toh (2023). Comparison of the Adequacy Index and Log Likelihood ratios of diagnostic models. [Dataset]. http://doi.org/10.1371/journal.pone.0068904.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Who-Whong Wang; Soo Fan Ang; Rajneesh Kumar; Charmain Heah; Andi Utama; Navessa Padma Tania; Huihua Li; Sze Huey Tan; Desmond Poo; Su Pin Choo; Wan Cheng Chow; Chee Kiat Tan; Han Chong Toh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of the Adequacy Index and Log Likelihood ratios of diagnostic models.

  14. Demographics of the Study Population.

    • plos.figshare.com
    xls
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ge Gao; Chao Xuan; Qin Yang; Xiao-Cheng Liu; Zhi-Gang Liu; Guo-Wei He (2023). Demographics of the Study Population. [Dataset]. http://doi.org/10.1371/journal.pone.0072111.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Ge Gao; Chao Xuan; Qin Yang; Xiao-Cheng Liu; Zhi-Gang Liu; Guo-Wei He
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All variables displayed as mean ± SEM (n = 40 in each group);RVD = rheumatic valvular disease;DVD = degenerative valvular disease; M = male; F = female; GLU = glucose; TBIL = total bilirubin;GPT = glutamic-pyruvic transaminase; TCHOL = total cholesterol; TG = triglyceride; HDL-C = high-density lipoprotein cholesterol; LDL-C = low-density lipoprotein cholesterol; UA = uric acid; CREA = creatinine.P>0.05 in all comparisons between any two of these groups (unpaired t-test for all continuous numbers).

  15. f

    The performance of the RF models based on the six feature groups.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). The performance of the RF models based on the six feature groups. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The performance of the RF models based on the six feature groups.

  16. f

    Protein Spot with Variable of importance (VIP) value >1.0 in the PLS-DA...

    • figshare.com
    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jenny Hadrevi; Bijar Ghafouri; Britt Larsson; Björn Gerdle; Fredrik Hellström (2023). Protein Spot with Variable of importance (VIP) value >1.0 in the PLS-DA multivariate model. [Dataset]. http://doi.org/10.1371/journal.pone.0073285.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jenny Hadrevi; Bijar Ghafouri; Britt Larsson; Björn Gerdle; Fredrik Hellström
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Proteins identified with LC-MS/MS, MALDI-TOF-MS/MS. ID is the same as in figure 2 (Weight-plot PLS-DA).

  17. f

    Comparison results of our method, Effective T3, BPBAac and BEAN.

    • plos.figshare.com
    xls
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). Comparison results of our method, Effective T3, BPBAac and BEAN. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison results of our method, Effective T3, BPBAac and BEAN.

  18. Effective Identification of Gram-Negative Bacterial Type III Secreted...

    • plos.figshare.com
    pdf
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). Effective Identification of Gram-Negative Bacterial Type III Secreted Effectors Using Position-Specific Residue Conservation Profiles [Dataset]. http://doi.org/10.1371/journal.pone.0084439
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroudType III secretion systems (T3SSs) are central to the pathogenesis and specifically deliver their secreted substrates (type III secreted proteins, T3SPs) into host cells. Since T3SPs play a crucial role in pathogen-host interactions, identifying them is crucial to our understanding of the pathogenic mechanisms of T3SSs. This study reports a novel and effective method for identifying the distinctive residues which are conserved different from other SPs for T3SPs prediction. Moreover, the importance of several sequence features was evaluated and further, a promising prediction model was constructed.ResultsBased on the conservation profiles constructed by a position-specific scoring matrix (PSSM), 52 distinctive residues were identified. To our knowledge, this is the first attempt to identify the distinct residues of T3SPs. Of the 52 distinct residues, the first 30 amino acid residues are all included, which is consistent with previous studies reporting that the secretion signal generally occurs within the first 30 residue positions. However, the remaining 22 positions span residues 30–100 were also proven by our method to contain important signal information for T3SP secretion because the translocation of many effectors also depends on the chaperone-binding residues that follow the secretion signal. For further feature optimisation and compression, permutation importance analysis was conducted to select 62 optimal sequence features. A prediction model across 16 species was developed using random forest to classify T3SPs and non-T3 SPs, with high receiver operating curve of 0.93 in the 10-fold cross validation and an accuracy of 94.29% for the test set. Moreover, when performing on a common independent dataset, the results demonstrate that our method outperforms all the others published to date. Finally, the novel, experimentally confirmed T3 effectors were used to further demonstrate the model’s correct application. The model and all data used in this paper are freely available at http://cic.scu.edu.cn/bioinformatics/T3SPs.zip.

  19. f

    The conservation differences analyzed by SAM for N-terminal 100 residues.

    • figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li (2023). The conservation differences analyzed by SAM for N-terminal 100 residues. [Dataset]. http://doi.org/10.1371/journal.pone.0084439.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaojiao Yang; Yanzhi Guo; Jiesi Luo; Xuemei Pu; Menglong Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The conservation differences analyzed by SAM for N-terminal 100 residues.

  20. Association Study of 167 Candidate Genes for Schizophrenia Selected by a...

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhongming Zhao; Bradley T. Webb; Peilin Jia; T. Bernard Bigdeli; Brion S. Maher; Edwin van den Oord; Sarah E. Bergen; Richard L. Amdur; Francis A. O'Neill; Dermot Walsh; Dawn L. Thiselton; Xiangning Chen; Carlos N. Pato; Brien P. Riley; Kenneth S. Kendler; Ayman H. Fanous (2023). Association Study of 167 Candidate Genes for Schizophrenia Selected by a Multi-Domain Evidence-Based Prioritization Algorithm and Neurodevelopmental Hypothesis [Dataset]. http://doi.org/10.1371/journal.pone.0067776
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Zhongming Zhao; Bradley T. Webb; Peilin Jia; T. Bernard Bigdeli; Brion S. Maher; Edwin van den Oord; Sarah E. Bergen; Richard L. Amdur; Francis A. O'Neill; Dermot Walsh; Dawn L. Thiselton; Xiangning Chen; Carlos N. Pato; Brien P. Riley; Kenneth S. Kendler; Ayman H. Fanous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Integrating evidence from multiple domains is useful in prioritizing disease candidate genes for subsequent testing. We ranked all known human genes (n = 3819) under linkage peaks in the Irish Study of High-Density Schizophrenia Families using three different evidence domains: 1) a meta-analysis of microarray gene expression results using the Stanley Brain collection, 2) a schizophrenia protein-protein interaction network, and 3) a systematic literature search. Each gene was assigned a domain-specific p-value and ranked after evaluating the evidence within each domain. For comparison to this ranking process, a large-scale candidate gene hypothesis was also tested by including genes with Gene Ontology terms related to neurodevelopment. Subsequently, genotypes of 3725 SNPs in 167 genes from a custom Illumina iSelect array were used to evaluate the top ranked vs. hypothesis selected genes. Seventy-three genes were both highly ranked and involved in neurodevelopment (category 1) while 42 and 52 genes were exclusive to neurodevelopment (category 2) or highly ranked (category 3), respectively. The most significant associations were observed in genes PRKG1, PRKCE, and CNTN4 but no individual SNPs were significant after correction for multiple testing. Comparison of the approaches showed an excess of significant tests using the hypothesis-driven neurodevelopment category. Random selection of similar sized genes from two independent genome-wide association studies (GWAS) of schizophrenia showed the excess was unlikely by chance. In a further meta-analysis of three GWAS datasets, four candidate SNPs reached nominal significance. Although gene ranking using integrated sources of prior information did not enrich for significant results in the current experiment, gene selection using an a priori hypothesis (neurodevelopment) was superior to random selection. As such, further development of gene ranking strategies using more carefully selected sources of information is warranted.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Weihao Tang; Xiaoke Ma (2025). Supplementary Data from Identification of Causal Plasma Proteins in Hepatocellular Carcinoma via Two-Sample Mendelian Randomization and Integrative Transcriptomic‒Proteomic Analysis [Dataset]. http://doi.org/10.1158/2767-9764.29050894

Supplementary Data from Identification of Causal Plasma Proteins in Hepatocellular Carcinoma via Two-Sample Mendelian Randomization and Integrative Transcriptomic‒Proteomic Analysis

Related Article
Explore at:
xlsxAvailable download formats
Dataset updated
May 13, 2025
Dataset provided by
American Association for Cancer Research
Authors
Weihao Tang; Xiaoke Ma
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Supplementary Data

Search
Clear search
Close search
Google apps
Main menu