100+ datasets found
  1. D

    Replication Data for: Uncertainty-Aware Principal Component Analysis

    • darus.uni-stuttgart.de
    Updated Dec 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen (2022). Replication Data for: Uncertainty-Aware Principal Component Analysis [Dataset]. http://doi.org/10.18419/DARUS-2321
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 7, 2022
    Dataset provided by
    DaRUS
    Authors
    Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen
    License

    https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321

    Dataset funded by
    DFG
    Description

    This dataset contains the source code for uncertainty-aware principal component analysis (UA-PCA) and a series of images that show dimensionality reduction plots created with UA-PCA. The software is a JavaScript library for performing principal component analysis and dimensionality reduction on datasets consisting of multivariate probability distributions. Each plot of the image series used UA-PCA to project a dataset consisting of multivariate normal distributions. The covariance matrices of the dataset instances were scaled with different factors resulting in different UA-PCA projections. The projected probability distributions are displayed using isolines of their probability density functions. As the scaling value increases, the projection changes, showing the sensitivity of UA-PCA to changes in variance.

  2. f

    Data from: Visualization of Molecular Fingerprints

    • acs.figshare.com
    zip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John R. Owen; Ian T. Nabney; José L. Medina-Franco; Fabian López-Vallejo (2023). Visualization of Molecular Fingerprints [Dataset]. http://doi.org/10.1021/ci1004042.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    ACS Publications
    Authors
    John R. Owen; Ian T. Nabney; José L. Medina-Franco; Fabian López-Vallejo
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    A visualization plot of a data set of molecular data is a useful tool for gaining insight into a set of molecules. In chemoinformatics, most visualization plots are of molecular descriptors, and the statistical model most often used to produce a visualization is principal component analysis (PCA). This paper takes PCA, together with four other statistical models (NeuroScale, GTM, LTM, and LTM-LIN), and evaluates their ability to produce clustering in visualizations not of molecular descriptors but of molecular fingerprints. Two different tasks are addressed: understanding structural information (particularly combinatorial libraries) and relating structure to activity. The quality of the visualizations is compared both subjectively (by visual inspection) and objectively (with global distance comparisons and local k-nearest-neighbor predictors). On the data sets used to evaluate clustering by structure, LTM is found to perform significantly better than the other models. In particular, the clusters in LTM visualization space are consistent with the relationships between the core scaffolds that define the combinatorial sublibraries. On the data sets used to evaluate clustering by activity, LTM again gives the best performance but by a smaller margin. The results of this paper demonstrate the value of using both a nonlinear projection map and a Bernoulli noise model for modeling binary data.

  3. f

    A two-dimensional PCA plot obtained from a multiple factor analysis (MFA)...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Jul 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Balaji, Thiruppathi; Vaishnav, Keshav; Sarma, Devojit Kumar; Kumar, Mayandi Senthil; Paramasivan, Rajaiah; Mohanty, Suman S.; Barik, Tapan Kumar; Srivastava, Hemlata; Kalimuthu, Mariapillai; Kumar, Narendran Pradeep; Tyagi, Suchi; Suman, Devi Shankar; Kumar, Ashwani; Sumitha, Melveettil Kishor; Kumar, Devendra; Sunish, Ittoop Pulikkottil; Patil, Prabhakargouda B.; Bhowmick, Ipsita Pal; Singh, Om P.; Uragayala, Sreehari; Gupta, Bhavna (2023). A two-dimensional PCA plot obtained from a multiple factor analysis (MFA) performed on all 22 populations using 142 bioclimatic variables retrieved from the WorldClim database. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000977988
    Explore at:
    Dataset updated
    Jul 27, 2023
    Authors
    Balaji, Thiruppathi; Vaishnav, Keshav; Sarma, Devojit Kumar; Kumar, Mayandi Senthil; Paramasivan, Rajaiah; Mohanty, Suman S.; Barik, Tapan Kumar; Srivastava, Hemlata; Kalimuthu, Mariapillai; Kumar, Narendran Pradeep; Tyagi, Suchi; Suman, Devi Shankar; Kumar, Ashwani; Sumitha, Melveettil Kishor; Kumar, Devendra; Sunish, Ittoop Pulikkottil; Patil, Prabhakargouda B.; Bhowmick, Ipsita Pal; Singh, Om P.; Uragayala, Sreehari; Gupta, Bhavna
    Description

    A two-dimensional PCA plot obtained from a multiple factor analysis (MFA) performed on all 22 populations using 142 bioclimatic variables retrieved from the WorldClim database.

  4. Consequences of PCA graphs, SNP codings, and PCA variants for elucidating...

    • plos.figshare.com
    xlsx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen (2023). Consequences of PCA graphs, SNP codings, and PCA variants for elucidating population structure [Dataset]. http://doi.org/10.1371/journal.pone.0218306
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SNP datasets are high-dimensional, often with thousands to millions of SNPs and hundreds to thousands of samples or individuals. Accordingly, PCA graphs are frequently used to provide a low-dimensional visualization in order to display and discover patterns in SNP data from humans, animals, plants, and microbes—especially to elucidate population structure. PCA is not a single method that is always done the same way, but rather requires three choices which we explore as a three-way factorial: two kinds of PCA graphs by three SNP codings by six PCA variants. Our main three recommendations are simple and easily implemented: Use PCA biplots, SNP coding 1 for the rare allele and 0 for the common allele, and double-centered PCA (or AMMI1 if main effects are also of interest). We also document contemporary practices by a literature survey of 125 representative articles that apply PCA to SNP data, find that virtually none implement our recommendations. The ultimate benefit from informed and optimal choices of PCA graph, SNP coding, and PCA variant, is expected to be discovery of more biology, and thereby acceleration of medical, agricultural, and other vital applications.

  5. The X-loading values for the first principal component of the PCA score...

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erwin van Vliet; Elisenda Eixarch; Miriam Illa; Ariadna Arbat-Plana; Anna González-Tendero; Helena T. Hogberg; Liang Zhao; Thomas Hartung; Eduard Gratacos (2023). The X-loading values for the first principal component of the PCA score plot. [Dataset]. http://doi.org/10.1371/journal.pone.0064545.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Erwin van Vliet; Elisenda Eixarch; Miriam Illa; Ariadna Arbat-Plana; Anna González-Tendero; Helena T. Hogberg; Liang Zhao; Thomas Hartung; Eduard Gratacos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    X-loading values show positive and negative correlations responsible for the cluster formation along the first principal component (PC1) in the PCA score plot.

  6. DESeq2 DGEAnalysisGSE44076 QC,PCA,Heatmaps,Volcano

    • kaggle.com
    zip
    Updated Nov 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. Nagendra (2025). DESeq2 DGEAnalysisGSE44076 QC,PCA,Heatmaps,Volcano [Dataset]. https://www.kaggle.com/datasets/mannekuntanagendra/deseq2-dgeanalysisgse44076-qcpcaheatmapsvolcano
    Explore at:
    zip(1796357 bytes)Available download formats
    Dataset updated
    Nov 29, 2025
    Authors
    Dr. Nagendra
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset contains a Differential Gene Expression (DGE) analysis of GSE44076.

    The analysis compares tumor versus normal samples.

    It uses the DESeq2 package for RNA-seq count data analysis.

    The dataset includes quality control (QC) visualizations.

    Principal Component Analysis (PCA) plots are provided for sample clustering.

    Heatmaps illustrate the expression patterns of top differentially expressed genes.

    EnhancedVolcano plots are included to visualize significant genes.

    The dataset enables users to explore gene expression changes in colorectal cancer.

    All R scripts and associated visualizations are included for reproducibility.

    The workflow can be adapted for other RNA-seq datasets.

    The dataset supports bioinformatics, transcriptomics, and cancer research studies.

    It provides an educational resource for DESeq2-based RNA-seq analysis.

  7. Principal Component Analysis (PCA) table of the full dataset on N = 23...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deanna Gigliotti; Jeff R. S. Leiter; Peter B. MacDonald; Jason Peeler; Judy E. Anderson (2023). Principal Component Analysis (PCA) table of the full dataset on N = 23 variables, showing correlation coefficients for variables loaded on the 3 PCs extracted from the analysis of data from 27 participants. [Dataset]. http://doi.org/10.1371/journal.pone.0162494.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Deanna Gigliotti; Jeff R. S. Leiter; Peter B. MacDonald; Jason Peeler; Judy E. Anderson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the row for each variable, numbers indicate the strength of correlation of that variable with the eigenvector of each PC. When the absolute value of correlation coefficients was ≥ 0.3, they were considered important (bold font) in defining the PC. Variables loaded on PCs 1–3 below, appear in the PCA plot (Fig 4A) with an asterisk (*) to indicate they also project upward (since PC3 is perpendicular to axes for PCs 1 and 2).

  8. d

    coastDat-2 Hydrodynamic Model TRIM-NP Principal Component Analysis Residual...

    • demo-b2find.dkrz.de
    Updated Sep 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). coastDat-2 Hydrodynamic Model TRIM-NP Principal Component Analysis Residual Currents - Dataset - B2FIND [Dataset]. http://demo-b2find.dkrz.de/dataset/09fd34c9-4d32-5bcb-a0e3-f3901ac83ae8
    Explore at:
    Dataset updated
    Sep 20, 2025
    Description

    Simulated 2D residual velocity fields in the inner German Bight were subjected to Principal Component Analysis (PCA). Residual currents were obtained from coastDat2 barotropic 2D simulations with the hydrodynamic model TRIM-NP V2.1.22 in barotropic 2D mode on a Cartesian grid (1.6km spatial resolution) stored on an hourly basis for the years 1948 - 2012 (doi:10.1594/WDCC/coastDat-2_TRIM-NP-2d) and later extended until August 2015. The present analysis refers to the period Jan 1958 - Aug 2015. The spatial domain considered is the region to the east of 6 degrees east and to the south of 55.6 degrees north. All grid nodes with a bathymetry of less than 10m were excluded. Residual velocities were calculated in two different ways: 1.) as 25h means, 2.) as monthly means. Both types of residual current data are available from * RESIDUAL_CURRENTS_195801_201508 The directory contains sub-directories for years and months. Daily residual currents for the 13th of September 1974, for instance, are stored in * RESIDUAL_CURRENTS_195801_201508/YEAR_1974/MONTH_09/TRIM2D_1974_09_13_means.nc while monthly mean residual currents for September 1974 are stored in: * RESIDUAL_CURRENTS_195801_201508/YEAR_1974/TRIM2D_1974_09_means.nc All current fields provided were interpolated from the original Cartesian model grid to a more convenient regular geographical grid (116x76 nodes). Mean residual currents are stored in: * mean_residual_currents.nc This data set contains residual velocities both on original Cartesian grid nodes and interpolated to the geographical grid. An example plot is provided: * mean_residual_currents.png For PCA, two residual velocity components from each of 12133 Cartesian grid nodes were combined into one data vector (length 2x12133), referring to 21061 daily or 692 monthly time levels. Results of two independent PCAs for either daily or monthly mean fields are stored in: * PCA_daily_residual_currents.nc * PCA_monthly_residual_currents.nc Files contain three leading Principal Components (PCs) and corresponding Emipirical Orthogonal Functions (EOFs). Again EOFs were also interpolated to a regular geographical grid. PC time series are also stored in plain ASCII format: * PCs_daily.txt * PCs_monthly.txt For monthly fields the number N of variables (N=2x12133) is much larger than the number T of time levels (T=692). Therefore, to reduce computational demands, the roles of time and space were formally interchanged. Having conducted the PCA the EOFs were then transformed back to the original spatial coordinates (cf. Section 12.2.6 in von Storch and Zwiers (1999), Statistical Analysis in Climate Research, Cambridge University Press). A much larger number of time levels made even this approach prohibitive for the full set of daily data. Therefore, PCAs were performed for six sub-periods (1958-1965, 1966-1975, 1976-1985, 1986-1995, 1996-2005, 2006-2015(Aug)) independently. EOFs obtained from these six sub-periods were then averaged to obtain EOFs representative for the whole period. Corresponding PCs were calculated by projecting daily fields onto these average EOFs. IMPORTANT: In contrast with PCA of monthly data, the PCA of daily data INVOLVES SOME APPROXIMATIONS! EOFs on the original nodes were normalized to have unit lengths. The following figures, * daily_EOF1.png * daily_EOF2.png * daily_EOF3.png show the first three EOFs obtained from daily data, assuming that corresponding PCs have the value of one standard deviation. The following two plots, * monthly_EOF1.png * monthly_EOF2.png show the leading EOFs for monthly mean data. EOF3 is omitted as it represents just a very small percentage of overall variance (1.7%).

  9. d

    The principal components of electoral regimes

    • datadryad.org
    zip
    Updated Aug 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karoline Wiesner; Samuel Bien; Matthew Wilson (2024). The principal components of electoral regimes [Dataset]. http://doi.org/10.5061/dryad.np5hqc030
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 2, 2024
    Dataset provided by
    Dryad
    Authors
    Karoline Wiesner; Samuel Bien; Matthew Wilson
    Time period covered
    Jul 12, 2024
    Description

    We use version 12 (2022) of the V-Dem data (https://www.V-Dem.net) and apply standard principal component analysis (PCA). Following standard procedure, we normalized each V-Dem variable (i.e. centered it to a mean of zero and rescaled it to a variance of one) prior to performing PCA. For better readability of the plots, we rescaled all principal components uniformly such that the first component has a maximum absolute value of one (i.e. its values are bounded by [-1,1]) while preserving the mean of zero for all components. We further re-oriented each component such that its strongest loading is positive.

  10. T

    United States - Balance Sheet: Total Risk Based Capital (PCA Definition)

    • tradingeconomics.com
    csv, excel, json, xml
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). United States - Balance Sheet: Total Risk Based Capital (PCA Definition) [Dataset]. https://tradingeconomics.com/united-states/balance-sheet-total-risk-based-capital-pca-definition-fed-data.html
    Explore at:
    json, csv, xml, excelAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1976 - Dec 31, 2025
    Area covered
    United States
    Description

    United States - Balance Sheet: Total Risk Based Capital (PCA Definition) was 2364706.75300 Mil. of U.S. $ in April of 2025, according to the United States Federal Reserve. Historically, United States - Balance Sheet: Total Risk Based Capital (PCA Definition) reached a record high of 2364706.75300 in April of 2025 and a record low of 322350.26900 in January of 1990. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Balance Sheet: Total Risk Based Capital (PCA Definition) - last updated from the United States Federal Reserve on November of 2025.

  11. f

    Heatmap of pairwise correlations and principal component analysis (PCA) of...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    • +1more
    Updated Aug 5, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sawyer, Jason E.; Spencer, Thomas E.; Bauersachs, Stefan; Madsen, Crystal A.; Geary, Thomas W.; Minten, Megan A.; Bilby, Todd R.; Tibary, Ahmed; Neibergs, Holly L.; Bruno, Ralph G. S.; Allen, Carolyn C.; Wang, Zeping (2013). Heatmap of pairwise correlations and principal component analysis (PCA) of microarray data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001687053
    Explore at:
    Dataset updated
    Aug 5, 2013
    Authors
    Sawyer, Jason E.; Spencer, Thomas E.; Bauersachs, Stefan; Madsen, Crystal A.; Geary, Thomas W.; Minten, Megan A.; Bilby, Todd R.; Tibary, Ahmed; Neibergs, Holly L.; Bruno, Ralph G. S.; Allen, Carolyn C.; Wang, Zeping
    Description

    (A) Microarray data were filtered for detectable probes and normalized with the BioConductor package vsn. Normalized data were used for calculation of pairwise distances and drawing of a heatmap by use of the BioConductor package geneplotter. Each column represents one sample and shows the correlation to all samples (including itself), with red for correlation = 1 and blue for the lowest observed correlation. Note the clear homogeneity in the samples from fertility classified heifers (HF, high fertile; SF, subfertile; IF, infertile). (B) PCA is a plot distribution indicating the source of greatest variation in the overall transcriptional profiles of the samples. Each symbol represents one replicate. Note the clear lack of separation of samples based on fertility classifications (HF, high fertile; SF, subfertile; IF, infertile).

  12. T

    United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition)

    • tradingeconomics.com
    csv, excel, json, xml
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) [Dataset]. https://tradingeconomics.com/united-states/balance-sheet-tier-1-risk-based-capital-pca-definition-fed-data.html
    Explore at:
    json, xml, excel, csvAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1976 - Dec 31, 2025
    Area covered
    United States
    Description

    United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) was 2271005.21400 Mil. of U.S. $ in April of 2025, according to the United States Federal Reserve. Historically, United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) reached a record high of 2271005.21400 in April of 2025 and a record low of 147414.03800 in January of 1984. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) - last updated from the United States Federal Reserve on October of 2025.

  13. T

    United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition)

    • tradingeconomics.com
    csv, excel, json, xml
    Updated May 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TRADING ECONOMICS (2025). United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) [Dataset]. https://tradingeconomics.com/united-states/balance-sheet-tier-1-leverage-capital-pca-definition-fed-data.html
    Explore at:
    csv, excel, json, xmlAvailable download formats
    Dataset updated
    May 15, 2025
    Dataset authored and provided by
    TRADING ECONOMICS
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1976 - Dec 31, 2025
    Area covered
    United States
    Description

    United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) was 2271005.21400 Mil. of U.S. $ in April of 2025, according to the United States Federal Reserve. Historically, United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) reached a record high of 2271005.21400 in April of 2025 and a record low of 147414.03800 in January of 1984. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) - last updated from the United States Federal Reserve on November of 2025.

  14. q

    Using PCA to study small mammal biomechanics

    • qubeshub.org
    Updated May 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abagael West; Stephanie Smith (2025). Using PCA to study small mammal biomechanics [Dataset]. http://doi.org/10.25334/6KCX-M612
    Explore at:
    Dataset updated
    May 22, 2025
    Dataset provided by
    QUBES
    Authors
    Abagael West; Stephanie Smith
    Description

    In this lesson, students interpret a scatter plot showing the results of a principal components analysis (PCA). They view an interview with Dr. Stephanie Smith, who explains how PCA calculations work, and why she chose to use this analysis to visualize her data. Dr. Smith also discusses her journey becoming a scientist and describes a typical day at work.

  15. A Network View on Psychiatric Disorders: Network Clusters of Symptoms as...

    • plos.figshare.com
    • figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rutger Goekoop; Jaap G. Goekoop (2023). A Network View on Psychiatric Disorders: Network Clusters of Symptoms as Elementary Syndromes of Psychopathology [Dataset]. http://doi.org/10.1371/journal.pone.0112734
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Rutger Goekoop; Jaap G. Goekoop
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionThe vast number of psychopathological syndromes that can be observed in clinical practice can be described in terms of a limited number of elementary syndromes that are differentially expressed. Previous attempts to identify elementary syndromes have shown limitations that have slowed progress in the taxonomy of psychiatric disorders.AimTo examine the ability of network community detection (NCD) to identify elementary syndromes of psychopathology and move beyond the limitations of current classification methods in psychiatry.Methods192 patients with unselected mental disorders were tested on the Comprehensive Psychopathological Rating Scale (CPRS). Principal component analysis (PCA) was performed on the bootstrapped correlation matrix of symptom scores to extract the principal component structure (PCS). An undirected and weighted network graph was constructed from the same matrix. Network community structure (NCS) was optimized using a previously published technique.ResultsIn the optimal network structure, network clusters showed a 89% match with principal components of psychopathology. Some 6 network clusters were found, including "DEPRESSION", "MANIA", “ANXIETY”, "PSYCHOSIS", "RETARDATION", and "BEHAVIORAL DISORGANIZATION". Network metrics were used to quantify the continuities between the elementary syndromes.ConclusionWe present the first comprehensive network graph of psychopathology that is free from the biases of previous classifications: a ‘Psychopathology Web’. Clusters within this network represent elementary syndromes that are connected via a limited number of bridge symptoms. Many problems of previous classifications can be overcome by using a network approach to psychopathology.

  16. Wine Dataset for PCA

    • kaggle.com
    zip
    Updated Feb 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DEV AHUJA (2024). Wine Dataset for PCA [Dataset]. https://www.kaggle.com/datasets/devahuja2808/wine-dataset-for-pca
    Explore at:
    zip(4522 bytes)Available download formats
    Dataset updated
    Feb 23, 2024
    Authors
    DEV AHUJA
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by DEV AHUJA

    Released under Apache 2.0

    Contents

  17. f

    Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Jun 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Spencer, Paulette; Maksumova, Zoya; Raj, Srilakshmi M.; Isshiki, Mariko; Colón, Mirtha; Isasi, Carmen R.; Klugman, Susan D.; Suglia, Shakira; Griffen, Anthony J.; Meissner, Paul; Cabana, Michael D.; Greally, John M. (2025). Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC participants and reference panels. (B) PCA plots highlighting individuals belonging to the 14 IBD clusters detected in AoU NYC participants. (C) SCOPE analysis for AoU NYC participants labelled with the 14 IBD clusters. Each color represents global ancestry proportion of the five superpopulations (African, European, American, East Asian and South Asian) inferred using supervised mode in SCOPE. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002064867
    Explore at:
    Dataset updated
    Jun 24, 2025
    Authors
    Spencer, Paulette; Maksumova, Zoya; Raj, Srilakshmi M.; Isshiki, Mariko; Colón, Mirtha; Isasi, Carmen R.; Klugman, Susan D.; Suglia, Shakira; Griffen, Anthony J.; Meissner, Paul; Cabana, Michael D.; Greally, John M.
    Area covered
    New York
    Description

    Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC participants and reference panels. (B) PCA plots highlighting individuals belonging to the 14 IBD clusters detected in AoU NYC participants. (C) SCOPE analysis for AoU NYC participants labelled with the 14 IBD clusters. Each color represents global ancestry proportion of the five superpopulations (African, European, American, East Asian and South Asian) inferred using supervised mode in SCOPE.

  18. Wine12

    • kaggle.com
    zip
    Updated Nov 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    gaurav9712 (2020). Wine12 [Dataset]. https://www.kaggle.com/gaurav9712/wine12
    Explore at:
    zip(4491 bytes)Available download formats
    Dataset updated
    Nov 23, 2020
    Authors
    gaurav9712
    Description

    Perform Principal component analysis and perform clustering using first 3 principal component scores (both heirarchial and k mean clustering(scree plot or elbow curve) and obtain optimum number of clusters and check whether we have obtained same number of clusters with the original data (class column we have ignored at the begining who shows it has 3 clusters)df

  19. F

    Balance Sheet: Total Risk Based Capital (PCA Definition)

    • fred.stlouisfed.org
    json
    Updated Oct 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Balance Sheet: Total Risk Based Capital (PCA Definition) [Dataset]. https://fred.stlouisfed.org/series/QBPBSTRSKK
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Oct 2, 2025
    License

    https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain

    Description

    Graph and download economic data for Balance Sheet: Total Risk Based Capital (PCA Definition) (QBPBSTRSKK) from Q1 1990 to Q2 2025 about capital and USA.

  20. Augmented ANOVA table for Individual-Centered PCA of SNP data on oats, using...

    • plos.figshare.com
    xls
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen (2023). Augmented ANOVA table for Individual-Centered PCA of SNP data on oats, using SNP coding rare = 1. [Dataset]. http://doi.org/10.1371/journal.pone.0218306.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PCA is applied to SNP main effects and S×I interaction effects combined (S&S×I), and the portion of each is shown in the last two columns.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen (2022). Replication Data for: Uncertainty-Aware Principal Component Analysis [Dataset]. http://doi.org/10.18419/DARUS-2321

Replication Data for: Uncertainty-Aware Principal Component Analysis

Related Article
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 7, 2022
Dataset provided by
DaRUS
Authors
Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen
License

https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321

Dataset funded by
DFG
Description

This dataset contains the source code for uncertainty-aware principal component analysis (UA-PCA) and a series of images that show dimensionality reduction plots created with UA-PCA. The software is a JavaScript library for performing principal component analysis and dimensionality reduction on datasets consisting of multivariate probability distributions. Each plot of the image series used UA-PCA to project a dataset consisting of multivariate normal distributions. The covariance matrices of the dataset instances were scaled with different factors resulting in different UA-PCA projections. The projected probability distributions are displayed using isolines of their probability density functions. As the scaling value increases, the projection changes, showing the sensitivity of UA-PCA to changes in variance.

Search
Clear search
Close search
Google apps
Main menu