100+ datasets found

D
Replication Data for: Uncertainty-Aware Principal Component Analysis
darus.uni-stuttgart.de
Updated Dec 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen (2022). Replication Data for: Uncertainty-Aware Principal Component Analysis [Dataset]. http://doi.org/10.18419/DARUS-2321
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.18419/DARUS-2321
Dataset updated
Dec 7, 2022
Dataset provided by
DaRUS
Authors
Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen
License
https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321
Dataset funded by
DFG
Description
This dataset contains the source code for uncertainty-aware principal component analysis (UA-PCA) and a series of images that show dimensionality reduction plots created with UA-PCA. The software is a JavaScript library for performing principal component analysis and dimensionality reduction on datasets consisting of multivariate probability distributions. Each plot of the image series used UA-PCA to project a dataset consisting of multivariate normal distributions. The covariance matrices of the dataset instances were scaled with different factors resulting in different UA-PCA projections. The projected probability distributions are displayed using isolines of their probability density functions. As the scaling value increases, the projection changes, showing the sensitivity of UA-PCA to changes in variance.
f
Data from: Visualization of Molecular Fingerprints
acs.figshare.com
zip
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John R. Owen; Ian T. Nabney; José L. Medina-Franco; Fabian López-Vallejo (2023). Visualization of Molecular Fingerprints [Dataset]. http://doi.org/10.1021/ci1004042.s002
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.1021/ci1004042.s002
Dataset updated
May 30, 2023
Dataset provided by
ACS Publications
Authors
John R. Owen; Ian T. Nabney; José L. Medina-Franco; Fabian López-Vallejo
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
A visualization plot of a data set of molecular data is a useful tool for gaining insight into a set of molecules. In chemoinformatics, most visualization plots are of molecular descriptors, and the statistical model most often used to produce a visualization is principal component analysis (PCA). This paper takes PCA, together with four other statistical models (NeuroScale, GTM, LTM, and LTM-LIN), and evaluates their ability to produce clustering in visualizations not of molecular descriptors but of molecular fingerprints. Two different tasks are addressed: understanding structural information (particularly combinatorial libraries) and relating structure to activity. The quality of the visualizations is compared both subjectively (by visual inspection) and objectively (with global distance comparisons and local k-nearest-neighbor predictors). On the data sets used to evaluate clustering by structure, LTM is found to perform significantly better than the other models. In particular, the clusters in LTM visualization space are consistent with the relationships between the core scaffolds that define the combinatorial sublibraries. On the data sets used to evaluate clustering by activity, LTM again gives the best performance but by a smaller margin. The results of this paper demonstrate the value of using both a nonlinear projection map and a Bernoulli noise model for modeling binary data.
f
A two-dimensional PCA plot obtained from a multiple factor analysis (MFA)...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Jul 27, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Balaji, Thiruppathi; Vaishnav, Keshav; Sarma, Devojit Kumar; Kumar, Mayandi Senthil; Paramasivan, Rajaiah; Mohanty, Suman S.; Barik, Tapan Kumar; Srivastava, Hemlata; Kalimuthu, Mariapillai; Kumar, Narendran Pradeep; Tyagi, Suchi; Suman, Devi Shankar; Kumar, Ashwani; Sumitha, Melveettil Kishor; Kumar, Devendra; Sunish, Ittoop Pulikkottil; Patil, Prabhakargouda B.; Bhowmick, Ipsita Pal; Singh, Om P.; Uragayala, Sreehari; Gupta, Bhavna (2023). A two-dimensional PCA plot obtained from a multiple factor analysis (MFA) performed on all 22 populations using 142 bioclimatic variables retrieved from the WorldClim database. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000977988
Explore at:
Dataset updated
Jul 27, 2023
Authors
Balaji, Thiruppathi; Vaishnav, Keshav; Sarma, Devojit Kumar; Kumar, Mayandi Senthil; Paramasivan, Rajaiah; Mohanty, Suman S.; Barik, Tapan Kumar; Srivastava, Hemlata; Kalimuthu, Mariapillai; Kumar, Narendran Pradeep; Tyagi, Suchi; Suman, Devi Shankar; Kumar, Ashwani; Sumitha, Melveettil Kishor; Kumar, Devendra; Sunish, Ittoop Pulikkottil; Patil, Prabhakargouda B.; Bhowmick, Ipsita Pal; Singh, Om P.; Uragayala, Sreehari; Gupta, Bhavna
Description
A two-dimensional PCA plot obtained from a multiple factor analysis (MFA) performed on all 22 populations using 142 bioclimatic variables retrieved from the WorldClim database.
Consequences of PCA graphs, SNP codings, and PCA variants for elucidating...
plos.figshare.com
xlsx
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen (2023). Consequences of PCA graphs, SNP codings, and PCA variants for elucidating population structure [Dataset]. http://doi.org/10.1371/journal.pone.0218306
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0218306
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
SNP datasets are high-dimensional, often with thousands to millions of SNPs and hundreds to thousands of samples or individuals. Accordingly, PCA graphs are frequently used to provide a low-dimensional visualization in order to display and discover patterns in SNP data from humans, animals, plants, and microbes—especially to elucidate population structure. PCA is not a single method that is always done the same way, but rather requires three choices which we explore as a three-way factorial: two kinds of PCA graphs by three SNP codings by six PCA variants. Our main three recommendations are simple and easily implemented: Use PCA biplots, SNP coding 1 for the rare allele and 0 for the common allele, and double-centered PCA (or AMMI1 if main effects are also of interest). We also document contemporary practices by a literature survey of 125 representative articles that apply PCA to SNP data, find that virtually none implement our recommendations. The ultimate benefit from informed and optimal choices of PCA graph, SNP coding, and PCA variant, is expected to be discovery of more biology, and thereby acceleration of medical, agricultural, and other vital applications.
The X-loading values for the first principal component of the PCA score...
figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Erwin van Vliet; Elisenda Eixarch; Miriam Illa; Ariadna Arbat-Plana; Anna González-Tendero; Helena T. Hogberg; Liang Zhao; Thomas Hartung; Eduard Gratacos (2023). The X-loading values for the first principal component of the PCA score plot. [Dataset]. http://doi.org/10.1371/journal.pone.0064545.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0064545.t003
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Erwin van Vliet; Elisenda Eixarch; Miriam Illa; Ariadna Arbat-Plana; Anna González-Tendero; Helena T. Hogberg; Liang Zhao; Thomas Hartung; Eduard Gratacos
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
X-loading values show positive and negative correlations responsible for the cluster formation along the first principal component (PC1) in the PCA score plot.
DESeq2 DGEAnalysisGSE44076 QC,PCA,Heatmaps,Volcano
kaggle.com
zip
Updated Nov 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr. Nagendra (2025). DESeq2 DGEAnalysisGSE44076 QC,PCA,Heatmaps,Volcano [Dataset]. https://www.kaggle.com/datasets/mannekuntanagendra/deseq2-dgeanalysisgse44076-qcpcaheatmapsvolcano
Explore at:
zip(1796357 bytes)Available download formats
Dataset updated
Nov 29, 2025
Authors
Dr. Nagendra
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset contains a Differential Gene Expression (DGE) analysis of GSE44076.

The analysis compares tumor versus normal samples.

It uses the DESeq2 package for RNA-seq count data analysis.

The dataset includes quality control (QC) visualizations.

Principal Component Analysis (PCA) plots are provided for sample clustering.

Heatmaps illustrate the expression patterns of top differentially expressed genes.

EnhancedVolcano plots are included to visualize significant genes.

The dataset enables users to explore gene expression changes in colorectal cancer.

All R scripts and associated visualizations are included for reproducibility.

The workflow can be adapted for other RNA-seq datasets.

The dataset supports bioinformatics, transcriptomics, and cancer research studies.

It provides an educational resource for DESeq2-based RNA-seq analysis.
Principal Component Analysis (PCA) table of the full dataset on N = 23...
plos.figshare.com
xls
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deanna Gigliotti; Jeff R. S. Leiter; Peter B. MacDonald; Jason Peeler; Judy E. Anderson (2023). Principal Component Analysis (PCA) table of the full dataset on N = 23 variables, showing correlation coefficients for variables loaded on the 3 PCs extracted from the analysis of data from 27 participants. [Dataset]. http://doi.org/10.1371/journal.pone.0162494.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0162494.t001
Dataset updated
May 31, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Deanna Gigliotti; Jeff R. S. Leiter; Peter B. MacDonald; Jason Peeler; Judy E. Anderson
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In the row for each variable, numbers indicate the strength of correlation of that variable with the eigenvector of each PC. When the absolute value of correlation coefficients was ≥ 0.3, they were considered important (bold font) in defining the PC. Variables loaded on PCs 1–3 below, appear in the PCA plot (Fig 4A) with an asterisk (*) to indicate they also project upward (since PC3 is perpendicular to axes for PCs 1 and 2).
d
coastDat-2 Hydrodynamic Model TRIM-NP Principal Component Analysis Residual...
demo-b2find.dkrz.de
Updated Sep 20, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). coastDat-2 Hydrodynamic Model TRIM-NP Principal Component Analysis Residual Currents - Dataset - B2FIND [Dataset]. http://demo-b2find.dkrz.de/dataset/09fd34c9-4d32-5bcb-a0e3-f3901ac83ae8
Explore at:
Dataset updated
Sep 20, 2025
Description
Simulated 2D residual velocity fields in the inner German Bight were subjected to Principal Component Analysis (PCA). Residual currents were obtained from coastDat2 barotropic 2D simulations with the hydrodynamic model TRIM-NP V2.1.22 in barotropic 2D mode on a Cartesian grid (1.6km spatial resolution) stored on an hourly basis for the years 1948 - 2012 (doi:10.1594/WDCC/coastDat-2_TRIM-NP-2d) and later extended until August 2015. The present analysis refers to the period Jan 1958 - Aug 2015. The spatial domain considered is the region to the east of 6 degrees east and to the south of 55.6 degrees north. All grid nodes with a bathymetry of less than 10m were excluded. Residual velocities were calculated in two different ways: 1.) as 25h means, 2.) as monthly means. Both types of residual current data are available from * RESIDUAL_CURRENTS_195801_201508 The directory contains sub-directories for years and months. Daily residual currents for the 13th of September 1974, for instance, are stored in * RESIDUAL_CURRENTS_195801_201508/YEAR_1974/MONTH_09/TRIM2D_1974_09_13_means.nc while monthly mean residual currents for September 1974 are stored in: * RESIDUAL_CURRENTS_195801_201508/YEAR_1974/TRIM2D_1974_09_means.nc All current fields provided were interpolated from the original Cartesian model grid to a more convenient regular geographical grid (116x76 nodes). Mean residual currents are stored in: * mean_residual_currents.nc This data set contains residual velocities both on original Cartesian grid nodes and interpolated to the geographical grid. An example plot is provided: * mean_residual_currents.png For PCA, two residual velocity components from each of 12133 Cartesian grid nodes were combined into one data vector (length 2x12133), referring to 21061 daily or 692 monthly time levels. Results of two independent PCAs for either daily or monthly mean fields are stored in: * PCA_daily_residual_currents.nc * PCA_monthly_residual_currents.nc Files contain three leading Principal Components (PCs) and corresponding Emipirical Orthogonal Functions (EOFs). Again EOFs were also interpolated to a regular geographical grid. PC time series are also stored in plain ASCII format: * PCs_daily.txt * PCs_monthly.txt For monthly fields the number N of variables (N=2x12133) is much larger than the number T of time levels (T=692). Therefore, to reduce computational demands, the roles of time and space were formally interchanged. Having conducted the PCA the EOFs were then transformed back to the original spatial coordinates (cf. Section 12.2.6 in von Storch and Zwiers (1999), Statistical Analysis in Climate Research, Cambridge University Press). A much larger number of time levels made even this approach prohibitive for the full set of daily data. Therefore, PCAs were performed for six sub-periods (1958-1965, 1966-1975, 1976-1985, 1986-1995, 1996-2005, 2006-2015(Aug)) independently. EOFs obtained from these six sub-periods were then averaged to obtain EOFs representative for the whole period. Corresponding PCs were calculated by projecting daily fields onto these average EOFs. IMPORTANT: In contrast with PCA of monthly data, the PCA of daily data INVOLVES SOME APPROXIMATIONS! EOFs on the original nodes were normalized to have unit lengths. The following figures, * daily_EOF1.png * daily_EOF2.png * daily_EOF3.png show the first three EOFs obtained from daily data, assuming that corresponding PCs have the value of one standard deviation. The following two plots, * monthly_EOF1.png * monthly_EOF2.png show the leading EOFs for monthly mean data. EOF3 is omitted as it represents just a very small percentage of overall variance (1.7%).
d
The principal components of electoral regimes
datadryad.org
zip
Updated Aug 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Karoline Wiesner; Samuel Bien; Matthew Wilson (2024). The principal components of electoral regimes [Dataset]. http://doi.org/10.5061/dryad.np5hqc030
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.np5hqc030
Dataset updated
Aug 2, 2024
Dataset provided by
Dryad
Authors
Karoline Wiesner; Samuel Bien; Matthew Wilson
Time period covered
Jul 12, 2024
Description
We use version 12 (2022) of the V-Dem data (https://www.V-Dem.net) and apply standard principal component analysis (PCA). Following standard procedure, we normalized each V-Dem variable (i.e. centered it to a mean of zero and rescaled it to a variance of one) prior to performing PCA. For better readability of the plots, we rescaled all principal components uniformly such that the first component has a maximum absolute value of one (i.e. its values are bounded by [-1,1]) while preserving the mean of zero for all components. We further re-oriented each component such that its strongest loading is positive.
T
United States - Balance Sheet: Total Risk Based Capital (PCA Definition)
tradingeconomics.com
csv, excel, json, xml
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2025). United States - Balance Sheet: Total Risk Based Capital (PCA Definition) [Dataset]. https://tradingeconomics.com/united-states/balance-sheet-total-risk-based-capital-pca-definition-fed-data.html
Explore at:
json, csv, xml, excelAvailable download formats
Dataset updated
May 15, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1976 - Dec 31, 2025
Area covered
United States
Description
United States - Balance Sheet: Total Risk Based Capital (PCA Definition) was 2364706.75300 Mil. of U.S. $ in April of 2025, according to the United States Federal Reserve. Historically, United States - Balance Sheet: Total Risk Based Capital (PCA Definition) reached a record high of 2364706.75300 in April of 2025 and a record low of 322350.26900 in January of 1990. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Balance Sheet: Total Risk Based Capital (PCA Definition) - last updated from the United States Federal Reserve on November of 2025.
f
Heatmap of pairwise correlations and principal component analysis (PCA) of...
datasetcatalog.nlm.nih.gov
plos.figshare.com
+1more
Updated Aug 5, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sawyer, Jason E.; Spencer, Thomas E.; Bauersachs, Stefan; Madsen, Crystal A.; Geary, Thomas W.; Minten, Megan A.; Bilby, Todd R.; Tibary, Ahmed; Neibergs, Holly L.; Bruno, Ralph G. S.; Allen, Carolyn C.; Wang, Zeping (2013). Heatmap of pairwise correlations and principal component analysis (PCA) of microarray data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001687053
Explore at:
Dataset updated
Aug 5, 2013
Authors
Sawyer, Jason E.; Spencer, Thomas E.; Bauersachs, Stefan; Madsen, Crystal A.; Geary, Thomas W.; Minten, Megan A.; Bilby, Todd R.; Tibary, Ahmed; Neibergs, Holly L.; Bruno, Ralph G. S.; Allen, Carolyn C.; Wang, Zeping
Description
(A) Microarray data were filtered for detectable probes and normalized with the BioConductor package vsn. Normalized data were used for calculation of pairwise distances and drawing of a heatmap by use of the BioConductor package geneplotter. Each column represents one sample and shows the correlation to all samples (including itself), with red for correlation = 1 and blue for the lowest observed correlation. Note the clear homogeneity in the samples from fertility classified heifers (HF, high fertile; SF, subfertile; IF, infertile). (B) PCA is a plot distribution indicating the source of greatest variation in the overall transcriptional profiles of the samples. Each symbol represents one replicate. Note the clear lack of separation of samples based on fertility classifications (HF, high fertile; SF, subfertile; IF, infertile).
T
United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition)
tradingeconomics.com
csv, excel, json, xml
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2025). United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) [Dataset]. https://tradingeconomics.com/united-states/balance-sheet-tier-1-risk-based-capital-pca-definition-fed-data.html
Explore at:
json, xml, excel, csvAvailable download formats
Dataset updated
May 15, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1976 - Dec 31, 2025
Area covered
United States
Description
United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) was 2271005.21400 Mil. of U.S. $ in April of 2025, according to the United States Federal Reserve. Historically, United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) reached a record high of 2271005.21400 in April of 2025 and a record low of 147414.03800 in January of 1984. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition) - last updated from the United States Federal Reserve on October of 2025.
T
United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition)
tradingeconomics.com
csv, excel, json, xml
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2025). United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) [Dataset]. https://tradingeconomics.com/united-states/balance-sheet-tier-1-leverage-capital-pca-definition-fed-data.html
Explore at:
csv, excel, json, xmlAvailable download formats
Dataset updated
May 15, 2025
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1976 - Dec 31, 2025
Area covered
United States
Description
United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) was 2271005.21400 Mil. of U.S. $ in April of 2025, according to the United States Federal Reserve. Historically, United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) reached a record high of 2271005.21400 in April of 2025 and a record low of 147414.03800 in January of 1984. Trading Economics provides the current actual value, an historical data chart and related indicators for United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition) - last updated from the United States Federal Reserve on November of 2025.
q
Using PCA to study small mammal biomechanics
qubeshub.org
Updated May 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abagael West; Stephanie Smith (2025). Using PCA to study small mammal biomechanics [Dataset]. http://doi.org/10.25334/6KCX-M612
Explore at:
Unique identifier
https://doi.org/10.25334/6KCX-M612
Dataset updated
May 22, 2025
Dataset provided by
QUBES
Authors
Abagael West; Stephanie Smith
Description
In this lesson, students interpret a scatter plot showing the results of a principal components analysis (PCA). They view an interview with Dr. Stephanie Smith, who explains how PCA calculations work, and why she chose to use this analysis to visualize her data. Dr. Smith also discusses her journey becoming a scientist and describes a typical day at work.
A Network View on Psychiatric Disorders: Network Clusters of Symptoms as...
plos.figshare.com
figshare.com
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rutger Goekoop; Jaap G. Goekoop (2023). A Network View on Psychiatric Disorders: Network Clusters of Symptoms as Elementary Syndromes of Psychopathology [Dataset]. http://doi.org/10.1371/journal.pone.0112734
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0112734
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Rutger Goekoop; Jaap G. Goekoop
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
IntroductionThe vast number of psychopathological syndromes that can be observed in clinical practice can be described in terms of a limited number of elementary syndromes that are differentially expressed. Previous attempts to identify elementary syndromes have shown limitations that have slowed progress in the taxonomy of psychiatric disorders.AimTo examine the ability of network community detection (NCD) to identify elementary syndromes of psychopathology and move beyond the limitations of current classification methods in psychiatry.Methods192 patients with unselected mental disorders were tested on the Comprehensive Psychopathological Rating Scale (CPRS). Principal component analysis (PCA) was performed on the bootstrapped correlation matrix of symptom scores to extract the principal component structure (PCS). An undirected and weighted network graph was constructed from the same matrix. Network community structure (NCS) was optimized using a previously published technique.ResultsIn the optimal network structure, network clusters showed a 89% match with principal components of psychopathology. Some 6 network clusters were found, including "DEPRESSION", "MANIA", “ANXIETY”, "PSYCHOSIS", "RETARDATION", and "BEHAVIORAL DISORGANIZATION". Network metrics were used to quantify the continuities between the elementary syndromes.ConclusionWe present the first comprehensive network graph of psychopathology that is free from the biases of previous classifications: a ‘Psychopathology Web’. Clusters within this network represent elementary syndromes that are connected via a limited number of bridge symptoms. Many problems of previous classifications can be overcome by using a network approach to psychopathology.
Wine Dataset for PCA
kaggle.com
zip
Updated Feb 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DEV AHUJA (2024). Wine Dataset for PCA [Dataset]. https://www.kaggle.com/datasets/devahuja2808/wine-dataset-for-pca
Explore at:
zip(4522 bytes)Available download formats
Dataset updated
Feb 23, 2024
Authors
DEV AHUJA
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by DEV AHUJA

Released under Apache 2.0

Contents
f
Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC...
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Jun 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Spencer, Paulette; Maksumova, Zoya; Raj, Srilakshmi M.; Isshiki, Mariko; Colón, Mirtha; Isasi, Carmen R.; Klugman, Susan D.; Suglia, Shakira; Griffen, Anthony J.; Meissner, Paul; Cabana, Michael D.; Greally, John M. (2025). Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC participants and reference panels. (B) PCA plots highlighting individuals belonging to the 14 IBD clusters detected in AoU NYC participants. (C) SCOPE analysis for AoU NYC participants labelled with the 14 IBD clusters. Each color represents global ancestry proportion of the five superpopulations (African, European, American, East Asian and South Asian) inferred using supervised mode in SCOPE. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002064867
Explore at:
Dataset updated
Jun 24, 2025
Authors
Spencer, Paulette; Maksumova, Zoya; Raj, Srilakshmi M.; Isshiki, Mariko; Colón, Mirtha; Isasi, Carmen R.; Klugman, Susan D.; Suglia, Shakira; Griffen, Anthony J.; Meissner, Paul; Cabana, Michael D.; Greally, John M.
Area covered
New York
Description
Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC participants and reference panels. (B) PCA plots highlighting individuals belonging to the 14 IBD clusters detected in AoU NYC participants. (C) SCOPE analysis for AoU NYC participants labelled with the 14 IBD clusters. Each color represents global ancestry proportion of the five superpopulations (African, European, American, East Asian and South Asian) inferred using supervised mode in SCOPE.
Wine12
kaggle.com
zip
Updated Nov 23, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
gaurav9712 (2020). Wine12 [Dataset]. https://www.kaggle.com/gaurav9712/wine12
Explore at:
zip(4491 bytes)Available download formats
Dataset updated
Nov 23, 2020
Authors
gaurav9712
Description
Perform Principal component analysis and perform clustering using first 3 principal component scores (both heirarchial and k mean clustering(scree plot or elbow curve) and obtain optimum number of clusters and check whether we have obtained same number of clusters with the original data (class column we have ignored at the begining who shows it has 3 clusters)df
F
Balance Sheet: Total Risk Based Capital (PCA Definition)
fred.stlouisfed.org
json
Updated Oct 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Balance Sheet: Total Risk Based Capital (PCA Definition) [Dataset]. https://fred.stlouisfed.org/series/QBPBSTRSKK
Explore at:
jsonAvailable download formats
Dataset updated
Oct 2, 2025
License
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Description
Graph and download economic data for Balance Sheet: Total Risk Based Capital (PCA Definition) (QBPBSTRSKK) from Q1 1990 to Q2 2025 about capital and USA.
Augmented ANOVA table for Individual-Centered PCA of SNP data on oats, using...
plos.figshare.com
xls
Updated Jun 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen (2023). Augmented ANOVA table for Individual-Centered PCA of SNP data on oats, using SNP coding rare = 1. [Dataset]. http://doi.org/10.1371/journal.pone.0218306.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0218306.t003
Dataset updated
Jun 5, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Hugh G. Gauch Jr.; Sheng Qian; Hans-Peter Piepho; Linda Zhou; Rui Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
PCA is applied to SNP main effects and S×I interaction effects combined (S&S×I), and the portion of each is shown in the last two columns.

Facebook

Twitter

Click to copy link

Link copied

Cite

Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen (2022). Replication Data for: Uncertainty-Aware Principal Component Analysis [Dataset]. http://doi.org/10.18419/DARUS-2321

Replication Data for: Uncertainty-Aware Principal Component Analysis

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Unique identifier

https://doi.org/10.18419/DARUS-2321

Dataset updated

Dec 7, 2022

Dataset provided by

DaRUS

Authors

Jochen Görtler; Thilo Spinner; Daniel Weiskopf; Oliver Deussen

License

https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321https://darus.uni-stuttgart.de/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.18419/DARUS-2321

Dataset funded by

DFG

Description

This dataset contains the source code for uncertainty-aware principal component analysis (UA-PCA) and a series of images that show dimensionality reduction plots created with UA-PCA. The software is a JavaScript library for performing principal component analysis and dimensionality reduction on datasets consisting of multivariate probability distributions. Each plot of the image series used UA-PCA to project a dataset consisting of multivariate normal distributions. The covariance matrices of the dataset instances were scaled with different factors resulting in different UA-PCA projections. The projected probability distributions are displayed using isolines of their probability density functions. As the scaling value increases, the projection changes, showing the sensitivity of UA-PCA to changes in variance.

Clear search

Close search

Google apps

Main menu

Replication Data for: Uncertainty-Aware Principal Component Analysis

Data from: Visualization of Molecular Fingerprints

A two-dimensional PCA plot obtained from a multiple factor analysis (MFA)...

Consequences of PCA graphs, SNP codings, and PCA variants for elucidating...

The X-loading values for the first principal component of the PCA score...

DESeq2 DGEAnalysisGSE44076 QC,PCA,Heatmaps,Volcano

Principal Component Analysis (PCA) table of the full dataset on N = 23...

coastDat-2 Hydrodynamic Model TRIM-NP Principal Component Analysis Residual...

The principal components of electoral regimes

United States - Balance Sheet: Total Risk Based Capital (PCA Definition)

Heatmap of pairwise correlations and principal component analysis (PCA) of...

United States - Balance Sheet: Tier 1 Risk Based Capital (PCA Definition)

United States - Balance Sheet: Tier 1 Leverage Capital (PCA Definition)

Using PCA to study small mammal biomechanics

A Network View on Psychiatric Disorders: Network Clusters of Symptoms as...

Wine Dataset for PCA

Dataset

Contents

Ancestry background of AoU IBD clusters. (A) PCA plot for all AoU NYC...

Wine12

Balance Sheet: Total Risk Based Capital (PCA Definition)

Augmented ANOVA table for Individual-Centered PCA of SNP data on oats, using...

Replication Data for: Uncertainty-Aware Principal Component Analysis