Mass spectrometry imaging (MSI) experiments result in complex multi-dimensional datasets, which require specialist data analysis tools. Here we have developed massPix - an R package for analysing and interpreting data from MSI of lipids in tissue. MassPix is an open-source tool for the analysis and statistical interpretation of MSI data, and is particularly useful for lipidomics applications. MassPix produces single ion images, performs multivariate statistics and provides putative lipid annotations based on accurate mass matching against generated lipid libraries. Classification of tissue regions with high spectral similarly can be carried out by principal components analysis (PCA) or k-means clustering. Mouse cerebellum was analysed using matrix assisted laser desorption ionisation (MALDI) MSI. The resulting MSI dataset forms the test data for massPix.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The FragPipe computational proteomics platform is gaining widespread popularity among the proteomics research community because of its fast processing speed and user-friendly graphical interface. Although FragPipe produces well-formatted output tables that are ready for analysis, there is still a need for an easy-to-use and user-friendly downstream statistical analysis and visualization tool. FragPipe-Analyst addresses this need by providing an R shiny web server to assist FragPipe users in conducting downstream analyses of the resulting quantitative proteomics data. It supports major quantification workflows, including label-free quantification, tandem mass tags, and data-independent acquisition. FragPipe-Analyst offers a range of useful functionalities, such as various missing value imputation options, data quality control, unsupervised clustering, differential expression (DE) analysis using Limma, and gene ontology and pathway enrichment analysis using Enrichr. To support advanced analysis and customized visualizations, we also developed FragPipeAnalystR, an R package encompassing all FragPipe-Analyst functionalities that is extended to support site-specific analysis of post-translational modifications (PTMs). FragPipe-Analyst and FragPipeAnalystR are both open-source and freely available.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Replication data for the publication: "rSIREM: an R package for MALDI spectral deconvolution" by Del Castillo Pérez et al. The deposited data are SALDI-MSI data of three consectutive thin tissue sections from mouse cerebellum measured at the different mass resolutions at the same instrument (MALDI-MSI: Spectroglyph Injector - Orbitrap Exploris). The paper describes a new R package (rSIREM) to computationally improve the mass resolution of an MSI post-measurement. The developed R package (https://github.com/EdelCastillo/rSirem ) applies a statistical treatment on the concentration of spatial images obtained by separately considering each of the m/z over all the pixels. A representative scalar is associated with each image, obtained by applying a new measure (SIREM) to it, derived from Shannon's entropy. The perturbations of this measure, when considering a sequence of consecutive images, reveal the existence of overlap, if it exists. This information serves as a seed to initialize the EM algorithm in the Gaussian Mixture Model context. The efficiency of the method has been verified using three independent procedures.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The repository contains three mzML and four imzML mass spectrometry datasets,
The mzML data are compiled in a single directory 'mzML' and zipped:
The imzML mass spectrometry imaging data are zipped individually:
All these datasets are publicly available from different repositories; however, If you reuse them, please attribute the original authors!
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
In the field of proteomics, generating biologically relevant results from mass spectrometry (MS) signals remains a challenging task. This is partly due to the fact that the computational strategies for converting MS signals into biologically interpretable data depend heavily on the MS acquisition method. Additionally, the processing and the analysis of these data vary depending on whether the proteomic experiment was performed with or without labeling, and with or without fractionation. Several R packages have been developed for processing and analyzing MS data, but they only incorporate identification and quantification data; none of them takes into account other invaluable information collected during MS runs. To address this limitation, we introduce MCQR, an alternative R package for the in-depth exploration, processing, and analysis of quantitative proteomics data generated from either data-dependent or data-independent acquisition methods. MCQR leverages experimental retention time measurements for quality control, data filtering, and processing. Its modular architecture offers flexibility to accommodate various types of proteomics experiments, including label-free, label-based, fractionated, or those enriched for specific post-translational modifications. Its functions, designed as simple building blocks, are user-friendly, making it easy to test parameters and methods, and to construct customized analysis scenarios. These unique features position MCQR as a comprehensive toolbox, perfectly suited to the specific needs of MS-based proteomics experiments.
Supporting datasets and algorithms (R-based) for the manuscript entitled "Development of a Tool to Determine the Variability of Consensus Mass Spectra", including an R Markdown script to reproduce the manuscript's figures.
Estimating the properties of galaxies, and even where they are, is a challenging process. The Rubin Observatory, a sky survey telescope located in Chile, will once it becomes operational image tens of billions of astronomical objects, the vast majority of which will be galaxies that have never been imaged before. Analyses of these data will require sophisticated methodologies, ones that will allow us to first determine where the galaxies are (i.e., how far away they are), and then conditional on the distance, how massive they are. Given galaxy distance and mass data, we can test theories of how the Universe evolves, by comparing simulated galaxy data with these data.
The Buzzard-V1.0 simulation was used to generate a realistic sample of Rubin Observatory data. In this dataset are measurements for 111,172 galaxies. Developers used these data to benchmark, e.g., methods for estimating galaxy distance. Here, we can assume the distance has been estimated well, and use these data to try to model galaxy mass as a function of brightness and distance.
The dataset contains measures of magnitude and magnitude uncertainty in six astronomical bands (u for ultraviolet, g for green, r for red, i for infrared, and z and y for two additional infrared bands). Magnitude is a logarithmic measure of brightness, with an increase of 5 representing a decrease in brightness by a factor of 100, and with a value of zero being represented (roughly) by how the star Vega appears in the night sky. In addition, there is a redshift measured for each galaxy; it represents by how much light from the galaxy is stretched (by the expansion of Universe) as it travels to us. Thus higher redshifts represent larger distances. The last measurement is log.mass, which is the base-10 logarithm of the galaxy stellar mass in units of solar mass; for instance, log.mass = 10 means that the galaxy has a mass 10 billion times that of the Sun.
As noted above, the idea here is to learn a statistical association between measures of magnitude and distance, and galaxy mass.
One wrinkle here that analysts can exploit is that the data contain standard error estimates for the magnitudes (though not for redshift, for which, in practice, the error would be ).
Schmidt, Malz, Soo, Almosallam, Brescia, Cavuoti, Cohen-Tanugi, Connolly, DeRose, Freeman, Graham, Iyer, Jarvis, Kalmbach, Kovacs, Lee, Longo, Morrison, Newman, Nourbakhsh, Nuss, Pospisil, Tranin, Wechsler, Zhou, Izbicki, (The LSST Dark Energy Science Collaboration). “Evaluation of probabilistic photometric redshift estimation approaches for The Rubin Observatory Legacy Survey of Space and Time (LSST)”. Monthly Notices of the Royal Astronomical Society 499, December 2020, pages 1587–1606. https://doi.org/10.1093/mnras/staa2799
Foto from unsplash
This dataset contains aerosol gravimetric analysis of mass, using 2-stage multi-jet cascade impactors, taken aboard the Ron Brown ship during the ACE-Asia field project. This dataset contains the tab delimited (.acf) data files. Data can also be downloaded in a netCDF format.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
This dataset is about: (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the last glacial maximum. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.726364 for more information.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
This dataset is about: (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the Holocene. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.726364 for more information.
Supporting datasets and algorithms (R-based) for the manuscript entitled "Development of a Tool to Determine the Variability of Consensus Mass Spectra", including an R Markdown script to reproduce the manuscript's figures.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Requisite R files for the Proteo-SAFARI app
Scripts for analysis of DI-qTOF data recorded for studies of refractory dissolved organic matter
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Following files contain the results of LARAPPI/CI (Laser Ablation Remote Atmospheric Pressure Photoionization/Chemical Ionization)-3D-MSI performed employing an ultrahigh resolution QToF (quadrupole-time-of-flight) mass spectrometer Bruker Impact II.2D_Fusarium_graminarum_Bacillus_cereus.d.zip: The .zip file contains raw data from LARAPPI/CI-MSI from the 2D measurement of the sample with Fusarium graminarum and Paenibacillus xylanexedens. The .d files can be accessed using the open-source R package available for free.Fusarium_graminarum_Paenibacillus_xylanexedens.zip: The .zip file contains raw data from LARAPPI/CI-MSI from the 3D measurement of the sample with Fusarium graminarum and Paenibacillus xylanexedens, including a .d file for each of the two steps. The .d files can be accessed using the open-source R package available for free.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Following files contain the results of LARAPPI/CI (Laser Ablation Remote Atmospheric Pressure Photoionization/Chemical Ionization)-3D-MSI performed employing an ultrahigh resolution QToF (quadrupole-time-of-flight) mass spectrometer Bruker Impact II.3D_step0_6_d.zip; 3D_step1_6_d.zip; 3D_step2_6_d.zip: the .zip files contains raw 3D MSI data in .d format including 3 measurents, one from each level of the sample with Fusarium avenaceum and Priestia megaterium. The data can be opened using an open-source R package that is freely available.3D_step0_8_d.zip; 3D_step1_8_d.zip; 3D_step2_8_d.zip: the .zip files contains raw 3D MSI data in .d format including 3 measurents, one from each level of the sample with Fusarium avenaceum and Bacillus licheniformis. The data can be opened using an open-source R package that is freely available.
This dataset contains the concentration of polycyclic aromatic hydrocarbon (PAHs) and biomarkers (hopanes, steranes, alkanes) in marine sediment cores using gas chromatography-mass spectrometry (GC-MS), mass-to-charge ratio (m/z) and monoisotopic intensity data using a Fourier transform ion cyclotron mass spectrometry (FTICR-MS), sediment texture and composition data, short-lived radioisotope (SLRad) data, and species-level benthic foraminiferal assemblages identified from sediment cores collected aboard R/V Justo Sierra cruise JS-0815 in the southern Gulf of Mexico between 2015-07-31 and 2015-08-08. Marine sediment core samples were collected with multiple corers and were sectioned at specific intervals, and freeze-dried. Hydrocarbons were extracted from freeze-dried samples using a dichloromethane/methanol (9:1) mixture of solvents and extracts were analyzed using an Agilent 7890B GC/MS instrument attached to a 5977A mass detector. Benthic infauna data includes average meiofauna and macrofauna taxa abundance per replicate-section; calculations for infauna abundance, diversity, richness and evenness; and the total concentration of polycyclic aromatic hydrocarbon (PAHs) for the core sections. SLRad data and sediment texture and composition data were generated for selected core sub-samples at 2mm sampling intervals for “surficial unit†and 5mm sampling resolution intervals to the base of cores. All data includes the location, date and depth of the sample collection sites.
https://artefacts.ceda.ac.uk/licences/specific_licences/esacci_icesheets_greenland_terms_and_conditions.pdfhttps://artefacts.ceda.ac.uk/licences/specific_licences/esacci_icesheets_greenland_terms_and_conditions.pdf
This dataset provides a Gravimetric Mass Balance (GMB) product for the Greenland Ice Sheet (GIS), generated by DTU Space, based on monthly snapshots of the Earth’s gravity field provided by the Gravity Recovery and Climate Experiment (GRACE) and its follow-on satellite mission (GRACE-FO). The product relies on monthly gravity field solutions (L2) of release 06 generated at the Center for Space Research (University of Texas at Austin) and spans the period from April 2002 through August 2021.
The GMB product covers the full GRACE mission period (April 2002 - June 2017) and is extended by means of GRACE-FO data starting from June 2018, thus including 200 monthly solutions. The mass change estimation is based on inversion method developed at DTU Space.
Two different types of products are available. First, the gridded mass trends product is comprised of ice mass change trends for cells of equal area with 50 km resolution covering the whole GIS. Second, the mass change time series product provides time series of integrated mass changes for 8 drainage basins and the entire GIS.
Reference: Barletta, V. R., Sørensen, L. S., and Forsberg, R. (2013) 'Scatter of mass changes estimates at basin scale for Greenland and Antarctica', The Cryosphere, 7, 1411-1432, doi:10.5194/tc-7-1411-2013.",
This dataset contains mass-to-charge ratio (m/z) and monoisotopic intensity peaks detected in the northern and southern Gulf of Mexico dissolved organic matter extracts using a Fourier transform ion cyclotron mass spectrometry (FTICR-MS). The instrument used was manufactured by Bruker Daltonics; Model: Solarix, 12T. The water samples were collected using a CTD-Niskin rosette aboard the R/V Justo Sierra cruise JS-0815 in the southern Gulf of Mexico on 2015-08-03 and aboard the R/V Weatherbird II cruise WB-0815 in the northern Gulf of Mexico on 2015-08-18. The R/V Justo Sierra cruise was led by chief scientist Dr. Steve Murawski and the R/V Weatherbird II cruise was led by chief scientist Dr. Dave Hollander. Mass-to-charge ratio (m/z) and monoisotopic intensity from a sediment core collected aboard the R/V Justo Sierra cruise JS-0815 in the southern Gulf of Mexico is available under GRIIDC UDI R4.x267.179:0006.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Datasets and R scripts from González-Suárez, M; Gonzalez-Voyer, A; von Hardenberg, A; Santini, L (2021) The role of brain size on mammalian population densities Journal of Animal Ecology, 90: 653– 661. DOI: 10.1111/1365-2656.13397Additional details in the README.pdf file
SUMMARY OF FILES INCLUDED
·
12 csv datasets from other sources (described
below) with brain and body mass data in the zip file Brain and Mass data.
·
Six csv datasets from other sources and compilations
(described below) with population density and diet information
·
Two csv files (Complete_dataset_published.csv, Brain_data_compilation_published.csv) produced during this study. Details of the files and the compilation protocol are provided in the README file.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
This dataset is about: (Table 2) Accelerator mass spectrometry (AMS) radiocarbon dates and calibrated ages. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.819014 for more information.
Mass spectrometry imaging (MSI) experiments result in complex multi-dimensional datasets, which require specialist data analysis tools. Here we have developed massPix - an R package for analysing and interpreting data from MSI of lipids in tissue. MassPix is an open-source tool for the analysis and statistical interpretation of MSI data, and is particularly useful for lipidomics applications. MassPix produces single ion images, performs multivariate statistics and provides putative lipid annotations based on accurate mass matching against generated lipid libraries. Classification of tissue regions with high spectral similarly can be carried out by principal components analysis (PCA) or k-means clustering. Mouse cerebellum was analysed using matrix assisted laser desorption ionisation (MALDI) MSI. The resulting MSI dataset forms the test data for massPix.