Mass spectrometry imaging (MSI) experiments result in complex multi-dimensional datasets, which require specialist data analysis tools. Here we have developed massPix - an R package for analysing and interpreting data from MSI of lipids in tissue. MassPix is an open-source tool for the analysis and statistical interpretation of MSI data, and is particularly useful for lipidomics applications. MassPix produces single ion images, performs multivariate statistics and provides putative lipid annotations based on accurate mass matching against generated lipid libraries. Classification of tissue regions with high spectral similarly can be carried out by principal components analysis (PCA) or k-means clustering. Mouse cerebellum was analysed using matrix assisted laser desorption ionisation (MALDI) MSI. The resulting MSI dataset forms the test data for massPix.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The FragPipe computational proteomics platform is gaining widespread popularity among the proteomics research community because of its fast processing speed and user-friendly graphical interface. Although FragPipe produces well-formatted output tables that are ready for analysis, there is still a need for an easy-to-use and user-friendly downstream statistical analysis and visualization tool. FragPipe-Analyst addresses this need by providing an R shiny web server to assist FragPipe users in conducting downstream analyses of the resulting quantitative proteomics data. It supports major quantification workflows, including label-free quantification, tandem mass tags, and data-independent acquisition. FragPipe-Analyst offers a range of useful functionalities, such as various missing value imputation options, data quality control, unsupervised clustering, differential expression (DE) analysis using Limma, and gene ontology and pathway enrichment analysis using Enrichr. To support advanced analysis and customized visualizations, we also developed FragPipeAnalystR, an R package encompassing all FragPipe-Analyst functionalities that is extended to support site-specific analysis of post-translational modifications (PTMs). FragPipe-Analyst and FragPipeAnalystR are both open-source and freely available.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Replication data for the publication: "rSIREM: an R package for MALDI spectral deconvolution" by Del Castillo Pérez et al. The deposited data are SALDI-MSI data of three consectutive thin tissue sections from mouse cerebellum measured at the different mass resolutions at the same instrument (MALDI-MSI: Spectroglyph Injector - Orbitrap Exploris). The paper describes a new R package (rSIREM) to computationally improve the mass resolution of an MSI post-measurement. The developed R package (https://github.com/EdelCastillo/rSirem ) applies a statistical treatment on the concentration of spatial images obtained by separately considering each of the m/z over all the pixels. A representative scalar is associated with each image, obtained by applying a new measure (SIREM) to it, derived from Shannon's entropy. The perturbations of this measure, when considering a sequence of consecutive images, reveal the existence of overlap, if it exists. This information serves as a seed to initialize the EM algorithm in the Gaussian Mixture Model context. The efficiency of the method has been verified using three independent procedures.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
With recent advances in analytical chemistry, liquid chromatography high-resolution tandem mass spectrometry (LC-HRMS/MS) has become an essential tool for metabolite discovery and detection. Even if most of the common drug transformations have already been extensively described, manual search of drug metabolites in LC-HRMS/MS datasets is still a common practice in toxicology laboratories, complicating metabolite discovery. Furthermore, the availability of free open-source software for metabolite discovery is still limited. In this article, we present MetIDfyR, an open-source and cross-platform R package for in silico drug phase I/II biotransformation prediction and mass-spectrometric data mining. MetIDfyR has proven its efficacy for advanced metabolite identification in semi-complex and complex mixtures in in vitro or in vivo drug studies and is freely available at github.com/agnesblch/MetIDfyR.
Supporting datasets and algorithms (R-based) for the manuscript entitled "Development of a Tool to Determine the Variability of Consensus Mass Spectra", including an R Markdown script to reproduce the manuscript's figures.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The repository contains three mzML and four imzML mass spectrometry datasets,
The mzML data are compiled in a single directory 'mzML' and zipped:
The imzML mass spectrometry imaging data are zipped individually:
All these datasets are publicly available from different repositories; however, If you reuse them, please attribute the original authors!
Estimating the properties of galaxies, and even where they are, is a challenging process. The Rubin Observatory, a sky survey telescope located in Chile, will once it becomes operational image tens of billions of astronomical objects, the vast majority of which will be galaxies that have never been imaged before. Analyses of these data will require sophisticated methodologies, ones that will allow us to first determine where the galaxies are (i.e., how far away they are), and then conditional on the distance, how massive they are. Given galaxy distance and mass data, we can test theories of how the Universe evolves, by comparing simulated galaxy data with these data.
The Buzzard-V1.0 simulation was used to generate a realistic sample of Rubin Observatory data. In this dataset are measurements for 111,172 galaxies. Developers used these data to benchmark, e.g., methods for estimating galaxy distance. Here, we can assume the distance has been estimated well, and use these data to try to model galaxy mass as a function of brightness and distance.
The dataset contains measures of magnitude and magnitude uncertainty in six astronomical bands (u for ultraviolet, g for green, r for red, i for infrared, and z and y for two additional infrared bands). Magnitude is a logarithmic measure of brightness, with an increase of 5 representing a decrease in brightness by a factor of 100, and with a value of zero being represented (roughly) by how the star Vega appears in the night sky. In addition, there is a redshift measured for each galaxy; it represents by how much light from the galaxy is stretched (by the expansion of Universe) as it travels to us. Thus higher redshifts represent larger distances. The last measurement is log.mass, which is the base-10 logarithm of the galaxy stellar mass in units of solar mass; for instance, log.mass = 10 means that the galaxy has a mass 10 billion times that of the Sun.
As noted above, the idea here is to learn a statistical association between measures of magnitude and distance, and galaxy mass.
One wrinkle here that analysts can exploit is that the data contain standard error estimates for the magnitudes (though not for redshift, for which, in practice, the error would be ).
Schmidt, Malz, Soo, Almosallam, Brescia, Cavuoti, Cohen-Tanugi, Connolly, DeRose, Freeman, Graham, Iyer, Jarvis, Kalmbach, Kovacs, Lee, Longo, Morrison, Newman, Nourbakhsh, Nuss, Pospisil, Tranin, Wechsler, Zhou, Izbicki, (The LSST Dark Energy Science Collaboration). “Evaluation of probabilistic photometric redshift estimation approaches for The Rubin Observatory Legacy Survey of Space and Time (LSST)”. Monthly Notices of the Royal Astronomical Society 499, December 2020, pages 1587–1606. https://doi.org/10.1093/mnras/staa2799
Foto from unsplash
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
In the field of proteomics, generating biologically relevant results from mass spectrometry (MS) signals remains a challenging task. This is partly due to the fact that the computational strategies for converting MS signals into biologically interpretable data depend heavily on the MS acquisition method. Additionally, the processing and the analysis of these data vary depending on whether the proteomic experiment was performed with or without labeling, and with or without fractionation. Several R packages have been developed for processing and analyzing MS data, but they only incorporate identification and quantification data; none of them takes into account other invaluable information collected during MS runs. To address this limitation, we introduce MCQR, an alternative R package for the in-depth exploration, processing, and analysis of quantitative proteomics data generated from either data-dependent or data-independent acquisition methods. MCQR leverages experimental retention time measurements for quality control, data filtering, and processing. Its modular architecture offers flexibility to accommodate various types of proteomics experiments, including label-free, label-based, fractionated, or those enriched for specific post-translational modifications. Its functions, designed as simple building blocks, are user-friendly, making it easy to test parameters and methods, and to construct customized analysis scenarios. These unique features position MCQR as a comprehensive toolbox, perfectly suited to the specific needs of MS-based proteomics experiments.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
This dataset is about: (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the Holocene. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.726364 for more information.
This dataset contains aerosol gravimetric analysis of mass, using 2-stage multi-jet cascade impactors, taken aboard the Ron Brown ship during the ACE-Asia field project. This dataset contains the tab delimited (.acf) data files. Data can also be downloaded in a netCDF format.
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
This dataset is about: (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the last glacial maximum. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.726364 for more information.
Our aim is to identify the population of very low-mass object (VLMO) candidates in the R CrA region and reveal their formation dependence in the local environments. We performed a deep near-infrared (NIR) photometric observation of the R CrA region by UKIRT/WFCAM. Class I and II candidates showing NIR excess were selected from their observed colors. We derived the photometric mass of each candidate with an age assumption of 1Myr. We compared the derived mass of identified VLMO candidates to the dust column density at their position. The 10{sigma} limiting magnitudes were 20.7, 19.6, and 19.2mag in the J-, H-, and K-band, respectively, and we detected 2922 JHK sources in all three bands with an S/N greater than ten in the K-band. Fifteen Class I and 207 Class II candidates with NIR excess were selected from a [J-H]/[H-K] color-color diagram. Six low-mass stars, five brown dwarfs, and 196 planetary-mass object candidates were identified from the J-band luminosity of Class II candidates with the age assumption of 1 Myr using the evolutionary models. The derived initial mass function (IMF) does not appear to decrease in the brown dwarf and planetary-mass regime, even when taking into account the background star and galactic contamination. From comparison between the spatial distributions of Class I and II candidates and dust column densities derived from the Herschel observation, we found that all the low-mass star and brown dwarf candidates are located in the region where the dust column densities are higher than 2.5x10^21^cm^-2^, while planetary-mass object candidates are independent of their local dust densities. Our results suggest that the formations of low-mass stars and very low-mass objects may be dependent on the local cloud properties.
Scripts for analysis of DI-qTOF data recorded for studies of refractory dissolved organic matter
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Requisite R files for the Proteo-SAFARI app
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Additional file 1: Publication analysis code. Analysis code to reproduce present study.
Contains the parameters/set-up for a VIC model run along with the input and post-processed output from the model along with a python notebook to recreate the figures in the GRL paper of the same name.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Datasets and R scripts from González-Suárez, M; Gonzalez-Voyer, A; von Hardenberg, A; Santini, L (2021) The role of brain size on mammalian population densities Journal of Animal Ecology, 90: 653– 661. DOI: 10.1111/1365-2656.13397Additional details in the README.pdf file
SUMMARY OF FILES INCLUDED
·
12 csv datasets from other sources (described
below) with brain and body mass data in the zip file Brain and Mass data.
·
Six csv datasets from other sources and compilations
(described below) with population density and diet information
·
Two csv files (Complete_dataset_published.csv, Brain_data_compilation_published.csv) produced during this study. Details of the files and the compilation protocol are provided in the README file.
SOIREE Sediment Traps - Total Mass Flux
. Visit https://dataone.org/datasets/sha256%3A2c31a8029059075ca1ab79991d6de232cb2c8c83650aeb1534b5245273f1feb4 for complete metadata about this dataset.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
A major bottleneck of mass spectrometric metabolomic analysis is still the rapid detection and annotation of unknown m/z features across biological matrices. This kind of analysis is especially cumbersome for complex samples with hundreds to thousands of unknown features. Traditionally, the annotation was done manually imposing constraints in reproducibility and automatization. Furthermore, different analysis tools are typically used at different steps which requires parsing of data and changing of environments. We present here MetNet, implemented in the R programming language and available as an open-source package via the Bioconductor project. MetNet, which is compatible with the output of the xcms/CAMERA suite, uses the data-rich output of mass spectrometry metabolomics to putatively link features on their relation to other features in the data set. MetNet uses both structural and quantitative information on metabolomics data for network inference and enables the annotation of unknown analytes.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Supplemental Data for R analyses; each sheet should be saved out as its own .csv for R. This file is missing the humerus and femur circumference metrics. All analyses as in the R file will still run, but data is important! It is included in version 2.
Mass spectrometry imaging (MSI) experiments result in complex multi-dimensional datasets, which require specialist data analysis tools. Here we have developed massPix - an R package for analysing and interpreting data from MSI of lipids in tissue. MassPix is an open-source tool for the analysis and statistical interpretation of MSI data, and is particularly useful for lipidomics applications. MassPix produces single ion images, performs multivariate statistics and provides putative lipid annotations based on accurate mass matching against generated lipid libraries. Classification of tissue regions with high spectral similarly can be carried out by principal components analysis (PCA) or k-means clustering. Mouse cerebellum was analysed using matrix assisted laser desorption ionisation (MALDI) MSI. The resulting MSI dataset forms the test data for massPix.