89 datasets found
  1. 🚀 Galaxy Mass Prediction

    • kaggle.com
    Updated Jul 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mexwell (2024). 🚀 Galaxy Mass Prediction [Dataset]. https://www.kaggle.com/datasets/mexwell/galaxy-mass-prediction
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 31, 2024
    Dataset provided by
    Kaggle
    Authors
    mexwell
    Description

    Motivation

    Estimating the properties of galaxies, and even where they are, is a challenging process. The Rubin Observatory, a sky survey telescope located in Chile, will once it becomes operational image tens of billions of astronomical objects, the vast majority of which will be galaxies that have never been imaged before. Analyses of these data will require sophisticated methodologies, ones that will allow us to first determine where the galaxies are (i.e., how far away they are), and then conditional on the distance, how massive they are. Given galaxy distance and mass data, we can test theories of how the Universe evolves, by comparing simulated galaxy data with these data.

    The Buzzard-V1.0 simulation was used to generate a realistic sample of Rubin Observatory data. In this dataset are measurements for 111,172 galaxies. Developers used these data to benchmark, e.g., methods for estimating galaxy distance. Here, we can assume the distance has been estimated well, and use these data to try to model galaxy mass as a function of brightness and distance.

    Data

    The dataset contains measures of magnitude and magnitude uncertainty in six astronomical bands (u for ultraviolet, g for green, r for red, i for infrared, and z and y for two additional infrared bands). Magnitude is a logarithmic measure of brightness, with an increase of 5 representing a decrease in brightness by a factor of 100, and with a value of zero being represented (roughly) by how the star Vega appears in the night sky. In addition, there is a redshift measured for each galaxy; it represents by how much light from the galaxy is stretched (by the expansion of Universe) as it travels to us. Thus higher redshifts represent larger distances. The last measurement is log.mass, which is the base-10 logarithm of the galaxy stellar mass in units of solar mass; for instance, log.mass = 10 means that the galaxy has a mass 10 billion times that of the Sun.

    Variable Description

    • u Galaxy magnitude in Rubin u band (320.5-393.5 nm)
    • g Galaxy magnitude in Rubin g band (401.5-551.9 nm)
    • r Galaxy magnitude in Rubin r band (552.0-691.0 nm)
    • i Galaxy magnitude in Rubin i band (691.0-818.0 nm)
    • z Galaxy magnitude in Rubin z band (818.0-923.5 nm)
    • y Galaxy magnitude in Rubin y band (923.8-1084.5 nm)
    • u.err Uncertainty for u-band magnitude
    • g.err Uncertainty for g-band magnitude
    • r.err Uncertainty for r-band magnitude
    • i.err Uncertainty for i-band magnitude
    • z.err Uncertainty for z-band magnitude
    • y.err Uncertainty for y-band magnitude
    • log.mass Galaxy stellar mass (log-base-10 solar masses)
    • redshift Galaxy redshift

    Questions

    As noted above, the idea here is to learn a statistical association between measures of magnitude and distance, and galaxy mass.

    One wrinkle here that analysts can exploit is that the data contain standard error estimates for the magnitudes (though not for redshift, for which, in practice, the error would be ).

    References

    Schmidt, Malz, Soo, Almosallam, Brescia, Cavuoti, Cohen-Tanugi, Connolly, DeRose, Freeman, Graham, Iyer, Jarvis, Kalmbach, Kovacs, Lee, Longo, Morrison, Newman, Nourbakhsh, Nuss, Pospisil, Tranin, Wechsler, Zhou, Izbicki, (The LSST Dark Energy Science Collaboration). “Evaluation of probabilistic photometric redshift estimation approaches for The Rubin Observatory Legacy Survey of Space and Time (LSST)”. Monthly Notices of the Royal Astronomical Society 499, December 2020, pages 1587–1606. https://doi.org/10.1093/mnras/staa2799

    Acknowledgement

    Foto from unsplash

  2. f

    Statistical Data Analysis using R

    • figshare.com
    txt
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Barsanelli Costa (2023). Statistical Data Analysis using R [Dataset]. http://doi.org/10.6084/m9.figshare.5501035.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Authors
    Samuel Barsanelli Costa
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    R Scripts contain statistical data analisys for streamflow and sediment data, including Flow Duration Curves, Double Mass Analysis, Nonlinear Regression Analysis for Suspended Sediment Rating Curves, Stationarity Tests and include several plots.

  3. Data from: massPix: An R package for annotation and interpretation of mass...

    • data.niaid.nih.gov
    xml
    Updated Aug 31, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zoe Hall (2017). massPix: An R package for annotation and interpretation of mass spectrometry imaging data for lipidomics [Dataset]. https://data.niaid.nih.gov/resources?id=mtbls487
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Aug 31, 2017
    Dataset provided by
    University of Cambridge
    Authors
    Zoe Hall
    Variables measured
    tissue type, Metabolomics
    Description

    Mass spectrometry imaging (MSI) experiments result in complex multi-dimensional datasets, which require specialist data analysis tools. Here we have developed massPix - an R package for analysing and interpreting data from MSI of lipids in tissue. MassPix is an open-source tool for the analysis and statistical interpretation of MSI data, and is particularly useful for lipidomics applications. MassPix produces single ion images, performs multivariate statistics and provides putative lipid annotations based on accurate mass matching against generated lipid libraries. Classification of tissue regions with high spectral similarly can be carried out by principal components analysis (PCA) or k-means clustering. Mouse cerebellum was analysed using matrix assisted laser desorption ionisation (MALDI) MSI. The resulting MSI dataset forms the test data for massPix.

  4. R/V Ron Brown Aerosol Gravimetric analysis of mass as a function of size...

    • data.ucar.edu
    ascii
    Updated Dec 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patricia Quinn (2024). R/V Ron Brown Aerosol Gravimetric analysis of mass as a function of size (ASCII) [Dataset]. http://doi.org/10.26023/W7MR-NDVF-5407
    Explore at:
    asciiAvailable download formats
    Dataset updated
    Dec 26, 2024
    Dataset provided by
    University Corporation for Atmospheric Research
    Authors
    Patricia Quinn
    Time period covered
    Mar 19, 2001 - Apr 18, 2001
    Area covered
    Description

    This dataset contains aerosol gravimetric analysis of mass, using 2-stage multi-jet cascade impactors, taken aboard the Ron Brown ship during the ACE-Asia field project. This dataset contains the tab delimited (.acf) data files. Data can also be downloaded in a netCDF format.

  5. f

    Data from: Analysis and Visualization of Quantitative Proteomics Data Using...

    • acs.figshare.com
    zip
    Updated Sep 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yi Hsiao; Haijian Zhang; Ginny Xiaohe Li; Yamei Deng; Fengchao Yu; Hossein Valipour Kahrood; Joel R. Steele; Ralf B. Schittenhelm; Alexey I. Nesvizhskii (2024). Analysis and Visualization of Quantitative Proteomics Data Using FragPipe-Analyst [Dataset]. http://doi.org/10.1021/acs.jproteome.4c00294.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 10, 2024
    Dataset provided by
    ACS Publications
    Authors
    Yi Hsiao; Haijian Zhang; Ginny Xiaohe Li; Yamei Deng; Fengchao Yu; Hossein Valipour Kahrood; Joel R. Steele; Ralf B. Schittenhelm; Alexey I. Nesvizhskii
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The FragPipe computational proteomics platform is gaining widespread popularity among the proteomics research community because of its fast processing speed and user-friendly graphical interface. Although FragPipe produces well-formatted output tables that are ready for analysis, there is still a need for an easy-to-use and user-friendly downstream statistical analysis and visualization tool. FragPipe-Analyst addresses this need by providing an R shiny web server to assist FragPipe users in conducting downstream analyses of the resulting quantitative proteomics data. It supports major quantification workflows, including label-free quantification, tandem mass tags, and data-independent acquisition. FragPipe-Analyst offers a range of useful functionalities, such as various missing value imputation options, data quality control, unsupervised clustering, differential expression (DE) analysis using Limma, and gene ontology and pathway enrichment analysis using Enrichr. To support advanced analysis and customized visualizations, we also developed FragPipeAnalystR, an R package encompassing all FragPipe-Analyst functionalities that is extended to support site-specific analysis of post-translational modifications (PTMs). FragPipe-Analyst and FragPipeAnalystR are both open-source and freely available.

  6. mzML mass spectrometry and imzML mass spectrometry imaging test data

    • zenodo.org
    zip
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robert Winkler; Robert Winkler (2023). mzML mass spectrometry and imzML mass spectrometry imaging test data [Dataset]. http://doi.org/10.5281/zenodo.10084132
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Robert Winkler; Robert Winkler
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The repository contains three mzML and four imzML mass spectrometry datasets,

    The mzML data are compiled in a single directory 'mzML' and zipped:

    • Col_1.mzML is a liquid chromatography (LC) ESI MS dataset from an Arabidopsis extraction published in: Sotelo-Silveira, M., Chauvin, A.-L., Marsch-Martínez, N., Winkler, R. & De Folter, S. Metabolic fingerprinting of Arabidopsis thaliana accessions. Frontiers in Plant Science 6, 1–13 (2015). https://doi.org/10.3389/fpls.2015.00365.
    • Cytochrome_C.mzML is an electrospray mass spectrometry (ESI MS) dataset of Cytochrome C. The data were discussed in: Winkler, R. ESIprot: a universal tool for charge state determination and molecular weight calculation of proteins from electrospray ionization mass spectrometry data. Rapid Communications in Mass Spectrometry 24, 285- 294 (2010). https://doi.org/10.1002/rcm.4384.
    • T9_A1.mzML is a low-temperature plasma (LTP) MS dataset of the interaction between Arabidopsis and Trichoderma, published in 1. Torres-Ortega, R. et al. In Vivo Low-Temperature Plasma Ionization Mass Spectrometry (LTP-MS) Reveals Regulation of 6-Pentyl-2H-Pyran-2-One (6-PP) as a Physiological Variable during Plant-Fungal Interaction. Metabolites 12, 1231 (2022). https://doi.org/10.3390/metabo12121231.

    The imzML mass spectrometry imaging data are zipped individually:

    • imzML_AP_SMALDI.zip contains an AP-SMALDI mass spectrometry imaging data set of mouse urinary bladder slides, published by Römpp A, Guenther S, Schober Y, Schulz O, Takats Z, Kummer W, Spengler B., ProteomeXchange dataset PXD001283. 2014., and available from https://www.ebi.ac.uk/pride/archive/projects/PXD001283; Publication: Römpp A, Guenther S, Schober Y, Schulz O, Takats Z, Kummer W, Spengler B; Histology by mass spectrometry: label-free tissue characterization obtained from high-accuracy bioanalytical imaging., Angew Chem Int Ed Engl, 49, 22, 3834-8 (2014). https://doi.org/10.1002/anie.200905559, PubMed: 20397170.
    • imzML_DESI.zip is a DESI mass spectrometry imaging data set of human colorectal cancer tissue by Oetjen J, Veselkov K, Watrous J, McKenzie JS, Becker M, Hauberg-Lotte L, Kobarg JH, Strittmatter N, Mróz AK, Hoffmann F, Trede D, Palmer A, Schiffler S, Steinhorst K, Aichler M, Goldin R, Guntinas-Lichius O, von Eggeling F, Thiele H, Maedler K, Walch A, Maass P, Dorrestein PC, Takats Z, Alexandrov T. 2015. Benchmark datasets for 3D MALDI-and DESI-imaging mass spectrometry. GigaScience 4(1):2105 https://doi.org/10.1186/s13742-015-0059-4.
    • imzML_LA-ESI.zip is an LA-ESI mass spectrometry imaging data set of an Arabidopsis thaliana leaf by Zheng, Z., Bartels, B., & Svatoš, A. (2020). Laser Ablation Electrospray Ionization Mass Spectrometry Imaging (LAESI MSI) of Arabidopsis thaliana leaf [Data set]. Zenodo. https://doi.org/10.5281/zenodo.3678473.
    • imzML_LTP.zip was generated by low-temperature plasma ionization ambient mass spectrometry imaging of a chili fruit, published by Maldonado-Torres M, López-Hernández Jé F, Jiménez-Sandoval P, Winkler R. 2014. Plug and play' assembly of a low-temperature plasma ionization mass spectrometry imaging (LTP-MSI) system. Journal of Proteomics 102C:60–65 https://doi.org/10.1016/j.jprot.2014.03.003; Mauricio Maldonado-Torres, José Fabricio López-Hernández, Pedro Jiménez-Sandoval, & Robert Winkler. (2017). Low-temperature plasma mass spectrometry imaging (LTP-MSI) of Chili pepper [Data set]. In Journal of proteomics (Vol. 102, pp. 60–65). Zenodo. https://doi.org/10.5281/zenodo.484496.

    All these datasets are publicly available from different repositories; however, If you reuse them, please attribute the original authors!

  7. U

    Data and R Code to Derive Estimates of Groundwater Levels Using MOVE.1...

    • data.usgs.gov
    • catalog.data.gov
    Updated Sep 28, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dee-Ann Crozier; Elizabeth Ahearn (2024). Data and R Code to Derive Estimates of Groundwater Levels Using MOVE.1 Regression and Compute Monthly Percentiles for Select Wells in Massachusetts [Dataset]. http://doi.org/10.5066/P90PDI34
    Explore at:
    Dataset updated
    Sep 28, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Authors
    Dee-Ann Crozier; Elizabeth Ahearn
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    Oct 31, 1939 - Dec 31, 2021
    Area covered
    Massachusetts
    Description

    This data release contains extended estimates of daily groundwater levels and monthly percentiles at 27 short-term monitoring wells in Massachusetts. The Maintenance of Variance Extension Type 1 (MOVE.1) regression method was used to extend short-term groundwater levels at wells with less than 10 years of continuous data. This method uses groundwater level data from a correlated long-term monitoring well (index well) to estimate the groundwater level record for the short-term monitoring well. MOVE.1 regressions are used widely throughout the hydrologic community to extend flow records from streamgaging stations but are less commonly used to extend groundwater records at wells. The data in this data release document the results of the MOVE.1 regressions to estimate groundwater levels and compute updated monthly percentiles for select wells used in the groundwater index in the Massachusetts Drought Management Plan (2019). The U.S. Geological Survey (USGS) groundwater identification ...

  8. Data from: Brain mass, body mass and population density in mammals

    • figshare.com
    zip
    Updated Sep 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manuela Gonzalez Suarez (2021). Brain mass, body mass and population density in mammals [Dataset]. http://doi.org/10.6084/m9.figshare.12867305.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 29, 2021
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Manuela Gonzalez Suarez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Datasets and R scripts from González-Suárez, M; Gonzalez-Voyer, A; von Hardenberg, A; Santini, L (2021) The role of brain size on mammalian population densities Journal of Animal Ecology, 90: 653– 661. DOI: 10.1111/1365-2656.13397Additional details in the README.pdf file

    SUMMARY OF FILES INCLUDED

    • CSV datasets: total 21 files representing

    ·
    12 csv datasets from other sources (described below) with brain and body mass data in the zip file Brain and Mass data.

    ·
    Six csv datasets from other sources and compilations (described below) with population density and diet information

    ·
    Two csv files (Complete_dataset_published.csv, Brain_data_compilation_published.csv) produced during this study. Details of the files and the compilation protocol are provided in the README file.

    • R scripts: three scripts that describe the protocols to: combine brain and body mass data (brain_data_compilation_published.Rmd), combine all sources (Joining_all_data_published.Rmd), and analyse and produce tables and figures presented in the manuscript (Data_analysesPPA_published.Rmd).
  9. Additional file 2 of Raman2imzML converts Raman imaging data into the...

    • springernature.figshare.com
    application/x-rar
    Updated Jun 11, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefania Alexandra Iakab; Lluc Sementé; María García-Altares; Xavier Correig; Pere Ràfols (2023). Additional file 2 of Raman2imzML converts Raman imaging data into the standard mass spectrometry imaging format [Dataset]. http://doi.org/10.6084/m9.figshare.14433548.v1
    Explore at:
    application/x-rarAvailable download formats
    Dataset updated
    Jun 11, 2023
    Dataset provided by
    figshare
    Authors
    Stefania Alexandra Iakab; Lluc Sementé; María García-Altares; Xavier Correig; Pere Ràfols
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 2. Markdown for using the Raman2imzML package.

  10. Data from: (Appendix 2) Compilation of mass accumulation rates from deep sea...

    • doi.pangaea.de
    • search.dataone.org
    html, tsv
    Updated 1998
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter B deMenocal; William R Howard; Nina R Catubig; David E Archer; Roger Francois; Ein-Fen Yu (1998). (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the Holocene [Dataset]. http://doi.org/10.1594/PANGAEA.726363
    Explore at:
    html, tsvAvailable download formats
    Dataset updated
    1998
    Dataset provided by
    PANGAEA
    Authors
    Peter B deMenocal; William R Howard; Nina R Catubig; David E Archer; Roger Francois; Ein-Fen Yu
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Time period covered
    Oct 6, 1947 - Feb 1, 1988
    Area covered
    Variables measured
    Event label, Reference/source, Calcium carbonate, Latitude of event, Elevation of event, Longitude of event, DEPTH, sediment/rock, Opal, biogenic silica, Method/Device of event, Accumulation rate, mass, and 1 more
    Description

    This dataset is about: (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the Holocene. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.726364 for more information.

  11. f

    Dataset for R analyses v2

    • figshare.com
    xlsx
    Updated Sep 10, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Douglass Rovinsky (2020). Dataset for R analyses v2 [Dataset]. http://doi.org/10.6084/m9.figshare.12935222.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Sep 10, 2020
    Dataset provided by
    figshare
    Authors
    Douglass Rovinsky
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplemental Data for R analyses; each sheet should be saved out as its own .csv for R. This version contains the humerus and femur circumference metrics in sheet 2 (BodyMass_RegMtrx) that were missing in the version 1.

  12. Development of a Tool to Determine the Variability of Consensus Mass Spectra...

    • catalog.data.gov
    • data.nist.gov
    Updated Jul 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). Development of a Tool to Determine the Variability of Consensus Mass Spectra Supporting Data [Dataset]. https://catalog.data.gov/dataset/development-of-a-tool-to-determine-the-variability-of-consensus-mass-spectra-supporting-da
    Explore at:
    Dataset updated
    Jul 29, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    Supporting datasets and algorithms (R-based) for the manuscript entitled "Development of a Tool to Determine the Variability of Consensus Mass Spectra", including an R Markdown script to reproduce the manuscript's figures.

  13. The COSMOS2020 Galaxy Stellar Mass Function -- Key Measurements

    • zenodo.org
    • explore.openaire.eu
    zip
    Updated Sep 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John R. Weaver; John R. Weaver (2023). The COSMOS2020 Galaxy Stellar Mass Function -- Key Measurements [Dataset]. http://doi.org/10.5281/zenodo.7808833
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 25, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    John R. Weaver; John R. Weaver
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Here we describe the release of the measurements of the galaxy Stellar Mass Function and quiescent mass fractions based on the COSMOS2020 Farmer Catalogue and LePhare estimates of redshifts, masses, and rest-frame colours as described in Weaver et al. 2023 (arXiv:2212.02512v1).

    When using these data products please cite both this SMF paper (Weaver et al. 2023) and the COSMOS2020 Catalogue (Weaver et al. 2022). Links to ADS export citations:

    SMF | https://ui.adsabs.harvard.edu/abs/2022arXiv221202512W
    COSMOS2020 | https://ui.adsabs.harvard.edu/abs/2022ApJS..258...11W

    Please reach out if you have any questions or concerns.

  14. C

    Replication data for "rSIREM: an R package for MALDI spectral deconvolution"...

    • dataverse.csuc.cat
    bin, txt
    Updated Sep 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christoph Hauke Manfred Bookmeyer; Christoph Hauke Manfred Bookmeyer; Esteban Del Castillo Pérez; Esteban Del Castillo Pérez (2024). Replication data for "rSIREM: an R package for MALDI spectral deconvolution" [Dataset]. http://doi.org/10.34810/data1744
    Explore at:
    txt(5624), bin(37070894), bin(1125618216), bin(37822075), bin(829782432), bin(35471981), bin(2143487336)Available download formats
    Dataset updated
    Sep 19, 2024
    Dataset provided by
    CORA.Repositori de Dades de Recerca
    Authors
    Christoph Hauke Manfred Bookmeyer; Christoph Hauke Manfred Bookmeyer; Esteban Del Castillo Pérez; Esteban Del Castillo Pérez
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Dataset funded by
    European Research Executive Agency (REA)
    Description

    Replication data for the publication: "rSIREM: an R package for MALDI spectral deconvolution" by Del Castillo Pérez et al. The deposited data are SALDI-MSI data of three consectutive thin tissue sections from mouse cerebellum measured at the different mass resolutions at the same instrument (MALDI-MSI: Spectroglyph Injector - Orbitrap Exploris). The paper describes a new R package (rSIREM) to computationally improve the mass resolution of an MSI post-measurement. The developed R package (https://github.com/EdelCastillo/rSirem ) applies a statistical treatment on the concentration of spatial images obtained by separately considering each of the m/z over all the pixels. A representative scalar is associated with each image, obtained by applying a new measure (SIREM) to it, derived from Shannon's entropy. The perturbations of this measure, when considering a sequence of consecutive images, reveal the existence of overlap, if it exists. This information serves as a seed to initialize the EM algorithm in the Gaussian Mixture Model context. The efficiency of the method has been verified using three independent procedures.

  15. d

    Data from: Predicting body mass in Ruminantia using postcranial measurements...

    • datadryad.org
    zip
    Updated Dec 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexa Wimberly (2024). Predicting body mass in Ruminantia using postcranial measurements [Dataset]. http://doi.org/10.5061/dryad.8sf7m0ctf
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 27, 2024
    Dataset provided by
    Dryad
    Authors
    Alexa Wimberly
    Time period covered
    2023
    Description

    This dataset was collected by taking linear measurements on postcranial bones in museum collections. The data has been processed using R scripts.

  16. h

    The Measurement of R in e+ e- annihilation at center-of-mass energies...

    • hepdata.net
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Measurement of R in e+ e- annihilation at center-of-mass energies between 7.2-GeV and 10.34-GeV [Dataset]. http://doi.org/10.17182/hepdata.14321.v1
    Explore at:
    Description

    VEPP-4 collider. Measurement of R in e+ e- interactions in the centre-of-mass energy range 7.25 to 10.34 GeV using the MD-1 detector. Data corrected for background and radiative effects.

  17. d

    R script for analysis of DI-qTOF

    • search.dataone.org
    • borealisdata.ca
    Updated Jun 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bouvet, Corentin; Guéguen, Céline (2024). R script for analysis of DI-qTOF [Dataset]. http://doi.org/10.5683/SP3/4KFAWO
    Explore at:
    Dataset updated
    Jun 12, 2024
    Dataset provided by
    Borealis
    Authors
    Bouvet, Corentin; Guéguen, Céline
    Description

    Scripts for analysis of DI-qTOF data recorded for studies of refractory dissolved organic matter

  18. (Appendix 2) Compilation of mass accumulation rates from deep sea sediments...

    • doi.pangaea.de
    • search.dataone.org
    html, tsv
    Updated 1998
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter B deMenocal; William R Howard; Nina R Catubig; David E Archer; Roger Francois; Ein-Fen Yu (1998). (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the last glacial maximum [Dataset]. http://doi.org/10.1594/PANGAEA.58146
    Explore at:
    tsv, htmlAvailable download formats
    Dataset updated
    1998
    Dataset provided by
    PANGAEA
    Authors
    Peter B deMenocal; William R Howard; Nina R Catubig; David E Archer; Roger Francois; Ein-Fen Yu
    License

    Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
    License information was derived automatically

    Time period covered
    Oct 6, 1947 - Feb 1, 1988
    Area covered
    Variables measured
    AGE, Event label, Reference/source, Calcium carbonate, Latitude of event, Elevation of event, Longitude of event, Opal, biogenic silica, Method/Device of event, Accumulation rate, mass, and 1 more
    Description

    This dataset is about: (Appendix 2) Compilation of mass accumulation rates from deep sea sediments during the last glacial maximum. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.726364 for more information.

  19. T

    Proteo-SAFARI R Files

    • dataverse.tdl.org
    type/x-r-syntax
    Updated Jul 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Lanzillotti; Michael Lanzillotti (2024). Proteo-SAFARI R Files [Dataset]. http://doi.org/10.18738/T8/22EIRO
    Explore at:
    type/x-r-syntax(5977), type/x-r-syntax(787), type/x-r-syntax(5070), type/x-r-syntax(8338), type/x-r-syntax(1263), type/x-r-syntax(5796), type/x-r-syntax(19242), type/x-r-syntax(7006), type/x-r-syntax(7129), type/x-r-syntax(3163), type/x-r-syntax(4446), type/x-r-syntax(7419), type/x-r-syntax(4448)Available download formats
    Dataset updated
    Jul 18, 2024
    Dataset provided by
    Texas Data Repository
    Authors
    Michael Lanzillotti; Michael Lanzillotti
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Requisite R files for the Proteo-SAFARI app

  20. g

    Development of a Tool to Determine the Variability of Consensus Mass Spectra...

    • gimi9.com
    Updated Mar 23, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Development of a Tool to Determine the Variability of Consensus Mass Spectra Supporting Data | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_development-of-a-tool-to-determine-the-variability-of-consensus-mass-spectra-supporting-da
    Explore at:
    Dataset updated
    Mar 23, 2021
    Description

    Supporting datasets and algorithms (R-based) for the manuscript entitled "Development of a Tool to Determine the Variability of Consensus Mass Spectra", including an R Markdown script to reproduce the manuscript's figures.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
mexwell (2024). 🚀 Galaxy Mass Prediction [Dataset]. https://www.kaggle.com/datasets/mexwell/galaxy-mass-prediction
Organization logo

🚀 Galaxy Mass Prediction

Can you predict the galaxy mass with its brightness?

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 31, 2024
Dataset provided by
Kaggle
Authors
mexwell
Description

Motivation

Estimating the properties of galaxies, and even where they are, is a challenging process. The Rubin Observatory, a sky survey telescope located in Chile, will once it becomes operational image tens of billions of astronomical objects, the vast majority of which will be galaxies that have never been imaged before. Analyses of these data will require sophisticated methodologies, ones that will allow us to first determine where the galaxies are (i.e., how far away they are), and then conditional on the distance, how massive they are. Given galaxy distance and mass data, we can test theories of how the Universe evolves, by comparing simulated galaxy data with these data.

The Buzzard-V1.0 simulation was used to generate a realistic sample of Rubin Observatory data. In this dataset are measurements for 111,172 galaxies. Developers used these data to benchmark, e.g., methods for estimating galaxy distance. Here, we can assume the distance has been estimated well, and use these data to try to model galaxy mass as a function of brightness and distance.

Data

The dataset contains measures of magnitude and magnitude uncertainty in six astronomical bands (u for ultraviolet, g for green, r for red, i for infrared, and z and y for two additional infrared bands). Magnitude is a logarithmic measure of brightness, with an increase of 5 representing a decrease in brightness by a factor of 100, and with a value of zero being represented (roughly) by how the star Vega appears in the night sky. In addition, there is a redshift measured for each galaxy; it represents by how much light from the galaxy is stretched (by the expansion of Universe) as it travels to us. Thus higher redshifts represent larger distances. The last measurement is log.mass, which is the base-10 logarithm of the galaxy stellar mass in units of solar mass; for instance, log.mass = 10 means that the galaxy has a mass 10 billion times that of the Sun.

Variable Description

  • u Galaxy magnitude in Rubin u band (320.5-393.5 nm)
  • g Galaxy magnitude in Rubin g band (401.5-551.9 nm)
  • r Galaxy magnitude in Rubin r band (552.0-691.0 nm)
  • i Galaxy magnitude in Rubin i band (691.0-818.0 nm)
  • z Galaxy magnitude in Rubin z band (818.0-923.5 nm)
  • y Galaxy magnitude in Rubin y band (923.8-1084.5 nm)
  • u.err Uncertainty for u-band magnitude
  • g.err Uncertainty for g-band magnitude
  • r.err Uncertainty for r-band magnitude
  • i.err Uncertainty for i-band magnitude
  • z.err Uncertainty for z-band magnitude
  • y.err Uncertainty for y-band magnitude
  • log.mass Galaxy stellar mass (log-base-10 solar masses)
  • redshift Galaxy redshift

Questions

As noted above, the idea here is to learn a statistical association between measures of magnitude and distance, and galaxy mass.

One wrinkle here that analysts can exploit is that the data contain standard error estimates for the magnitudes (though not for redshift, for which, in practice, the error would be ).

References

Schmidt, Malz, Soo, Almosallam, Brescia, Cavuoti, Cohen-Tanugi, Connolly, DeRose, Freeman, Graham, Iyer, Jarvis, Kalmbach, Kovacs, Lee, Longo, Morrison, Newman, Nourbakhsh, Nuss, Pospisil, Tranin, Wechsler, Zhou, Izbicki, (The LSST Dark Energy Science Collaboration). “Evaluation of probabilistic photometric redshift estimation approaches for The Rubin Observatory Legacy Survey of Space and Time (LSST)”. Monthly Notices of the Royal Astronomical Society 499, December 2020, pages 1587–1606. https://doi.org/10.1093/mnras/staa2799

Acknowledgement

Foto from unsplash

Search
Clear search
Close search
Google apps
Main menu