24 datasets found
  1. q

    REMNet Tutorial, R Part 5: Normalizing Microbiome Data in R 5.2.19

    • qubeshub.org
    Updated Aug 28, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jessica Joyner (2019). REMNet Tutorial, R Part 5: Normalizing Microbiome Data in R 5.2.19 [Dataset]. http://doi.org/10.25334/M13H-XT81
    Explore at:
    Dataset updated
    Aug 28, 2019
    Dataset provided by
    QUBES
    Authors
    Jessica Joyner
    Description

    Video on normalizing microbiome data from the Research Experiences in Microbiomes Network

  2. Dataset supporting: Normalizing and denoising protein expression data from...

    • nih.figshare.com
    • figshare.com
    zip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Matthew P. Mulé; Andrew J. Martins; John Tsang (2023). Dataset supporting: Normalizing and denoising protein expression data from droplet-based single cell profiling [Dataset]. http://doi.org/10.35092/yhjc.13370915.v2
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Matthew P. Mulé; Andrew J. Martins; John Tsang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data for reproducing analysis in the manuscript:Normalizing and denoising protein expression data from droplet-based single cell profilinglink to manuscript: https://www.biorxiv.org/content/10.1101/2020.02.24.963603v1

    Data deposited here are for the purposes of reproducing the analysis results and figures reported in the manuscript above. These data are all publicly available downloaded and converted to R datasets prior to Dec 4, 2020. For a full description of all the data included in this repository and instructions for reproducing all analysis results and figures, please see the repository: https://github.com/niaid/dsb_manuscript.

    For usage of the dsb R package for normalizing CITE-seq data please see the repository: https://github.com/niaid/dsb

    If you use the dsb R package in your work please cite:Mulè MP, Martins AJ, Tsang JS. Normalizing and denoising protein expression data from droplet-based single cell profiling. bioRxiv. 2020;2020.02.24.963603.

    General contact: John Tsang (john.tsang AT nih.gov)

    Questions about software/code: Matt Mulè (mulemp AT nih.gov)

  3. Z

    Example subjects for Mobilise-D data standardization

    • data.niaid.nih.gov
    • zenodo.org
    Updated Oct 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Palmerini, Luca; Reggi, Luca; Bonci, Tecla; Del Din, Silvia; Micó-Amigo, Encarna; Salis, Francesca; Bertuletti, Stefano; Caruso, Marco; Cereatti, Andrea; Gazit, Eran; Paraschiv-Ionescu, Anisoara; Soltani, Abolfazl; Kluge, Felix; Küderle, Arne; Ullrich, Martin; Kirk, Cameron; Hiden, Hugo; D'Ascanio, Ilaria; Hansen, Clint; Rochester, Lynn; Mazzà, Claudia; Chiari, Lorenzo; on behalf of the Mobilise-D consortium (2022). Example subjects for Mobilise-D data standardization [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7185428
    Explore at:
    Dataset updated
    Oct 11, 2022
    Dataset provided by
    https://www.mobilise-d.eu/partners
    Newcastle University, Translational and Clinical Research Institute, Faculty of Medical Sciences, UK.
    Laboratory of Movement Analysis and Measurement, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland.
    University of Bologna, Department of Electrical, Electronic and Information Engineering 'Guglielmo Marconi', Italy.
    Newcastle University, Translational and Clinical Research Institute, Faculty of Medical Sciences, UK. The Newcastle upon Tyne NHS Foundation Trust, UK.
    Machine Learning and Data Analytics Lab, Department of Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-University Erlangen-Nürnberg, Germany.
    Tel Aviv Sourasky Medical Center, Center for the Study of Movement, Cognition and Mobility, Neurological Institute, Israel.
    Politecnico di Torino, Department of Electronics and Telecommunications, Italy. Politecnico di Torino, PolitoBIOMed Lab – Biomedical Engineering Lab, Italy.
    Newcastle University, School of Computing, UK.
    The University of Sheffield, INSIGNEO Institute for in silico Medicine, UK. The University of Sheffield, Department of Mechanical Engineering, UK
    University of Bologna, Health Sciences and Technologies—Interdepartmental Center for Industrial Research (CIRI-SDV), Italy
    University of Bologna, Department of Electrical, Electronic and Information Engineering 'Guglielmo Marconi', Italy. University of Bologna, Health Sciences and Technologies—Interdepartmental Center for Industrial Research (CIRI-SDV), Italy
    University of Sassari, Department of Biomedical Sciences, Italy.
    Politecnico di Torino, Department of Electronics and Telecommunications, Italy.
    Neurogeriatrics Kiel, Department of Neurology, University Hospital Schleswig-Holstein, Germany.
    Authors
    Palmerini, Luca; Reggi, Luca; Bonci, Tecla; Del Din, Silvia; Micó-Amigo, Encarna; Salis, Francesca; Bertuletti, Stefano; Caruso, Marco; Cereatti, Andrea; Gazit, Eran; Paraschiv-Ionescu, Anisoara; Soltani, Abolfazl; Kluge, Felix; Küderle, Arne; Ullrich, Martin; Kirk, Cameron; Hiden, Hugo; D'Ascanio, Ilaria; Hansen, Clint; Rochester, Lynn; Mazzà, Claudia; Chiari, Lorenzo; on behalf of the Mobilise-D consortium
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Standardized data from Mobilise-D participants (YAR dataset) and pre-existing datasets (ICICLE, MSIPC2, Gait in Lab and real-life settings, MS project, UNISS-UNIGE) are provided in the shared folder, as an example of the procedures proposed in the publication "Mobility recorded by wearable devices and gold standards: the Mobilise-D procedure for data standardization" that is currently under review in Scientific data. Please refer to that publication for further information. Please cite that publication if using these data.

    The code to standardize an example subject (for the ICICLE dataset) and to open the standardized Matlab files in other languages (Python, R) is available in github (https://github.com/luca-palmerini/Procedure-wearable-data-standardization-Mobilise-D).

  4. d

    Data from: Standardizing protocols for determining the cause of mortality in...

    • datadryad.org
    • datasetcatalog.nlm.nih.gov
    • +1more
    zip
    Updated Jun 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bogdan Cristescu; Mark Elbroch; Tavis Forrester; Maximilian Allen; Derek Spitz; Christopher Wilmers; Heiko Wittmer (2022). Standardizing protocols for determining the cause of mortality in wildlife studies [Dataset]. http://doi.org/10.7291/D1GD50
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 22, 2022
    Dataset provided by
    Dryad
    Authors
    Bogdan Cristescu; Mark Elbroch; Tavis Forrester; Maximilian Allen; Derek Spitz; Christopher Wilmers; Heiko Wittmer
    Time period covered
    May 31, 2022
    Description

    The datasets can be opened using Microsoft Excel and R.

  5. Z

    Rapid Creation of a Data Product for the World's Specimens of Horseshoe Bats...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mast, Austin R.; Paul, Deborah L.; Rios, Nelson; Bruhn, Robert; Dalton, Trevor; Krimmel, Erica R.; Pearson, Katelin D.; Sherman, Aja; Shorthouse, David P.; Simmons, Nancy B.; Soltis, Pam; Upham, Nathan; Abibou, Djihbrihou (2024). Rapid Creation of a Data Product for the World's Specimens of Horseshoe Bats and Relatives, a Known Reservoir for Coronaviruses [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3974999
    Explore at:
    Dataset updated
    Jul 18, 2024
    Dataset provided by
    Yale University Peabody Museum of Natural History
    American Museum of Natural History
    Agriculture and Agri-Food Canada
    Arizona State University
    University of Florida
    Florida State University
    Authors
    Mast, Austin R.; Paul, Deborah L.; Rios, Nelson; Bruhn, Robert; Dalton, Trevor; Krimmel, Erica R.; Pearson, Katelin D.; Sherman, Aja; Shorthouse, David P.; Simmons, Nancy B.; Soltis, Pam; Upham, Nathan; Abibou, Djihbrihou
    License

    https://creativecommons.org/licenses/publicdomain/https://creativecommons.org/licenses/publicdomain/

    Area covered
    World
    Description

    This repository is associated with NSF DBI 2033973, RAPID Grant: Rapid Creation of a Data Product for the World's Specimens of Horseshoe Bats and Relatives, a Known Reservoir for Coronaviruses (https://www.nsf.gov/awardsearch/showAward?AWD_ID=2033973). Specifically, this repository contains (1) raw data from iDigBio (http://portal.idigbio.org) and GBIF (https://www.gbif.org), (2) R code for reproducible data wrangling and improvement, (3) protocols associated with data enhancements, and (4) enhanced versions of the dataset published at various project milestones. Additional code associated with this grant can be found in the BIOSPEX repository (https://github.com/iDigBio/Biospex). Long-term data management of the enhanced specimen data created by this project is expected to be accomplished by the natural history collections curating the physical specimens, a list of which can be found in this Zenodo resource.

    Grant abstract: "The award to Florida State University will support research contributing to the development of georeferenced, vetted, and versioned data products of the world's specimens of horseshoe bats and their relatives for use by researchers studying the origins and spread of SARS-like coronaviruses, including the causative agent of COVID-19. Horseshoe bats and other closely related species are reported to be reservoirs of several SARS-like coronaviruses. Species of these bats are primarily distributed in regions where these viruses have been introduced to populations of humans. Currently, data associated with specimens of these bats are housed in natural history collections that are widely distributed both nationally and globally. Additionally, information tying these specimens to localities are mostly vague, or in many instances missing. This decreases the utility of the specimens for understanding the source, emergence, and distribution of SARS-COV-2 and similar viruses. This project will provide quality georeferenced data products through the consolidation of ancillary information linked to each bat specimen, using the extended specimen model. The resulting product will serve as a model of how data in biodiversity collections might be used to address emerging diseases of zoonotic origin. Results from the project will be disseminated widely in opensource journals, at scientific meetings, and via websites associated with the participating organizations and institutions. Support of this project provides a quality resource optimized to inform research relevant to improving our understanding of the biology and spread of SARS-CoV-2. The overall objectives are to deliver versioned data products, in formats used by the wider research and biodiversity collections communities, through an open-access repository; project protocols and code via GitHub and described in a peer-reviewed paper, and; sustained engagement with biodiversity collections throughout the project for reintegration of improved data into their local specimen data management systems improving long-term curation.

    This RAPID award will produce and deliver a georeferenced, vetted and consolidated data product for horseshoe bats and related species to facilitate understanding of the sources, distribution, and spread of SARS-CoV-2 and related viruses, a timely response to the ongoing global pandemic caused by SARS-CoV-2 and an important contribution to the global effort to consolidate and provide quality data that are relevant to understanding emergent and other properties the current pandemic. This RAPID award is made by the Division of Biological Infrastructure (DBI) using funds from the Coronavirus Aid, Relief, and Economic Security (CARES) Act.

    This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria."

    Files included in this resource

    9d4b9069-48c4-4212-90d8-4dd6f4b7f2a5.zip: Raw data from iDigBio, DwC-A format

    0067804-200613084148143.zip: Raw data from GBIF, DwC-A format

    0067806-200613084148143.zip: Raw data from GBIF, DwC-A format

    1623690110.zip: Full export of this project's data (enhanced and raw) from BIOSPEX, CSV format

    bionomia-datasets-attributions.zip: Directory containing 103 Frictionless Data packages for datasets that have attributions made containing Rhinolophids or Hipposiderids, each package also containing a CSV file for mismatches in person date of birth/death and specimen eventDate. File bionomia-datasets-attributions-key_2021-02-25.csv included in this directory provides a key between dataset identifier (how the Frictionless Data package files are named) and dataset name.

    bionomia-problem-dates-all-datasets_2021-02-25.csv: List of 21 Hipposiderid or Rhinolophid records whose eventDate or dateIdentified mismatches a wikidata recipient’s date of birth or death across all datasets.

    flagEventDate.txt: file containing term definition to reference in DwC-A

    flagExclude.txt: file containing term definition to reference in DwC-A

    flagGeoreference.txt: file containing term definition to reference in DwC-A

    flagTaxonomy.txt: file containing term definition to reference in DwC-A

    georeferencedByID.txt: file containing term definition to reference in DwC-A

    identifiedByNames.txt: file containing term definition to reference in DwC-A

    instructions-to-get-people-data-from-bionomia-via-datasetKey: instructions given to data providers

    RAPID-code_collection-date.R: code associated with enhancing collection dates

    RAPID-code_compile-deduplicate.R: code associated with compiling and deduplicating raw data

    RAPID-code_external-linkages-bold.R: code associated with enhancing external linkages

    RAPID-code_external-linkages-genbank.R: code associated with enhancing external linkages

    RAPID-code_external-linkages-standardize.R: code associated with enhancing external linkages

    RAPID-code_people.R: code associated with enhancing data about people

    RAPID-code_standardize-country.R: code associated with standardizing country data

    RAPID-data-dictionary.pdf: metadata about terms included in this project’s data, in PDF format

    RAPID-data-dictionary.xlsx: metadata about terms included in this project’s data, in spreadsheet format

    rapid-data-providers_2021-05-03.csv: list of data providers and number of records provided to rapid-joined-records_country-cleanup_2020-09-23.csv

    rapid-final-data-product_2021-06-29.zip: Enhanced data from BIOSPEX, DwC-A format

    rapid-final-gazetteer.zip: Gazetteer providing georeference data and metadata for 10,341 localities assessed as part of this project

    rapid-joined-records_country-cleanup_2020-09-23.csv: data product initial version where raw data has been compiled and deduplicated, and country data has been standardized

    RAPID-protocol_collection-date.pdf: protocol associated with enhancing collection dates

    RAPID-protocol_compile-deduplicate.pdf: protocol associated with compiling and deduplicating raw data

    RAPID-protocol_external-linkages.pdf: protocol associated with enhancing external linkages

    RAPID-protocol_georeference.pdf: protocol associated with georeferencing

    RAPID-protocol_people.pdf: protocol associated with enhancing data about people

    RAPID-protocol_standardize-country.pdf: protocol associated with standardizing country data

    RAPID-protocol_taxonomic-names.pdf: protocol associated with enhancing taxonomic name data

    RAPIDAgentStrings1_archivedCopy_30March2021.ods: resource used in conjunction with RAPID people protocol

    recordedByNames.txt: file containing term definition to reference in DwC-A

    Rhinolophid-HipposideridAgentStrings_and_People2_archivedCopy_30March2021.ods: resource used in conjunction with RAPID people protocol

    wikidata-notes-for-bat-collectors_leachman_2020: please see https://zenodo.org/record/4724139 for this resource

  6. Cyclistic

    • kaggle.com
    zip
    Updated May 12, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Salam Ibrahim (2022). Cyclistic [Dataset]. https://www.kaggle.com/datasets/salamibrahim/cyclistic
    Explore at:
    zip(209748131 bytes)Available download formats
    Dataset updated
    May 12, 2022
    Authors
    Salam Ibrahim
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    **Introduction ** This case study will be based on Cyclistic, a bike sharing company in Chicago. I will perform tasks of a junior data analyst to answer business questions. I will do this by following a process that includes the following phases: ask, prepare, process, analyze, share and act.

    Background Cyclistic is a bike sharing company that operates 5828 bikes within 692 docking stations. The company has been around since 2016 and separates itself from the competition due to the fact that they offer a variety of bike services including assistive options. Lily Moreno is the director of the marketing team and will be the person to receive these insights from this analysis.

    Case Study and business task Lily Morenos perspective on how to generate more income by marketing Cyclistics services correctly includes converting casual riders (one day passes and/or pay per ride customers) into annual riders with a membership. Annual riders are more profitable than casual riders according to the finance analysts. She would rather see a campaign targeting casual riders into annual riders, instead of launching campaigns targeting new costumers. So her strategy as the manager of the marketing team is simply to maximize the amount of annual riders by converting casual riders.

    In order to make a data driven decision, Moreno needs the following insights: - A better understanding of how casual riders and annual riders differ - Why would a casual rider become an annual one - How digital media can affect the marketing tactics

    Moreno has directed me to the first question - how do casual riders and annual riders differ?

    Stakeholders Lily Moreno, manager of the marketing team Cyclistic Marketing team Executive team

    Data sources and organization Data used in this report is made available and is licensed by Motivate International Inc. Personal data is hidden to protect personal information. Data used is from the past 12 months (01/04/2021 – 31/03/2022) of bike share dataset.

    By merging all 12 monthly bike share data provided, an extensive amount of data with 5,400,000 rows were returned and included in this analysis.

    Data security and limitations: Personal information is secured and hidden to prevent unlawful use. Original files are backed up in folders and subfolders.

    Tools and documentation of cleaning process The tools used for data verification and data cleaning are Microsoft Excel and R programming. The original files made accessible by Motivate International Inc. are backed up in their original format and in separate files.

    Microsoft Excel is used to generally look through the dataset and get a overview of the content. I performed simple checks of the data by filtering, sorting, formatting and standardizing the data to make it easily mergeable.. In Excel, I also changed data type to have the right format, removed unnecessary data if its incomplete or incorrect, created new columns to subtract and reformat existing columns and deleting empty cells. These tasks are easily done in spreadsheets and provides an initial cleaning process of the data.

    R will be used to perform queries of bigger datasets such as this one. R will also be used to create visualizations to answer the question at hand.

    Limitations Microsoft Excel has a limitation of 1,048,576 rows while the data of the 12 months combined are over 5,500,000 rows. When combining the 12 months of data into one table/sheet, Excel is no longer efficient and I switched over to R programming.

  7. WoSIS snapshot - December 2023

    • data.isric.org
    • repository.soilwise-he.eu
    Updated Dec 20, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ISRIC - World Soil Information (2023). WoSIS snapshot - December 2023 [Dataset]. https://data.isric.org/geonetwork/srv/api/records/e50f84e1-aa5b-49cb-bd6b-cd581232a2ec
    Explore at:
    www:link-1.0-http--related, www:link-1.0-http--link, www:download-1.0-ftp--downloadAvailable download formats
    Dataset updated
    Dec 20, 2023
    Dataset provided by
    International Soil Reference and Information Centre
    Authors
    ISRIC - World Soil Information
    Time period covered
    Jan 1, 1918 - Dec 1, 2022
    Area covered
    Description

    ABSTRACT: The World Soil Information Service (WoSIS) provides quality-assessed and standardized soil profile data to support digital soil mapping and environmental applications at broad scale levels. Since the release of the ‘WoSIS snapshot 2019’ many new soil data were shared with us, registered in the ISRIC data repository, and subsequently standardized in accordance with the licenses specified by the data providers. The source data were contributed by a wide range of data providers, therefore special attention was paid to the standardization of soil property definitions, soil analytical procedures and soil property values (and units of measurement). We presently consider the following soil chemical properties (organic carbon, total carbon, total carbonate equivalent, total Nitrogen, Phosphorus (extractable-P, total-P, and P-retention), soil pH, cation exchange capacity, and electrical conductivity) and physical properties (soil texture (sand, silt, and clay), bulk density, coarse fragments, and water retention), grouped according to analytical procedures (aggregates) that are operationally comparable. For each profile we provide the original soil classification (FAO, WRB, USDA, and version) and horizon designations as far as these have been specified in the source databases. Three measures for 'fitness-for-intended-use' are provided: positional uncertainty (for site locations), time of sampling/description, and a first approximation for the uncertainty associated with the operationally defined analytical methods. These measures should be considered during digital soil mapping and subsequent earth system modelling that use the present set of soil data. DATA SET DESCRIPTION: The 'WoSIS 2023 snapshot' comprises data for 228k profiles from 217k geo-referenced sites that originate from 174 countries. The profiles represent over 900k soil layers (or horizons) and over 6 million records. The actual number of measurements for each property varies (greatly) between profiles and with depth, this generally depending on the objectives of the initial soil sampling programmes. The data are provided in TSV (tab separated values) format and as GeoPackage. The zip-file (446 Mb) contains the following files: - Readme_WoSIS_202312_v2.pdf: Provides a short description of the dataset, file structure, column names, units and category values (this file is also available directly under 'online resources'). The pdf includes links to tutorials for downloading the TSV files into R respectively Excel. See also 'HOW TO READ TSV FILES INTO R AND PYTHON' in the next section. - wosis_202312_observations.tsv: This file lists the four to six letter codes for each observation, whether the observation is for a site/profile or layer (horizon), the unit of measurement and the number of profiles respectively layers represented in the snapshot. It also provides an estimate for the inferred accuracy for the laboratory measurements. - wosis_202312_sites.tsv: This file characterizes the site location where profiles were sampled. - wosis_2023112_profiles: Presents the unique profile ID (i.e. primary key), site_id, source of the data, country ISO code and name, positional uncertainty, latitude and longitude (WGS 1984), maximum depth of soil described and sampled, as well as information on the soil classification system and edition. Depending on the soil classification system used, the number of fields will vary . - wosis_202312_layers: This file characterises the layers (or horizons) per profile, and lists their upper and lower depths (cm). - wosis_202312_xxxx.tsv : This type of file presents results for each observation (e.g. “xxxx” = “BDFIOD” ), as defined under “code” in file wosis_202312_observation.tsv. (e.g. wosis_202311_bdfiod.tsv). - wosis_202312.gpkg: Contains the above datafiles in GeoPackage format (which stores the files within an SQLite database). HOW TO READ TSV FILES INTO R AND PYTHON: A) To read the data in R, please uncompress the ZIP file and specify the uncompressed folder. setwd("/YourFolder/WoSIS_2023_December/") ## For example: setwd('D:/WoSIS_2023_December/') Then use read_tsv to read the TSV files, specifying the data types for each column (c = character, i = integer, n = number, d = double, l = logical, f = factor, D = date, T = date time, t = time). observations = readr::read_tsv('wosis_202312_observations.tsv', col_types='cccciid') observations ## show columns and first 10 rows sites = readr::read_tsv('wosis_202312_sites.tsv', col_types='iddcccc') sites profiles = readr::read_tsv('wosis_202312_profiles.tsv', col_types='icciccddcccccciccccicccci') profiles layers = readr::read_tsv('wosis_202312_layers.tsv', col_types='iiciciiilcc') layers ## Do this for each observation 'XXXX', e.g. file 'Wosis_202312_orgc.tsv': orgc = readr::read_tsv('wosis_202312_orgc.tsv', col_types='iicciilccdccddccccc') orgc Note: One may also use the following R code (example is for file 'observations.tsv'): observations <- read.table("wosis_202312_observations.tsv", sep = "\t", header = TRUE, quote = "", comment.char = "", stringsAsFactors = FALSE ) B) To read the files into python first decompress the files to your selected folder. Then in python: # import the required library import pandas as pd # Read the observations data observations = pd.read_csv("wosis_202312_observations.tsv", sep="\t") # print the data frame header and some rows observations.head() # Read the sites data sites = pd.read_csv("wosis_202312_sites.tsv", sep="\t") # Read the profiles data profiles = pd.read_csv("wosis_202312_profiles.tsv", sep="\t") # Read the layers data layers = pd.read_csv("wosis_202312_layers.tsv", sep="\t") # Read the soil property data, e.g. 'cfvo' (do this for each observation) cfvo = pd.read_csv("wosis_202312_cfvo.tsv", sep="\t") CITATION: Calisto, L., de Sousa, L.M., Batjes, N.H., 2023. Standardised soil profile data for the world (WoSIS snapshot – December 2023), https://doi.org/10.17027/isric-wdcsoils-20231130 Supplement to: Batjes N.H., Calisto, L. and de Sousa L.M., 2023. Providing quality-assessed and standardised soil data to support global mapping and modelling (WoSIS snapshot 2023). Earth System Science Data, https://doi.org/10.5194/essd-16-4735-2024.

  8. d

    Methods for normalizing microbiome data: an ecological perspective

    • datadryad.org
    • data.niaid.nih.gov
    zip
    Updated Oct 30, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Donald T. McKnight; Roger Huerlimann; Deborah S. Bower; Lin Schwarzkopf; Ross A. Alford; Kyall R. Zenger (2018). Methods for normalizing microbiome data: an ecological perspective [Dataset]. http://doi.org/10.5061/dryad.tn8qs35
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 30, 2018
    Dataset provided by
    Dryad
    Authors
    Donald T. McKnight; Roger Huerlimann; Deborah S. Bower; Lin Schwarzkopf; Ross A. Alford; Kyall R. Zenger
    Time period covered
    Oct 19, 2018
    Description

    Simulation script 1This R script will simulate two populations of microbiome samples and compare normalization methods.Simulation script 2This R script will simulate two populations of microbiome samples and compare normalization methods via PcOAs.Sample.OTU.distributionOTU distribution used in the paper: Methods for normalizing microbiome data: an ecological perspective

  9. e

    Standardized NEON organismal data (neonDivData)

    • portal.edirepository.org
    bin, csv
    Updated Apr 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daijiang Li; Sydne Record; Eric Sokol; Matthew Bitters; Melissa Chen; Anny Chung; Matthew Helmus; Ruvi Jaimes; Lara Jansen; Marta Jarzyna; Michael Just; Jalene LaMontagne; Brett Melbourne; Wynne Moss; Kari Norman; Stephanie Parker; Natalie Robinson; Bijan Seyednasrollah; Sarah Spaulding; Thilina Surasinghe; Sarah Thomsen; Phoebe Zarnetske (2022). Standardized NEON organismal data (neonDivData) [Dataset]. http://doi.org/10.6073/pasta/c28dd4f6e7989003505ea02e9a92afbf
    Explore at:
    csv(67793652 bytes), csv(266884330 bytes), csv(4643854 bytes), csv(12011 bytes), csv(944312 bytes), csv(6879 bytes), csv(25181268 bytes), csv(1949590 bytes), csv(375200 bytes), csv(3062147 bytes), csv(35160044 bytes), csv(738408 bytes), csv(18427828 bytes), csv(604110 bytes), csv(35684117 bytes), csv(86101256 bytes), bin(20729 bytes), bin(4674 bytes)Available download formats
    Dataset updated
    Apr 12, 2022
    Dataset provided by
    EDI
    Authors
    Daijiang Li; Sydne Record; Eric Sokol; Matthew Bitters; Melissa Chen; Anny Chung; Matthew Helmus; Ruvi Jaimes; Lara Jansen; Marta Jarzyna; Michael Just; Jalene LaMontagne; Brett Melbourne; Wynne Moss; Kari Norman; Stephanie Parker; Natalie Robinson; Bijan Seyednasrollah; Sarah Spaulding; Thilina Surasinghe; Sarah Thomsen; Phoebe Zarnetske
    Time period covered
    Jun 5, 2013 - Jul 28, 2020
    Area covered
    Variables measured
    sex, unit, year, State, endRH, month, sites, units, value, boutID, and 113 more
    Description

    To standardize NEON organismal data for major taxonomic groups, we first systematically reviewed NEON’s documentations for each taxonomic group. We then discussed as a group and with NEON staff to decide how to wrangle and standardize NEON organismal data. See Li et al. 2022 for more details. All R code to process NEON data products can be obtained through the R package ‘ecocomDP’. Once the data are in ecocomDP format, we further processed them to convert them into long data frames with code on Github (https://github.com/daijiang/neonDivData/tree/master/data-raw), which is also archived here.

  10. Additional file 3: of DBNorm: normalizing high-density oligonucleotide...

    • springernature.figshare.com
    txt
    Updated Nov 30, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qinxue Meng; Daniel Catchpoole; David Skillicorn; Paul Kennedy (2017). Additional file 3: of DBNorm: normalizing high-density oligonucleotide microarray data based on distributions [Dataset]. http://doi.org/10.6084/m9.figshare.5648932.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Nov 30, 2017
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Qinxue Meng; Daniel Catchpoole; David Skillicorn; Paul Kennedy
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    DBNorm test script. Code of how we test DBNorm package. (TXT 2Â kb)

  11. XPRESSyourself: Enhancing, standardizing, and automating ribosome profiling...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    pdf
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jordan A. Berg; Jonathan R. Belyeu; Jeffrey T. Morgan; Yeyun Ouyang; Alex J. Bott; Aaron R. Quinlan; Jason Gertz; Jared Rutter (2023). XPRESSyourself: Enhancing, standardizing, and automating ribosome profiling computational analyses yields improved insight into data [Dataset]. http://doi.org/10.1371/journal.pcbi.1007625
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jordan A. Berg; Jonathan R. Belyeu; Jeffrey T. Morgan; Yeyun Ouyang; Alex J. Bott; Aaron R. Quinlan; Jason Gertz; Jared Rutter
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Ribosome profiling, an application of nucleic acid sequencing for monitoring ribosome activity, has revolutionized our understanding of protein translation dynamics. This technique has been available for a decade, yet the current state and standardization of publicly available computational tools for these data is bleak. We introduce XPRESSyourself, an analytical toolkit that eliminates barriers and bottlenecks associated with this specialized data type by filling gaps in the computational toolset for both experts and non-experts of ribosome profiling. XPRESSyourself automates and standardizes analysis procedures, decreasing time-to-discovery and increasing reproducibility. This toolkit acts as a reference implementation of current best practices in ribosome profiling analysis. We demonstrate this toolkit’s performance on publicly available ribosome profiling data by rapidly identifying hypothetical mechanisms related to neurodegenerative phenotypes and neuroprotective mechanisms of the small-molecule ISRIB during acute cellular stress. XPRESSyourself brings robust, rapid analysis of ribosome-profiling data to a broad and ever-expanding audience and will lead to more reproducible and accessible measurements of translation regulation. XPRESSyourself software is perpetually open-source under the GPL-3.0 license and is hosted at https://github.com/XPRESSyourself, where users can access additional documentation and report software issues.

  12. Additional file 4: of DBNorm: normalizing high-density oligonucleotide...

    • springernature.figshare.com
    txt
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qinxue Meng; Daniel Catchpoole; David Skillicorn; Paul Kennedy (2023). Additional file 4: of DBNorm: normalizing high-density oligonucleotide microarray data based on distributions [Dataset]. http://doi.org/10.6084/m9.figshare.5648956.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Qinxue Meng; Daniel Catchpoole; David Skillicorn; Paul Kennedy
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    DBNorm installation. Describes how to install DBNorm via devtools in R. (TXT 4Â kb)

  13. g

    2014 Wind Turbine Gearbox Damage Distribution based on the NREL Gearbox...

    • gimi9.com
    Updated Nov 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). 2014 Wind Turbine Gearbox Damage Distribution based on the NREL Gearbox Reliability Database | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_2014-wind-turbine-gearbox-damage-distribution-based-on-the-nrel-gearbox-reliability-databa/
    Explore at:
    Dataset updated
    Nov 6, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Despite the improvements in wind turbine gearbox design and manufacturing practices, the wind industry is still challenged by premature wind turbine gearbox failures. To help address this industry-wide challenge, a consortium called the Gearbox Reliability Collaborative (GRC) was launched by the U.S. Department of Energy's (DOE's) National Renewable Energy Laboratory (NREL) in 2007. It brings together the parties involved in the wind turbine gearbox supply chain to investigate possible root causes of premature wind turbine gearbox failures and research improvement strategies. One area of work under the GRC is a collection and analysis effort of failure data titled the Gearbox Reliability Database (GRD), which was started in 2009. The main objectives of the GRD are to categorize top wind turbine gearbox failure modes, identify possible root causes, and direct future wind turbine gearbox reliability research and development (R) activities. The GRD project has made huge progress in developing wind turbine gearbox reliability data collection tools, standardizing data recording practices, populating the database, and informing the industry with up-to-date R findings and statistics. The project currently has more than 20 data-sharing partners, including wind turbine and wind turbine gearbox manufacturers and owners/operators, gearbox rebuild shops, and operation and maintenance service providers. The assets represented by only owner/operator partners on this project comprised approximately 34% of the U.S. capacity at the end of 2013 (according to the American Wind Energy Association annual market report). The number and variety of partners and the assets they represent demonstrate the value and need of major wind turbine component data collection and analysis to the industry. The attached image shows the distribution of the damage component locations based on approximately 320 confirmable wind turbine gearbox damage records stored in the database. It is observed that wind turbine gearboxes could fail in drastically different ways. The majority of the damage occurs to bearings (64%), followed by gears (25%), and the other components account for 11% of the failures. Among the other components, lubrication and filtration system problems are dominant. Both bearing and gear faults are concentrated in the parallel section, which aligns with field observations made by wind turbine owner/operator partners. The top gearbox failure is axial cracks that occur to bearings located at the high- or intermediate-speed stage. This identification confirms the value and need for wind turbine gearbox R on bearing axial cracks root causes and mitigation methods, which is a joint research effort by Argonne National Laboratory and NREL funded by DOE's Wind and Water Power Program. The data-sharing partners highly value this project and recommend that NREL generate industry-wide reliability benchmarking statistics from the information contained in the database, which is currently not publicly available. Frequently, these reliability statistics are distorted and kept internally if they are generated by wind turbine original equipment manufacturers or owners/operators, which do not normally have a balanced representation of wind turbine makers and models. The GRD experiences provide the industry with valuable support to standardize reliability data collection for major wind turbine components and subsystems.

  14. r

    BEDMAP3 - Ice thickness, bed and surface elevation for Antarctica -...

    • researchdata.edu.au
    Updated 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    A. Zirizzotti; Duncan Young; D.S. Wilson; B. Welch; David Vaughan; S. Urbini; Kirsty Tinto; I. Tabacco; B. Sun; M. Studinger; D. Steinhage; Andrew Smith; Martin Siegert; Dustin Schroeder; A. Ruppel; Neil Ross; Jason Roberts; Andres Rivera; David Rippin; Eric Rignot; M. Riger-Kusk; S. Popov; F. Pattyn; John Paden; O. Nost; Y. Nogi; Frank Nitsche; Jeremie Mouginot; Mathieu Morlighem; Kenichi Matsuoka; E. MacKie; J. MacGregor; B. Luyendyk; C. Leuschen; G. Leitchenkov; J. Lee; K. Langley; W. Krabill; Jack Kohler; Edward King; Tom Jordan; Wilfried Jokat; Adrian Jenkins; D. Jansen; R. Jacobel; A. Humbert; J.W. Holt; N. Holschuh; P. Holmlund; Richard Hindmarsh; B. Hills; J.S. Greenbaum; P. Gogineni; V. Goel; Y. Gim; S. Fujita; S Franke; Rene Forsberg; Fausto Ferraccioli; H. Eisermann; O. Eisen; G. Eagles; R. Drews; V. Damn; D Damaske; X. Cui; Hugh Corr; H Conway; K Christianson; Ginny Catania; G. Casassa; Donald Blankenship; Robert Bingham; C Bianchi; R.E. Bell; Jonathan Bamber; A. Aitken; Hamish Pritchard; Julien Bodart; Peter Fretwell; Alice Fremand; School of Earth and Oceans (2022). BEDMAP3 - Ice thickness, bed and surface elevation for Antarctica - standardised data points [Dataset]. http://doi.org/10.5285/91523FF9-D621-46B3-87F7-FFB6EFCD1847
    Explore at:
    Dataset updated
    2022
    Dataset provided by
    NERC EDS UK Polar Data Centre
    The University of Western Australia
    Authors
    A. Zirizzotti; Duncan Young; D.S. Wilson; B. Welch; David Vaughan; S. Urbini; Kirsty Tinto; I. Tabacco; B. Sun; M. Studinger; D. Steinhage; Andrew Smith; Martin Siegert; Dustin Schroeder; A. Ruppel; Neil Ross; Jason Roberts; Andres Rivera; David Rippin; Eric Rignot; M. Riger-Kusk; S. Popov; F. Pattyn; John Paden; O. Nost; Y. Nogi; Frank Nitsche; Jeremie Mouginot; Mathieu Morlighem; Kenichi Matsuoka; E. MacKie; J. MacGregor; B. Luyendyk; C. Leuschen; G. Leitchenkov; J. Lee; K. Langley; W. Krabill; Jack Kohler; Edward King; Tom Jordan; Wilfried Jokat; Adrian Jenkins; D. Jansen; R. Jacobel; A. Humbert; J.W. Holt; N. Holschuh; P. Holmlund; Richard Hindmarsh; B. Hills; J.S. Greenbaum; P. Gogineni; V. Goel; Y. Gim; S. Fujita; S Franke; Rene Forsberg; Fausto Ferraccioli; H. Eisermann; O. Eisen; G. Eagles; R. Drews; V. Damn; D Damaske; X. Cui; Hugh Corr; H Conway; K Christianson; Ginny Catania; G. Casassa; Donald Blankenship; Robert Bingham; C Bianchi; R.E. Bell; Jonathan Bamber; A. Aitken; Hamish Pritchard; Julien Bodart; Peter Fretwell; Alice Fremand; School of Earth and Oceans
    Time period covered
    Jan 1, 1971 - Dec 31, 2020
    Area covered
    Description

    We present here the Bedmap3 ice thickness, bed and surface elevation standardised CSV data points that are used to create the Bedmap3 gridding products in addition to the previous data releases. The data consists of 50 million points acquired by 17 different data providers in Antarctica. The associated datasets consist of: - Bedmap1 standardised CSV data points: https://doi.org/10.5285/f64815ec-4077-4432-9f55-0ce230f46029 - Bedmap2 standardised CSV data points: https://doi.org/10.5285/2fd95199-365e-4da1-ae26-3b6d48b3e6ac - Bedmap3 statistically-summarised data points (shapefiles): https://doi.org/10.5285/a72a50c6-a829-4e12-9f9a-5a683a1acc4a This work is supported by the SCAR Bedmap project and the British Antarctic Survey's core programme: National Capability - Polar Expertise Supporting UK Research

  15. n

    Dataset: A three-dimensional approach to general plant fire syndromes

    • data.niaid.nih.gov
    • datadryad.org
    zip
    Updated Jan 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pedro Jaureguiberry; Sandra Díaz (2023). Dataset: A three-dimensional approach to general plant fire syndromes [Dataset]. http://doi.org/10.5061/dryad.j6q573njb
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 27, 2023
    Dataset provided by
    Instituto Multidisciplinario de Biología Vegetal (CONICET-Universidad Nacional de Córdoba) and FCEFyN
    Authors
    Pedro Jaureguiberry; Sandra Díaz
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description
    1. Plant fire syndromes are usually defined as combinations of fire response traits, the most common being resprouting (R) and seeding (S). Plant flammability (F), on the other hand, refers to a plant’s effects on communities and ecosystems. Despite its important ecological and evolutionary implications, F has rarely been considered to define plant fire syndromes and, if so, usually separated from response syndromes.
    2. We propose a three-dimensional model that combines R, S and F, encapsulating both plant response to fire regimes and the capacity to promote them. Each axis is divided into three possible standardized categories, reflecting low, medium and high values of each variable, with a total of 27 possible combinations of R, S and F.
    3. We hypothesized that different fire histories should be reflected in the position of species within the three-dimensional space and that this should help assess the importance of fire as an evolutionary force in determining R-S-F syndromes.
    4. To illustrate our approach we compiled information on the fire syndromes of 24 dominant species of different growth forms from the Chaco seasonally-dry forest of central Argentina, and we compared them to 33 species from different Mediterranean-type climate ecosystems (MTCEs) of the world.
    5. Chaco and MTCEs species differed in the range (seven syndromes vs. thirteen syndromes, respectively) and proportion of extreme syndromes (i.e. species with extreme values of R, S and/or F) representing 29% of species in the Chaco vs. 45% in the MTCEs.
    6. Additionally, we explored the patterns of R, S and F of 4032 species from seven regions with contrasting fire histories, and found significantly higher frequencies of extreme values (predominantly high) of all three variables in MTCEs compared to the other regions, where intermediate and low values predominated, broadly supporting our general hypothesis.
    7. The proposed three-dimensional approach should help standardize comparisons of fire syndromes across taxa, growth forms and regions with different fire histories. This will contribute to the understanding of the role of fire in the evolution of plant traits and assist vegetation modelling in the face of changes in fire regimes. Methods Data collection for Chaco species From previous studies, we compiled data on post-fire resprouting (R) (Jaureguiberry 2012; Jaureguibery et al. 2020), germination capacity after heat shock treatments (S) (Jaureguiberry & Díaz) and flammability (F) (Jaureguiberry et al. 2011) of 24 dominant species of the seasonally-dry Chaco forest of central Argentina (hereafter Chaco). We then transformed the original data from the mentioned studies into three possible categorical ordinal values: 1, 2 or 3, indicating low, medium and high values of each variable, respectively. To do so, we used the following criteria: 1) For R data: we focused on the survival percentage recorded for each species (Jaureguiberry et al., 2020) as a proxy for resprouting capacity (Pérez-Harguindeguy et al., 2013). This was because this variable is widely used in fire studies and has a standard scale and range of values, therefore facilitating comparisons between species from different regions. Survival percentages were assigned to one of three possible intervals: 0 to 33 %; 34 to 66 % and from 67 to 100%, and then each interval was assigned the value 1, 2 or 3 respectively, indicating low, medium and high values of resprouting capacity. 2) For S data: based on germination response to heat shock treatments we classified species as heat-sensitive (germination lower than the control), heat-tolerant (germination similar to the control) or heat-stimulated (germination higher than the control) (see details in Jaureguiberry and Díaz 2015). Each of these categories was respectively assigned a value of 1, 2 or 3. 3) For F data: while original measurements included burning rate, maximum temperature and biomass consumed (see details in Jaureguiberry et al. 2011), with the purpose of comparing Chaco species with species from other regions, and considering that burning rate is rarely reported, data of the two latter variables were collected from studies that followed Jaureguiberry et al. (2011). A PCA followed by cluster analysis allowed classifying species into the following categories: 1=low flammability; 2=moderate flammability; and 3=high flammability.

    Data collection for other regions We performed an unstructured literature review of fire-related traits relevant to our model. Whenever possible, we searched for the same or similar variables to those used for the Chaco, namely survival percentage, germination response to heat shock, and variables related to flammability (e.g. maximum temperature, biomass consumed and burning rate), as proxies for R, S and F, respectively. Classification into different R intervals was based either on quantitative data on survival percentage, or on qualitative information from major databases. For example, resprouting capacity reported as “low”, or “high” (e.g. Tavşanoğlu & Pausas, 2018) were assigned R values of 1 and 3, respectively. For Southern Australian species, those reported as “fire killed” and “weak resprouting” (Falster et al., 2021) were assigned a value of 1, while those reported as “intermediate resprouting” and “strong resprouting” were assigned values of 2 and 3, respectively. The vast majority of records in our dataset refer to resprouting of individuals one growing season after the fire. Flammability data for most of the species were based on quantitative measurements that have used the method of Jaureguiberry et al. (2011), which was standardised following the criteria explained earlier. However, for some species, classification was based either on other quantitative measures that followed other methodologies (e.g. measures based on plant parts such as twigs or leaves, or fuel beds) or on qualitative classifications reported in the literature (most of which are in turn based on reviews of quantitative measurements from previous studies). We standardised the original data collected for the other regions following the same approach as for the Chaco. We then built contingency tables to analyse each region and to compare between regions. The curated total number of records from our literature review was 4411 (records for R, S and F, were 3399, 678 and 334, respectively) for 4,032 species (many species had information on two variables, and very few on the three variables). The database covers a wide taxonomic range, encompassing species from approximately 1,250 genera and 180 botanical families, belonging to ten different growth forms, and coming from seven major regions with a wide range of evolutionary histories of fire, from long and intense (Mediterranean-Type Climate Ecosystems) to very recent (New Zealand).

  16. f

    Opportunities for Integrated Ecological Analysis across Inland Australia...

    • plos.figshare.com
    xlsx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Greg R. Guerin; Ben Sparrow; Andrew Tokmakoff; Anita Smyth; Emrys Leitch; Zdravko Baruch; Andrew J. Lowe (2023). Opportunities for Integrated Ecological Analysis across Inland Australia with Standardised Data from Ausplots Rangelands [Dataset]. http://doi.org/10.1371/journal.pone.0170137
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Greg R. Guerin; Ben Sparrow; Andrew Tokmakoff; Anita Smyth; Emrys Leitch; Zdravko Baruch; Andrew J. Lowe
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Australia
    Description

    Australian rangelands ecosystems cover 81% of the continent but are understudied and continental-scale research has been limited in part by a lack of precise data that are standardised between jurisdictions. We present a new dataset from AusPlots Rangelands that enables integrative rangelands analysis due to its geographic scope and standardised methodology. The method provides data on vegetation and soils, enabling comparison of a suite of metrics including fractional vegetation cover, basal area, and species richness, diversity, and composition. Cover estimates are robust and repeatable, allowing comparisons among environments and detection of modest change. The 442 field plots presented here span a rainfall gradient of 129–1437 mm Mean annual precipitation with varying seasonality. Vegetation measurements include vouchered vascular plant species, growth form, basal area, height, cover and substrate type from 1010 point intercepts as well as systematically recorded absences, which are useful for predictive modelling and validation of remote sensing applications. Leaf and soil samples are sampled for downstream chemical and genomic analysis. We overview the sampling of vegetation parameters and environments, applying the data to the question of how species abundance distributions (SADs) vary over climatic gradients, a key question for the influence of environmental change on ecosystem processes. We found linear relationships between SAD shape and rainfall within grassland and shrubland communities, indicating more uneven abundance in deserts and suggesting relative abundance may shift as a consequence of climate change, resulting in altered diversity and ecosystem function. The standardised data of AusPlots enables such analyses at large spatial scales, and the testing of predictions through time with longitudinal sampling. In future, the AusPlots field program will be directed towards improving coverage of space, under-represented environments, vegetation types and fauna and, increasingly, re-sampling of established plots. Providing up-to-date data access methods to enhance re-use is also a priority.

  17. Means, standard deviations and standardised factor loadings for PDSS-R data....

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maria E. Pushpanathan; Andrea M. Loftus; Natalie Gasson; Meghan G. Thomas; Caitlin F. Timms; Michelle Olaithe; Romola S. Bucks (2023). Means, standard deviations and standardised factor loadings for PDSS-R data. [Dataset]. http://doi.org/10.1371/journal.pone.0192394.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Maria E. Pushpanathan; Andrea M. Loftus; Natalie Gasson; Meghan G. Thomas; Caitlin F. Timms; Michelle Olaithe; Romola S. Bucks
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Means, standard deviations and standardised factor loadings for PDSS-R data.

  18. Data analysis for Fig 3: PLIN2 and β-actin.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Jun 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alice Maestri; Ewa Ehrenborg; Olivera Werngren; Maria Olin; Carolina E. Hagberg; Matteo Pedrelli; Paolo Parini (2025). Data analysis for Fig 3: PLIN2 and β-actin. [Dataset]. http://doi.org/10.1371/journal.pone.0325052.s014
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 12, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Alice Maestri; Ewa Ehrenborg; Olivera Werngren; Maria Olin; Carolina E. Hagberg; Matteo Pedrelli; Paolo Parini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    a) Raw data of PLIN2 from Figure 3a and t-WB analysis for Fig 3b, 3c. b) Raw data of β-actin from Fig 3a and t-WB analysis for Figure 3d,e. c) PLIN2 normalization to β-actin expressed as fold change compared to control at 2.5 μg protein loading for Fig 3f (left panel). d) PLIN2 normalization to β-actin expressed as fold change compared to control at 5 μg protein loading for Fig 3f (middle panel). e) PLIN2 normalization to β-actin expressed as fold change compared to control at 10 μg protein loading for Fig 3f (right panel). (XLSX)

  19. f

    Data analysis for Fig 2 and Supplementary Fig 1: LDLR.

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Jun 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alice Maestri; Ewa Ehrenborg; Olivera Werngren; Maria Olin; Carolina E. Hagberg; Matteo Pedrelli; Paolo Parini (2025). Data analysis for Fig 2 and Supplementary Fig 1: LDLR. [Dataset]. http://doi.org/10.1371/journal.pone.0325052.s011
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 12, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Alice Maestri; Ewa Ehrenborg; Olivera Werngren; Maria Olin; Carolina E. Hagberg; Matteo Pedrelli; Paolo Parini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    a) Raw data of LDLR from Figure 1c and 2a and t-WB analysis for Fig 2b, 2c. b) LDLR signals normalized to WB1 at 20 μg (red) and t-WB analysis for S3 Fig 1a, 1b. c) LDLR signals normalized to WB1 at 40 μg (red) and t-WB analysis for S3 Fig 1c, 1d. d) LDLR signals normalized to WB1 at 60 μg (red) and t-WB analysis for Fig 2d, 2e. (XLSX)

  20. f

    Data analysis for Fig 1.

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated Jun 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alice Maestri; Ewa Ehrenborg; Olivera Werngren; Maria Olin; Carolina E. Hagberg; Matteo Pedrelli; Paolo Parini (2025). Data analysis for Fig 1. [Dataset]. http://doi.org/10.1371/journal.pone.0325052.s010
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 12, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Alice Maestri; Ewa Ehrenborg; Olivera Werngren; Maria Olin; Carolina E. Hagberg; Matteo Pedrelli; Paolo Parini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    a) Arbitrary values and t-WB analysis used for schematic in Fig 1b. b) Raw data of LDLR from Fig 1c and t-WB analysis for Fig 1d. c) Arbitrary values and t-WB analysis used for schematic in Fig 1e. d) Raw data of PLIN3 from Fig 1f and t-WB analysis for Fig 1g. e) Arbitrary values and t-WB analysis used for schematic in Fig 1h. f) Raw data of HSC70 from Fig 1i and t-WB analysis for Fig 1j. (XLSX)

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jessica Joyner (2019). REMNet Tutorial, R Part 5: Normalizing Microbiome Data in R 5.2.19 [Dataset]. http://doi.org/10.25334/M13H-XT81

REMNet Tutorial, R Part 5: Normalizing Microbiome Data in R 5.2.19

Explore at:
Dataset updated
Aug 28, 2019
Dataset provided by
QUBES
Authors
Jessica Joyner
Description

Video on normalizing microbiome data from the Research Experiences in Microbiomes Network

Search
Clear search
Close search
Google apps
Main menu