100+ datasets found
  1. Road-R Dataset Sample

    • kaggle.com
    zip
    Updated Aug 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sciencestoked (2023). Road-R Dataset Sample [Dataset]. https://www.kaggle.com/datasets/sciencestoked/road-r-dataset-sample
    Explore at:
    zip(186910332 bytes)Available download formats
    Dataset updated
    Aug 17, 2023
    Authors
    sciencestoked
    Description

    Dataset

    This dataset was created by sciencestoked

    Contents

  2. random-points-sampling-r

    • kaggle.com
    zip
    Updated Nov 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JORGE GARCIA-INIGUEZ (2023). random-points-sampling-r [Dataset]. https://www.kaggle.com/datasets/jorgegarciainiguez/random-points-sampling-r
    Explore at:
    zip(712850 bytes)Available download formats
    Dataset updated
    Nov 18, 2023
    Authors
    JORGE GARCIA-INIGUEZ
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by JORGE GARCIA-INIGUEZ

    Released under MIT

    Contents

  3. q

    Chapter 2: Data sampling, accuracy, and precision

    • qubeshub.org
    Updated Dec 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raisa Hernández-Pacheco; Alexis Diaz (2020). Chapter 2: Data sampling, accuracy, and precision [Dataset]. http://doi.org/10.25334/66G4-8G08
    Explore at:
    Dataset updated
    Dec 23, 2020
    Dataset provided by
    QUBES
    Authors
    Raisa Hernández-Pacheco; Alexis Diaz
    Description

    Biostatistics Using R: A Laboratory Manual was created with the goals of providing biological content to lab sessions by using authentic research data and introducing R programming language. Chapter 2 introduces sampling, accuracy, and precision.

  4. R script and input data for "ALL-EMA sampling design"

    • envidat.ch
    .r, .txt +1
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Klaus Ecker; Yves Tillé (2025). R script and input data for "ALL-EMA sampling design" [Dataset]. http://doi.org/10.16904/envidat.402
    Explore at:
    not available, .txt, .rAvailable download formats
    Dataset updated
    May 28, 2025
    Dataset provided by
    Swiss Federal Institute for Forest, Snow and Landscape Research
    Institute of Statistics, University of Neuchatel
    Authors
    Klaus Ecker; Yves Tillé
    Area covered
    Switzerland
    Dataset funded by
    FOEN
    Description

    License: GPL-v2 The R script presents an advanced sampling approach for monitoring biodiversity on agricultural land by combining multiple objectives and integrating environmental and geographic space. The example demonstrates the first-stage selection of squares (km2) in the ALL-EMA sampling design using modern sampling techniques such as unequal probability sampling with fixed sample size, balanced sampling, stratified balancing and geographic spreading. Sampling is done with unequal probabilities and weights defined by power allocation to give equal weight to extrapolations to the total agricultural area of Switzerland and two stratifications of predefined interest (regions and agricultural production zones). Calibration is used to limit the distribution of the sampling weights. The sample sizes are almost fixed within the strata and evenly distributed across the years of a temporal rotation plan, which is favourable for the organisation of the field survey. Sampling also ensures an optimal (annual) distribution across geographic space, including altitude. Despite the complexity of the sampling, estimation based on probability theory is straightforward. Ecker, K.T., Meier, E.S. & Tillé, Y. 2023. Integrating spatial and ecological information into comprehensive biodiversity monitoring on agricultural land. Environmental Monitoring and Assessment 195.

  5. Summary statistics of population and samples taken at different sampling...

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maria M; Ibrahim M. Almanjahie; Muhammad Ismail; Ammara Nawaz Cheema (2023). Summary statistics of population and samples taken at different sampling schemes for n = 4, r = 1. [Dataset]. http://doi.org/10.1371/journal.pone.0275340.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Maria M; Ibrahim M. Almanjahie; Muhammad Ismail; Ammara Nawaz Cheema
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Summary statistics of population and samples taken at different sampling schemes for n = 4, r = 1.

  6. d

    Data from: SSP: An R package to estimate sampling effort in studies of...

    • search.dataone.org
    • data.niaid.nih.gov
    • +2more
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Edlin Guerra-Castro; Juan Carlos Cajas; Nuno Simoes; Juan Jose Cruz-Motta; Maite Mascaro (2025). SSP: An R package to estimate sampling effort in studies of ecological communities [Dataset]. http://doi.org/10.5061/dryad.3bk3j9kj5
    Explore at:
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Edlin Guerra-Castro; Juan Carlos Cajas; Nuno Simoes; Juan Jose Cruz-Motta; Maite Mascaro
    Time period covered
    Jan 1, 2021
    Description

    SSP (simulation-based sampling protocol) is an R package that uses simulations of ecological data and dissimilarity-based multivariate standard error (MultSE) as an estimator of precision to evaluate the adequacy of different sampling efforts for studies that will test hypothesis using permutational multivariate analysis of variance. The procedure consists in simulating several extensive data matrixes that mimic some of the relevant ecological features of the community of interest using a pilot data set. For each simulated data, several sampling efforts are repeatedly executed and MultSE calculated. The mean value, 0.025 and 0.975 quantiles of MultSE for each sampling effort across all simulated data are then estimated and standardized regarding the lowest sampling effort. The optimal sampling effort is identified as that in which the increase in sampling effort does not improve the highest MultSE beyond a threshold value (e.g. 2.5 %). The performance of SSP was validated using real dat...

  7. Lead Sampling in Two Cities

    • catalog.data.gov
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2025). Lead Sampling in Two Cities [Dataset]. https://catalog.data.gov/dataset/lead-sampling-in-two-cities
    Explore at:
    Dataset updated
    Sep 22, 2025
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    Lead concentrations in drinking water samples collected under various sampling protocols in homes with lead service lines and in homes without lead service lines in two US cities. This dataset is associated with the following publication: Lytle, D., M. Urbanic, A. Paul, R. Achtemeier, A. Lewis, S. Hammaker, A. Estep, M. Nadagouda, R. James, and S. Triantafyllidou. Alternative approaches to lead sampling in drinking water: A comparative study of homes with and without lead service lines in two cities. WATER RESEARCH. Elsevier Science Ltd, New York, NY, USA, 994: 180063, (2025).

  8. d

    Scripts from: Performance of generalized distance sampling models with...

    • search.dataone.org
    • datadryad.org
    Updated Oct 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Julia Barczyk; Marc Kéry; Grzegorz Neubauer; Kenneth F. Kellner; Malcolm C. K. Soh; Jaume A. Badia-Boher (2025). Scripts from: Performance of generalized distance sampling models with temporary emigration: a simulation study [Dataset]. http://doi.org/10.5061/dryad.j0zpc86tr
    Explore at:
    Dataset updated
    Oct 4, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Julia Barczyk; Marc Kéry; Grzegorz Neubauer; Kenneth F. Kellner; Malcolm C. K. Soh; Jaume A. Badia-Boher
    Description

    Generalized distance sampling (GDS) models are the distance sampling equivalent of temporary emigration N-mixture models. In addition to density and the perceptibility component of detection, both contain an additional parameter for availability for detection which becomes estimable when data from repeated 'visits' are available. GDS models thus account for open populations. This makes them more robust, since natural populations are hardly ever perfectly closed, arguably even over the course of a single breeding season. However, the performance of these models has not been tested thoroughly, and prior (unpublished) analyses suggested that biased estimates, especially for density (high) and availability (low), may typically occur under certain conditions. We conducted three simulation studies and found that bias arises in low-information scenarios, particularly with low sample sizes and low parameter values. Our simulations enable us to determine "estimation frontiers", which separate sa..., , # Title of Dataset: Performance of generalized distance sampling models with temporary emigration: a simulation study

    Description of the data

    The study was not based on real data. All data used in the study were generated using simulation code.

    Code/Software

    The dataset contains four R files with simulation codes:

    1. Code_1_simGDS_function.R- R code with data simulation function;
    2. Code_2_gds_Sim1.R - R code to perform Simulation 1 with varying number of sites (20–500) and of surveys (2–10);
    3. Code_3_gds_Sim2.R - R code to perform Simulation 2 with varying number of sites (20–500), surveys (2–10), density λ (0.01-2 individuals per hectare), availability ϕ (0.01-1), and the parameter that governs the decline of the detection function over distance σ (20-200 meters);
    4. Code_4_gds_Sim3.R - R code to perform Simulation 3 with effects of three continuous covariates on all the three parameters (λ,ϕ,σ).

    First, run Code_1. The other codes are independent, but the first simul...,

  9. d

    Data from: Accumulated wastewater calculations for smallmouth bass sampling...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Accumulated wastewater calculations for smallmouth bass sampling sites in the Shenandoah River Watershed, USA [Dataset]. https://catalog.data.gov/dataset/accumulated-wastewater-calculations-for-smallmouth-bass-sampling-sites-in-the-shenandoah-r
    Explore at:
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Shenandoah River, United States
    Description

    This data release presents calculated accumulated wastewater (ACCWW, as a percent of total streamflow) values for 43 National Hydrologic Dataset Version 2.1 (NHDPlus V2.1) stream segments coinciding with long-term smallmouth bass sampling locations (Table 1) in the Shenandoah River Watershed (encompassing parts of Virginia and West Virginia, USA). Values are calculated for quarter-year (Quarter 1 [Q1], January - March; Quarter 2 [Q2], April - June; Quarter 3 [Q3], July-September; Quarter 4 [Q4], October-December) time scales (Table 2) and annual time scales (Table 3) for years 2000 to 2018. Estimates at a stream segment represent the combined total upstream wastewater discharges as well as direct discharges into the stream segment. Any users of these data should review the entire metadata record and the associated manuscript (see Larger Work Citation). See 'Distribution Liability' statements for more information.

  10. Ex-R Study Urine Data

    • catalog.data.gov
    Updated Jan 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2022). Ex-R Study Urine Data [Dataset]. https://catalog.data.gov/dataset/ex-r-study-urine-data
    Explore at:
    Dataset updated
    Jan 24, 2022
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    Emory University (analyzed the urine samples for pyrethroid metabolites). This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Contact Researcher. Format: Pyrethroid metabolite concentration data for 50 adults over six-weeks. This dataset is associated with the following publication: Morgan , M., J. Sobus , D.B. Barr, C. Croghan , F. Chen , R. Walker, L. Alston, E. Andersen, and M. Clifton. Temporal variability of pyrethroid metabolite levels in bedtime, morning, and 24-hr urine samples for 50 adults in North Carolina. ENVIRONMENT INTERNATIONAL. Elsevier Science Ltd, New York, NY, USA, 144: 81-91, (2015).

  11. f

    R-squares (in bold, above diagonal) and *sample sizes (n) and p-values...

    • datasetcatalog.nlm.nih.gov
    Updated Aug 31, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federico, Paula; Westbrook, John K.; Kunz, Thomas H.; Brown, Veronica A.; McCracken, Gary F.; Eldridge, Melanie (2012). R-squares (in bold, above diagonal) and *sample sizes (n) and p-values (below diagonal) between temporal patterns of moth abundance at each sampling site. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001142574
    Explore at:
    Dataset updated
    Aug 31, 2012
    Authors
    Federico, Paula; Westbrook, John K.; Kunz, Thomas H.; Brown, Veronica A.; McCracken, Gary F.; Eldridge, Melanie
    Description

    *n is the number of days in which samples were collected at each site on the same day.

  12. d

    R/V BELLOWS 95-04 surface samples

    • catalog.data.gov
    Updated Sep 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). R/V BELLOWS 95-04 surface samples [Dataset]. https://catalog.data.gov/dataset/r-v-bellows-95-04-surface-samples
    Explore at:
    Dataset updated
    Sep 12, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    The U.S. Geological Survey, in cooperation with the University of South Florida and Eckerd College, completed a bathymetric, sidescan sonar, high-resolution seismic-reflection, and surface sediment sampling survey of the inner shelf environment along the western Florida coast. The survey area extends 15km from Sarasota Point to Buttonwood Harbor. This study is part of a larger program initiated by the U.S. Geological Survey to map the geologic framework and monitor the modern processes that affect the western Florida coastal zone. This portion of the project included a reconnaissance high-resolution seismic and side-scan sonar surveys of the entire study area, detailed mapping to identify patterns of hard grounds and sediment cover, and coring of sediments to document historical development of the inner shelf and coastal system.

  13. d

    Data from: The program STRUCTURE does not reliably recover the correct...

    • datadryad.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Jan 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sébastien J. Puechmaille (2016). The program STRUCTURE does not reliably recover the correct population structure when sampling is uneven: sub-sampling and new estimators alleviate the problem [Dataset]. http://doi.org/10.5061/dryad.2d4m9
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 19, 2016
    Dataset provided by
    Dryad
    Authors
    Sébastien J. Puechmaille
    Time period covered
    Dec 13, 2015
    Area covered
    Africa, Asia, America, Europe, Oceania
    Description

    Inferences of population structure and more precisely the identification of genetically homogeneous groups of individuals are essential to the fields of ecology, evolutionary biology, and conservation biology. Such population structure inferences are routinely investigated via the program STRUCTURE implementing a Bayesian algorithm to identify groups of individuals at Hardy-Weinberg and linkage equilibrium. While the method is performing relatively well under various population models with even sampling between subpopulations, the robustness of the method to uneven sample size between subpopulations and/or hierarchical levels of population structure has not yet been tested despite being commonly encountered in empirical datasets. In this study, I used simulated and empirical microsatellite datasets to investigate the impact of uneven sample size between subpopulations and/or hierarchical levels of population structure on the detected population structure. The results demonstrated that u...

  14. d

    Fish Sampling Log data collected during NOAA R/V Townsend Cromwell cruises...

    • catalog.data.gov
    • fisheries.noaa.gov
    Updated Jan 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact, Custodian) (2025). Fish Sampling Log data collected during NOAA R/V Townsend Cromwell cruises between 1982 and 1998 and NOAA R/V Oscar E Sette cruises in 2007 and 2009 in the Central and Western Pacific [Dataset]. https://catalog.data.gov/dataset/fish-sampling-log-data-collected-during-noaa-r-v-townsend-cromwell-cruises-between-1982-and-1992
    Explore at:
    Dataset updated
    Jan 24, 2025
    Dataset provided by
    (Point of Contact, Custodian)
    Description

    FIsh caught on NOAA R/V Townsend Cromwell cruises from 1982 to 1998 and NOAA R/V Oscar E Sette in 2007 and 2009 were measured and/or weighed and sex determination was conducted. Specimen samples were also preserved from selected fishes.

  15. w

    Synthetic Data for an Imaginary Country, Sample, 2023 - World

    • microdata.worldbank.org
    • nada-demo.ihsn.org
    Updated Jul 7, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Development Data Group, Data Analytics Unit (2023). Synthetic Data for an Imaginary Country, Sample, 2023 - World [Dataset]. https://microdata.worldbank.org/index.php/catalog/5906
    Explore at:
    Dataset updated
    Jul 7, 2023
    Dataset authored and provided by
    Development Data Group, Data Analytics Unit
    Time period covered
    2023
    Area covered
    World
    Description

    Abstract

    The dataset is a relational dataset of 8,000 households households, representing a sample of the population of an imaginary middle-income country. The dataset contains two data files: one with variables at the household level, the other one with variables at the individual level. It includes variables that are typically collected in population censuses (demography, education, occupation, dwelling characteristics, fertility, mortality, and migration) and in household surveys (household expenditure, anthropometric data for children, assets ownership). The data only includes ordinary households (no community households). The dataset was created using REaLTabFormer, a model that leverages deep learning methods. The dataset was created for the purpose of training and simulation and is not intended to be representative of any specific country.

    The full-population dataset (with about 10 million individuals) is also distributed as open data.

    Geographic coverage

    The dataset is a synthetic dataset for an imaginary country. It was created to represent the population of this country by province (equivalent to admin1) and by urban/rural areas of residence.

    Analysis unit

    Household, Individual

    Universe

    The dataset is a fully-synthetic dataset representative of the resident population of ordinary households for an imaginary middle-income country.

    Kind of data

    ssd

    Sampling procedure

    The sample size was set to 8,000 households. The fixed number of households to be selected from each enumeration area was set to 25. In a first stage, the number of enumeration areas to be selected in each stratum was calculated, proportional to the size of each stratum (stratification by geo_1 and urban/rural). Then 25 households were randomly selected within each enumeration area. The R script used to draw the sample is provided as an external resource.

    Mode of data collection

    other

    Research instrument

    The dataset is a synthetic dataset. Although the variables it contains are variables typically collected from sample surveys or population censuses, no questionnaire is available for this dataset. A "fake" questionnaire was however created for the sample dataset extracted from this dataset, to be used as training material.

    Cleaning operations

    The synthetic data generation process included a set of "validators" (consistency checks, based on which synthetic observation were assessed and rejected/replaced when needed). Also, some post-processing was applied to the data to result in the distributed data files.

    Response rate

    This is a synthetic dataset; the "response rate" is 100%.

  16. Default sim_abundance function call, with descriptions, default values and...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray (2023). Default sim_abundance function call, with descriptions, default values and associated parameter symbols of key arguments. [Dataset]. http://doi.org/10.1371/journal.pone.0232822.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Default sim_abundance function call, with descriptions, default values and associated parameter symbols of key arguments.

  17. f

    Appendix A. Sampling localities, equilibrium simulations, and simulations...

    • datasetcatalog.nlm.nih.gov
    • wiley.figshare.com
    Updated Aug 5, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pizzatto, Lígia; Barton, Di; Shine, Richard; Kelehear, Crystal; Phillips, Ben L.; Brown, Gregory P. (2016). Appendix A. Sampling localities, equilibrium simulations, and simulations with varying r. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001609953
    Explore at:
    Dataset updated
    Aug 5, 2016
    Authors
    Pizzatto, Lígia; Barton, Di; Shine, Richard; Kelehear, Crystal; Phillips, Ben L.; Brown, Gregory P.
    Description

    Sampling localities, equilibrium simulations, and simulations with varying r.

  18. Additional file 3: of Aiming for a representative sample: Simulating random...

    • figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Loan van Hoeven; Mart Janssen; Kit Roes; Hendrik Koffijberg (2023). Additional file 3: of Aiming for a representative sample: Simulating random versus purposive strategies for hospital selection [Dataset]. http://doi.org/10.6084/m9.figshare.c.3624569_D2.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Loan van Hoeven; Mart Janssen; Kit Roes; Hendrik Koffijberg
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    R code for simulating sampling strategies. Description: R code that creates an exemplary data set and simulates the sampling strategies. (R 26Â kb)

  19. H

    Comparing Groundwater Sampling Devices for Denitrification Assessment using...

    • hydroshare.org
    zip
    Updated Nov 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Felix Fahrenbach; Thomas R. Rüde (2025). Comparing Groundwater Sampling Devices for Denitrification Assessment using the N2/Ar Method [Dataset]. https://www.hydroshare.org/resource/42aec34687374fbbafa8e1b4ad907940
    Explore at:
    zip(66.4 KB)Available download formats
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    HydroShare
    Authors
    Felix Fahrenbach; Thomas R. Rüde
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset comprises analysis results (major ions, N2, and Ar concentrations) from groundwater samples taken in the Lower Rhine Embayment (Germany) to assess the impact of sampling methods on N2, Ar, and excess-N2 concentrations. The data is used in the manuscript "Comparing Groundwater Sampling Devices for Denitrification Assessment using the N2/Ar Method" by Felix Fahrenbach and Thomas R. Rüde, which is currently undergoing review by Groundwater.

    The libraries tidyverse (Wickham et al. 2019), psych (Revelle 2014), car (Fox and Weisberg 2019), rstatix (Kassambara 2023), and PMCMRplus (Pohlert 2024) need to be installed to run the R scripts. Running the Python scripts requires the following packages: numpy (Harris et al. 2020), pandas (McKinney 2010), scipy (Virtanen et al. 2020), statsmodels (Seabold and Perktold 2010), and matplotlib (Hunter 2007).

    References Fox, J., and S. Weisberg. 2019. An R Companion to Applied Regression. 3rd ed. Thousand Oaks CA: Sage, https://www.john-fox.ca/Companion/. Harris, C. R., K. J. Millman, S. J. van der Walt, R. Gommers, P. Virtanen, D. Cournapeau, E. Wieser, et al. 2020. Array programming with NumPy. Nature 585, no. 7825: 357–62, https://doi.org/10.1038/s41586-020-2649-2. Hunter, J. D. 2007. Matplotlib: A 2D graphics environment. Computing in Science & Engineering 9, no. 3: 90–95, https://doi.org/10.1109/MCSE.2007.55. Kassambara, A. 2023. rstatix: Pipe-Friendly Framework for Basic Statistical Tests, https://rpkgs.datanovia.com/rstatix/. McKinney, W. 2010. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, edited by S. van der Walt and J. Millman, 56–61, https://doi.org/10.25080/Majora-92bf1922-00a. Pohlert, T. 2024. PMCMRplus: Calculate Pairwise Multiple Comparisons of Mean Rank Sums Extended, https://CRAN.R-project.org/package=PMCMRplus. Revelle, W. 2014. psych: Procedures for Psychological, Psychometric, and Personality Research. Evanston, Illinois: Northwestern University, https://CRAN.R-project.org/package=psych. Seabold, S., and J. Perktold. 2010. statsmodels: Econometric and statistical modeling with python. In Proceedings of the 9th Python in Science Conference. Virtanen, P., R. Gommers, T. E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, et al. 2020. SciPy 1.0: Fundamental algorithms for scientific computing in python. Nature Methods 17: 261–72, https://doi.org/10.1038/s41592-019-0686-2. Wickham, H., M. Averick, J. Bryan, W. Chang, L. McGowan, R. François, G. Grolemund, et al. 2019. Welcome to the Tidyverse. Journal of Open Source Software 4, no. 43: 1686, https://doi.org/10.21105/joss.01686.

  20. d

    Scientific sampling event log from the WB1105 cruise from R/V Weatherbird II...

    • search.dataone.org
    • bco-dmo.org
    Updated Dec 5, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Joseph J. Torres (2021). Scientific sampling event log from the WB1105 cruise from R/V Weatherbird II (WB1105) (DWH Micronekton project) [Dataset]. https://search.dataone.org/view/sha256%3A53b5c5ad3e90d771672fe611ba71a04d7f624fd856d103813860bb7dd0c02010
    Explore at:
    Dataset updated
    Dec 5, 2021
    Dataset provided by
    Biological and Chemical Oceanography Data Management Office (BCO-DMO)
    Authors
    Dr Joseph J. Torres
    Time period covered
    Sep 4, 2012 - Sep 9, 2012
    Area covered
    Description

    The science party maintained a sampling event log, recording all instrument deployments and significant events during the 2010 RAPID_I cruise aboard the R/V WEATHERBIRD II (WB1105). Refer to comments column for additional information.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
sciencestoked (2023). Road-R Dataset Sample [Dataset]. https://www.kaggle.com/datasets/sciencestoked/road-r-dataset-sample
Organization logo

Road-R Dataset Sample

Explore at:
zip(186910332 bytes)Available download formats
Dataset updated
Aug 17, 2023
Authors
sciencestoked
Description

Dataset

This dataset was created by sciencestoked

Contents

Search
Clear search
Close search
Google apps
Main menu