100+ datasets found

Road-R Dataset Sample
kaggle.com
zip
Updated Aug 17, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
sciencestoked (2023). Road-R Dataset Sample [Dataset]. https://www.kaggle.com/datasets/sciencestoked/road-r-dataset-sample
Explore at:
zip(186910332 bytes)Available download formats
Dataset updated
Aug 17, 2023
Authors
sciencestoked
Description
Dataset

This dataset was created by sciencestoked

Contents
R script and input data for "ALL-EMA sampling design"
envidat.ch
.r, .txt +1
Updated May 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Klaus Ecker; Yves Tillé (2025). R script and input data for "ALL-EMA sampling design" [Dataset]. http://doi.org/10.16904/envidat.402
Explore at:
not available, .txt, .rAvailable download formats
Unique identifier
https://doi.org/10.16904/envidat.402
Dataset updated
May 28, 2025
Dataset provided by
Swiss Federal Institute for Forest, Snow and Landscape Research
Institute of Statistics, University of Neuchatel
Authors
Klaus Ecker; Yves Tillé
Area covered
Switzerland
Dataset funded by
FOEN
Description
License: GPL-v2 The R script presents an advanced sampling approach for monitoring biodiversity on agricultural land by combining multiple objectives and integrating environmental and geographic space. The example demonstrates the first-stage selection of squares (km2) in the ALL-EMA sampling design using modern sampling techniques such as unequal probability sampling with fixed sample size, balanced sampling, stratified balancing and geographic spreading. Sampling is done with unequal probabilities and weights defined by power allocation to give equal weight to extrapolations to the total agricultural area of Switzerland and two stratifications of predefined interest (regions and agricultural production zones). Calibration is used to limit the distribution of the sampling weights. The sample sizes are almost fixed within the strata and evenly distributed across the years of a temporal rotation plan, which is favourable for the organisation of the field survey. Sampling also ensures an optimal (annual) distribution across geographic space, including altitude. Despite the complexity of the sampling, estimation based on probability theory is straightforward. Ecker, K.T., Meier, E.S. & Tillé, Y. 2023. Integrating spatial and ecological information into comprehensive biodiversity monitoring on agricultural land. Environmental Monitoring and Assessment 195.
random-points-sampling-r
kaggle.com
zip
Updated Nov 18, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
JORGE GARCIA-INIGUEZ (2023). random-points-sampling-r [Dataset]. https://www.kaggle.com/datasets/jorgegarciainiguez/random-points-sampling-r
Explore at:
zip(712850 bytes)Available download formats
Dataset updated
Nov 18, 2023
Authors
JORGE GARCIA-INIGUEZ
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by JORGE GARCIA-INIGUEZ

Released under MIT

Contents
Summary statistics of population and samples taken at different sampling...
plos.figshare.com
xls
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Maria M; Ibrahim M. Almanjahie; Muhammad Ismail; Ammara Nawaz Cheema (2023). Summary statistics of population and samples taken at different sampling schemes for n = 4, r = 1. [Dataset]. http://doi.org/10.1371/journal.pone.0275340.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0275340.t001
Dataset updated
Jun 3, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Maria M; Ibrahim M. Almanjahie; Muhammad Ismail; Ammara Nawaz Cheema
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Summary statistics of population and samples taken at different sampling schemes for n = 4, r = 1.
q
Chapter 2: Data sampling, accuracy, and precision
qubeshub.org
Updated Dec 23, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Raisa Hernández-Pacheco; Alexis Diaz (2020). Chapter 2: Data sampling, accuracy, and precision [Dataset]. http://doi.org/10.25334/66G4-8G08
Explore at:
Unique identifier
https://doi.org/10.25334/66G4-8G08
Dataset updated
Dec 23, 2020
Dataset provided by
QUBES
Authors
Raisa Hernández-Pacheco; Alexis Diaz
Description
Biostatistics Using R: A Laboratory Manual was created with the goals of providing biological content to lab sessions by using authentic research data and introducing R programming language. Chapter 2 introduces sampling, accuracy, and precision.
d
Data from: SSP: An R package to estimate sampling effort in studies of...
search.dataone.org
data.niaid.nih.gov
+2more
Updated Apr 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Edlin Guerra-Castro; Juan Carlos Cajas; Nuno Simoes; Juan Jose Cruz-Motta; Maite Mascaro (2025). SSP: An R package to estimate sampling effort in studies of ecological communities [Dataset]. http://doi.org/10.5061/dryad.3bk3j9kj5
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.3bk3j9kj5
Dataset updated
Apr 24, 2025
Dataset provided by
Dryad Digital Repository
Authors
Edlin Guerra-Castro; Juan Carlos Cajas; Nuno Simoes; Juan Jose Cruz-Motta; Maite Mascaro
Time period covered
Jan 1, 2021
Description
SSP (simulation-based sampling protocol) is an R package that uses simulations of ecological data and dissimilarity-based multivariate standard error (MultSE) as an estimator of precision to evaluate the adequacy of different sampling efforts for studies that will test hypothesis using permutational multivariate analysis of variance. The procedure consists in simulating several extensive data matrixes that mimic some of the relevant ecological features of the community of interest using a pilot data set. For each simulated data, several sampling efforts are repeatedly executed and MultSE calculated. The mean value, 0.025 and 0.975 quantiles of MultSE for each sampling effort across all simulated data are then estimated and standardized regarding the lowest sampling effort. The optimal sampling effort is identified as that in which the increase in sampling effort does not improve the highest MultSE beyond a threshold value (e.g. 2.5 %). The performance of SSP was validated using real dat...
d
Scripts from: Performance of generalized distance sampling models with...
search.dataone.org
datadryad.org
Updated Oct 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Julia Barczyk; Marc KÃ©ry; Grzegorz Neubauer; Kenneth F. Kellner; Malcolm C. K. Soh; Jaume A. Badia-Boher (2025). Scripts from: Performance of generalized distance sampling models with temporary emigration: a simulation study [Dataset]. http://doi.org/10.5061/dryad.j0zpc86tr
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.j0zpc86tr
Dataset updated
Oct 4, 2025
Dataset provided by
Dryad Digital Repository
Authors
Julia Barczyk; Marc KÃ©ry; Grzegorz Neubauer; Kenneth F. Kellner; Malcolm C. K. Soh; Jaume A. Badia-Boher
Description
Generalized distance sampling (GDS) models are the distance sampling equivalent of temporary emigration N-mixture models. In addition to density and the perceptibility component of detection, both contain an additional parameter for availability for detection which becomes estimable when data from repeated 'visits' are available. GDS models thus account for open populations. This makes them more robust, since natural populations are hardly ever perfectly closed, arguably even over the course of a single breeding season. However, the performance of these models has not been tested thoroughly, and prior (unpublished) analyses suggested that biased estimates, especially for density (high) and availability (low), may typically occur under certain conditions. We conducted three simulation studies and found that bias arises in low-information scenarios, particularly with low sample sizes and low parameter values. Our simulations enable us to determine "estimation frontiers", which separate sa..., , # Title of Dataset: Performance of generalized distance sampling models with temporary emigration: a simulation study

Description of the data

The study was not based on real data. All data used in the study were generated using simulation code.

Code/Software

The dataset contains four R files with simulation codes:

Code_1_simGDS_function.R- R code with data simulation function;

Code_2_gds_Sim1.R - R code to perform Simulation 1 with varying number of sites (20â€“500) and of surveys (2â€“10);

Code_3_gds_Sim2.R - R code to perform Simulation 2 with varying number of sites (20â€“500), surveys (2â€“10), density Î» (0.01-2 individuals per hectare), availability Ï• (0.01-1), and the parameter that governs the decline of the detection function over distance Ïƒ (20-200 meters);

Code_4_gds_Sim3.R - R code to perform Simulation 3 with effects of three continuous covariates on all the three parameters (Î»,Ï•,Ïƒ).

First, run Code_1. The other codes are independent, but the first simul...,
Default sim_abundance function call, with descriptions, default values and...
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray (2023). Default sim_abundance function call, with descriptions, default values and associated parameter symbols of key arguments. [Dataset]. http://doi.org/10.1371/journal.pone.0232822.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0232822.t002
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Default sim_abundance function call, with descriptions, default values and associated parameter symbols of key arguments.
H
Comparing Groundwater Sampling Devices for Denitrification Assessment using...
hydroshare.org
zip
Updated Nov 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Felix Fahrenbach; Thomas R. Rüde (2025). Comparing Groundwater Sampling Devices for Denitrification Assessment using the N2/Ar Method [Dataset]. https://www.hydroshare.org/resource/42aec34687374fbbafa8e1b4ad907940
Explore at:
zip(66.4 KB)Available download formats
Dataset updated
Nov 21, 2025
Dataset provided by
HydroShare
Authors
Felix Fahrenbach; Thomas R. Rüde
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The dataset comprises analysis results (major ions, N2, and Ar concentrations) from groundwater samples taken in the Lower Rhine Embayment (Germany) to assess the impact of sampling methods on N2, Ar, and excess-N2 concentrations. The data is used in the manuscript "Comparing Groundwater Sampling Devices for Denitrification Assessment using the N2/Ar Method" by Felix Fahrenbach and Thomas R. Rüde, which is currently undergoing review by Groundwater.

The libraries tidyverse (Wickham et al. 2019), psych (Revelle 2014), car (Fox and Weisberg 2019), rstatix (Kassambara 2023), and PMCMRplus (Pohlert 2024) need to be installed to run the R scripts. Running the Python scripts requires the following packages: numpy (Harris et al. 2020), pandas (McKinney 2010), scipy (Virtanen et al. 2020), statsmodels (Seabold and Perktold 2010), and matplotlib (Hunter 2007).

References Fox, J., and S. Weisberg. 2019. An R Companion to Applied Regression. 3rd ed. Thousand Oaks CA: Sage, https://www.john-fox.ca/Companion/. Harris, C. R., K. J. Millman, S. J. van der Walt, R. Gommers, P. Virtanen, D. Cournapeau, E. Wieser, et al. 2020. Array programming with NumPy. Nature 585, no. 7825: 357–62, https://doi.org/10.1038/s41586-020-2649-2. Hunter, J. D. 2007. Matplotlib: A 2D graphics environment. Computing in Science & Engineering 9, no. 3: 90–95, https://doi.org/10.1109/MCSE.2007.55. Kassambara, A. 2023. rstatix: Pipe-Friendly Framework for Basic Statistical Tests, https://rpkgs.datanovia.com/rstatix/. McKinney, W. 2010. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, edited by S. van der Walt and J. Millman, 56–61, https://doi.org/10.25080/Majora-92bf1922-00a. Pohlert, T. 2024. PMCMRplus: Calculate Pairwise Multiple Comparisons of Mean Rank Sums Extended, https://CRAN.R-project.org/package=PMCMRplus. Revelle, W. 2014. psych: Procedures for Psychological, Psychometric, and Personality Research. Evanston, Illinois: Northwestern University, https://CRAN.R-project.org/package=psych. Seabold, S., and J. Perktold. 2010. statsmodels: Econometric and statistical modeling with python. In Proceedings of the 9th Python in Science Conference. Virtanen, P., R. Gommers, T. E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, et al. 2020. SciPy 1.0: Fundamental algorithms for scientific computing in python. Nature Methods 17: 261–72, https://doi.org/10.1038/s41592-019-0686-2. Wickham, H., M. Averick, J. Bryan, W. Chang, L. McGowan, R. François, G. Grolemund, et al. 2019. Welcome to the Tidyverse. Journal of Open Source Software 4, no. 43: 1686, https://doi.org/10.21105/joss.01686.
Default sim_distribution function call, with descriptions and associated...
plos.figshare.com
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray (2023). Default sim_distribution function call, with descriptions and associated parameter symbols of key arguments. [Dataset]. http://doi.org/10.1371/journal.pone.0232822.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0232822.t003
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Default sim_distribution function call, with descriptions and associated parameter symbols of key arguments.
Lead Sampling in Two Cities
catalog.data.gov
Updated Sep 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2025). Lead Sampling in Two Cities [Dataset]. https://catalog.data.gov/dataset/lead-sampling-in-two-cities
Explore at:
Dataset updated
Sep 22, 2025
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
Lead concentrations in drinking water samples collected under various sampling protocols in homes with lead service lines and in homes without lead service lines in two US cities. This dataset is associated with the following publication: Lytle, D., M. Urbanic, A. Paul, R. Achtemeier, A. Lewis, S. Hammaker, A. Estep, M. Nadagouda, R. James, and S. Triantafyllidou. Alternative approaches to lead sampling in drinking water: A comparative study of homes with and without lead service lines in two cities. WATER RESEARCH. Elsevier Science Ltd, New York, NY, USA, 994: 180063, (2025).
f
R-squares (in bold, above diagonal) and *sample sizes (n) and p-values...
datasetcatalog.nlm.nih.gov
Updated Aug 31, 2012
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Federico, Paula; Westbrook, John K.; Kunz, Thomas H.; Brown, Veronica A.; McCracken, Gary F.; Eldridge, Melanie (2012). R-squares (in bold, above diagonal) and *sample sizes (n) and p-values (below diagonal) between temporal patterns of moth abundance at each sampling site. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001142574
Explore at:
Dataset updated
Aug 31, 2012
Authors
Federico, Paula; Westbrook, John K.; Kunz, Thomas H.; Brown, Veronica A.; McCracken, Gary F.; Eldridge, Melanie
Description
*n is the number of days in which samples were collected at each site on the same day.
d
Data from: The program STRUCTURE does not reliably recover the correct...
datadryad.org
data.niaid.nih.gov
+1more
zip
Updated Jan 19, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sébastien J. Puechmaille (2016). The program STRUCTURE does not reliably recover the correct population structure when sampling is uneven: sub-sampling and new estimators alleviate the problem [Dataset]. http://doi.org/10.5061/dryad.2d4m9
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.2d4m9
Dataset updated
Jan 19, 2016
Dataset provided by
Dryad
Authors
Sébastien J. Puechmaille
Time period covered
Dec 13, 2015
Area covered
Asia, Africa, Europe, Oceania, America
Description
Inferences of population structure and more precisely the identification of genetically homogeneous groups of individuals are essential to the fields of ecology, evolutionary biology, and conservation biology. Such population structure inferences are routinely investigated via the program STRUCTURE implementing a Bayesian algorithm to identify groups of individuals at Hardy-Weinberg and linkage equilibrium. While the method is performing relatively well under various population models with even sampling between subpopulations, the robustness of the method to uneven sample size between subpopulations and/or hierarchical levels of population structure has not yet been tested despite being commonly encountered in empirical datasets. In this study, I used simulated and empirical microsatellite datasets to investigate the impact of uneven sample size between subpopulations and/or hierarchical levels of population structure on the detected population structure. The results demonstrated that u...
Default sim_survey function call, with descriptions and associated parameter...
plos.figshare.com
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray (2023). Default sim_survey function call, with descriptions and associated parameter symbols of key arguments. [Dataset]. http://doi.org/10.1371/journal.pone.0232822.t004
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0232822.t004
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Paul M. Regular; Gregory J. Robertson; Keith P. Lewis; Jonathan Babyn; Brian Healey; Fran Mowbray
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Default sim_survey function call, with descriptions and associated parameter symbols of key arguments.
d
Fish Sampling Log data collected during NOAA R/V Townsend Cromwell cruises...
catalog.data.gov
fisheries.noaa.gov
Updated Jan 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(Point of Contact, Custodian) (2025). Fish Sampling Log data collected during NOAA R/V Townsend Cromwell cruises between 1982 and 1998 and NOAA R/V Oscar E Sette cruises in 2007 and 2009 in the Central and Western Pacific [Dataset]. https://catalog.data.gov/dataset/fish-sampling-log-data-collected-during-noaa-r-v-townsend-cromwell-cruises-between-1982-and-1992
Explore at:
Dataset updated
Jan 24, 2025
Dataset provided by
(Point of Contact, Custodian)
Description
FIsh caught on NOAA R/V Townsend Cromwell cruises from 1982 to 1998 and NOAA R/V Oscar E Sette in 2007 and 2009 were measured and/or weighed and sex determination was conducted. Specimen samples were also preserved from selected fishes.
d
Scientific sampling event log from R/V Oceanus, R/V New Horizon OC473,...
search.dataone.org
bco-dmo.org
Updated Dec 5, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gareth Lawson (2021). Scientific sampling event log from R/V Oceanus, R/V New Horizon OC473, NH1208 in in the western N. Atlantic and eastern Pacific, 2011-2012 (OAPS project) [Dataset]. https://search.dataone.org/view/sha256%3A4b7a598a2a6944f06adf50ceb6ba5133ab26323a65ed00fa92c6ff0831ef3d41
Explore at:
Dataset updated
Dec 5, 2021
Dataset provided by
Biological and Chemical Oceanography Data Management Office (BCO-DMO)
Authors
Gareth Lawson
Description
This scientific sampling event log was created using an early implementation of the Rolling Deck to Repository (R2R) event log application (ELOG with cruise-specific custom configuration files). The log includes a record of all scientific sampling events from the cruise. In addition to event identification numbers unique for the cruise, the scientific sampling event log includes date and time (GMT), position (latittude and longitude), station and cast identifier as appropriate to the sampling event, sampling instrument name (e.g. CTD, TM, MOC10), name of person responsible for the sampling event, and a comment field to record additional information.
Ex-R Study Urine Data
catalog.data.gov
Updated Jan 24, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2022). Ex-R Study Urine Data [Dataset]. https://catalog.data.gov/dataset/ex-r-study-urine-data
Explore at:
Dataset updated
Jan 24, 2022
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
Emory University (analyzed the urine samples for pyrethroid metabolites). This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Contact Researcher. Format: Pyrethroid metabolite concentration data for 50 adults over six-weeks. This dataset is associated with the following publication: Morgan , M., J. Sobus , D.B. Barr, C. Croghan , F. Chen , R. Walker, L. Alston, E. Andersen, and M. Clifton. Temporal variability of pyrethroid metabolite levels in bedtime, morning, and 24-hr urine samples for 50 adults in North Carolina. ENVIRONMENT INTERNATIONAL. Elsevier Science Ltd, New York, NY, USA, 144: 81-91, (2015).
d
Data from: Accumulated wastewater calculations for smallmouth bass sampling...
catalog.data.gov
data.usgs.gov
+1more
Updated Nov 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2025). Accumulated wastewater calculations for smallmouth bass sampling sites in the Shenandoah River Watershed, USA [Dataset]. https://catalog.data.gov/dataset/accumulated-wastewater-calculations-for-smallmouth-bass-sampling-sites-in-the-shenandoah-r
Explore at:
Dataset updated
Nov 21, 2025
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Shenandoah River, United States
Description
This data release presents calculated accumulated wastewater (ACCWW, as a percent of total streamflow) values for 43 National Hydrologic Dataset Version 2.1 (NHDPlus V2.1) stream segments coinciding with long-term smallmouth bass sampling locations (Table 1) in the Shenandoah River Watershed (encompassing parts of Virginia and West Virginia, USA). Values are calculated for quarter-year (Quarter 1 [Q1], January - March; Quarter 2 [Q2], April - June; Quarter 3 [Q3], July-September; Quarter 4 [Q4], October-December) time scales (Table 2) and annual time scales (Table 3) for years 2000 to 2018. Estimates at a stream segment represent the combined total upstream wastewater discharges as well as direct discharges into the stream segment. Any users of these data should review the entire metadata record and the associated manuscript (see Larger Work Citation). See 'Distribution Liability' statements for more information.
f
Data from: Robust inference under r-size-biased sampling without replacement...
tandf.figshare.com
xlsx
Updated Nov 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
P. Economou; G. Tzavelas; A. Batsidis (2023). Robust inference under r-size-biased sampling without replacement from finite population [Dataset]. http://doi.org/10.6084/m9.figshare.11542974.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.11542974.v1
Dataset updated
Nov 28, 2023
Dataset provided by
Taylor & Francis
Authors
P. Economou; G. Tzavelas; A. Batsidis
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The case of size-biased sampling of known order from a finite population without replacement is considered. The behavior of such a sampling scheme is studied with respect to the sampling fraction. Based on a simulation study, it is concluded that such a sample cannot be treated either as a random sample from the parent distribution or as a random sample from the corresponding r-size weighted distribution and as the sampling fraction increases, the biasness in the sample decreases resulting in a transition from an r-size-biased sample to a random sample. A modified version of a likelihood-free method is adopted for making statistical inference for the unknown population parameters, as well as for the size of the population when it is unknown. A simulation study, which takes under consideration the sampling fraction, demonstrates that the proposed method presents better and more robust behavior compared to the approaches, which treat the r-size-biased sample either as a random sample from the parent distribution or as a random sample from the corresponding r-size weighted distribution. Finally, a numerical example which motivates this study illustrates our results.
d
Data from: A hierarchical distance sampling model to estimate abundance and...
datadryad.org
data.niaid.nih.gov
+1more
zip
Updated Nov 25, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rahel Sollmann; Beth Gardner; Kathryn A. Williams; Andrew T. Gilbert; Richard R. Veit (2016). A hierarchical distance sampling model to estimate abundance and covariate associations of species and communities [Dataset]. http://doi.org/10.5061/dryad.gb905
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.gb905
Dataset updated
Nov 25, 2016
Dataset provided by
Dryad
Authors
Rahel Sollmann; Beth Gardner; Kathryn A. Williams; Andrew T. Gilbert; Richard R. Veit
Time period covered
Nov 23, 2015
Description
Seabird distance sampling dataThis .R file contains all data to repeat the community distance sampling case study on seabird abundance and distribution off the mid-Atlantic coast presented in the associated paper. The data are in the form of a R list object; the R script to read in and analyze the data are part of the Supplement 2, available with the paper. The ReadMe file contains a detailed description of the data.sbdata.R