https://spdx.org/licenses/CC0-1.0.html
Many capture-recapture surveys of wildlife populations operate in continuous time but detections are typically aggregated into occasions for analysis, even when exact detection times are available. This discards information and introduces subjectivity, in the form of decisions about occasion definition. We develop a spatio-temporal Poisson process model for spatially explicit capture-recapture (SECR) surveys that operate continuously and record exact detection times. We show that, except in some special cases (including the case in which detection probability does not change within occasion), temporally aggregated data do not provide sufficient statistics for density and related parameters, and that when detection probability is constant over time our continuous-time (CT) model is equivalent to an existing model based on detection frequencies. We use the model to estimate jaguar density from a camera-trap survey and conduct a simulation study to investigate the properties of a CT estimator and discrete-occasion estimators with various levels of temporal aggregation. This includes investigation of the effect on the estimators of spatio-temporal correlation induced by animal movement. The CT estimator is found to be unbiased and more precise than discrete-occasion estimators based on binary capture data (rather than detection frequencies) when there is no spatio-temporal correlation. It is also found to be only slightly biased when there is correlation induced by animal movement, and to be more robust to inadequate detector spacing, while discrete-occasion estimators with binary data can be sensitive to occasion length, particularly in the presence of inadequate detector spacing. Our model includes as a special case a discrete-occasion estimator based on detection frequencies, and at the same time lays a foundation for the development of more sophisticated CT models and estimators. 
It allows modelling within-occasion changes in detectability, readily accommodates variation in detector effort, removes subjectivity associated with user-defined occasions, and fully utilises CT data. We identify a need for developing CT methods that incorporate spatio-temporal dependence in detections and see potential for CT models being combined with telemetry-based animal movement models to provide a richer inference framework.
https://www.usa.gov/government-works/
The Poisson Process file concerns the solution of an exercise from the fourth module of the Statistics and Applied Data Analysis Specialization course at the University of Colorado Boulder that I took. In these notes, I intend to explain the most important steps.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Each row shows the composition of the feature sets selected by MI given the sensor size constraint; the columns give the percentage of the total selected feature sets with the labelled sensor contents.
This dataset combines discrete and daily-aligned groundwater levels for the Mississippi River Valley alluvial aquifer, clipped to the Mississippi Alluvial Plain as defined by Painter and Westerman (2018), with corresponding metadata. It is based on processing of U.S. Geological Survey National Water Information System (NWIS) (U.S. Geological Survey, 2020) data. Processing was done after retrieval, using aggregation and filtering through the infoGW2visGWDB software (Asquith and Seanor, 2019). The nomenclature GWmaster mimics that of the output from infoGW2visGWDB. Two separate data retrievals from NWIS were made. First, the discrete data were retrieved, and second, continuous records from recorder sites with daily-mean or other daily statistics codes were retrieved. Each dataset was separately passed through the infoGW2visGWDB software to create a "GWmaster discrete" and a "GWmaster continuous" table, and these tables were combined and then sorted on the site identifier and date to form the data products described herein. A sweep through the combined dataset (the "database") was made to isolate duplicate observations, that is, observations for the same well on the same day. If a discrete value was present, it was retained as authoritative for the day; otherwise, values were retained in descending order of priority: daily mean, daily maximum, and daily minimum. Therefore, only a single record for each well and day is present in the dataset. The duplicate search removed 876 records and 31 wells were involved; in total, this is about 0.3 percent of the database.

References:
Asquith, W.H., Seanor, R.C., 2019, infoGW2visGWDB—An R groundwater data-processing utility for manipulating, checking the veracity, and converting an "infoGW" object to the "GWmaster" object for the visGWDB software with demonstration for the Mississippi River Valley alluvial aquifer: U.S. Geological Survey software release, Reston, Va., https://doi.org/10.5066/P9MK0B6L.
Painter, J.A., and Westerman, D.A., 2018, Mississippi Alluvial Plain extent, November 2017: U.S. Geological Survey data release, https://doi.org/10.5066/F70R9NMJ.
U.S. Geological Survey, 2020, USGS water data for the Nation: U.S. Geological Survey National Water Information System database, accessed April 2, 2020, at https://doi.org/10.5066/F7P55KJN.
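The duplicate rule described above (a discrete value wins, then daily mean, daily maximum, daily minimum) can be illustrated with a short sketch. This is not the infoGW2visGWDB implementation, which is written in R; the field names `site`, `date`, `kind`, and `level`, and the `kind` labels, are hypothetical and chosen only for illustration.

```python
# Hypothetical priority table: lower number = more authoritative for a given day.
PRIORITY = {"discrete": 0, "daily_mean": 1, "daily_max": 2, "daily_min": 3}


def keep_one_per_well_day(records):
    """Keep a single record per (site, date), preferring the highest-priority kind.

    `records` is an iterable of dicts with the hypothetical keys
    'site', 'date' (ISO string), 'kind', and 'level'.
    """
    best = {}
    for rec in records:
        key = (rec["site"], rec["date"])
        current = best.get(key)
        if current is None or PRIORITY[rec["kind"]] < PRIORITY[current["kind"]]:
            best[key] = rec
    # Sort on site identifier and date, as the combined table is described to be.
    return sorted(best.values(), key=lambda r: (r["site"], r["date"]))


if __name__ == "__main__":
    sample = [
        {"site": "W1", "date": "2020-01-01", "kind": "daily_mean", "level": 10.0},
        {"site": "W1", "date": "2020-01-01", "kind": "discrete", "level": 10.2},
    ]
    for row in keep_one_per_well_day(sample):
        print(row)
```

Keeping the priority order in a lookup table makes the rule easy to audit and to change if a different ordering of daily statistics is wanted.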
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In applications such as clinical safety analysis, the data of the experiments usually consist of frequency counts. In the analysis of such data, researchers often face the problem of multiple testing based on discrete test statistics, aimed at controlling the family-wise error rate (FWER). Most existing FWER controlling procedures are developed for continuous data and are often conservative when applied to discrete data. By using minimal attainable p-values, several FWER controlling procedures have been specifically developed for discrete data in the literature. In this article, by using known marginal distributions of true null p-values, three more powerful stepwise procedures are developed, which are modified versions of the conventional Bonferroni, Holm and Hochberg procedures, respectively. It is shown that the first two procedures strongly control the FWER under arbitrary dependence and are more powerful than the existing Tarone-type procedures, while the last one only ensures control of the FWER in special settings. Through extensive simulation studies, we provide numerical evidence of superior performance of the proposed procedures in terms of the FWER control and minimal power. Real clinical safety data are used to demonstrate applications of our proposed procedures. An R package “MHTdiscrete” and a web application are developed for implementing the proposed procedures.
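For orientation, the conventional Holm step-down procedure that the article takes as a starting point can be sketched as follows. This is the standard textbook version for arbitrary p-values, not the modified discrete-data procedure proposed in the article and not the API of the MHTdiscrete package; the function name `holm_reject` is an illustrative choice.

```python
def holm_reject(p_values, alpha=0.05):
    """Conventional Holm step-down procedure.

    Returns a list of booleans (same order as `p_values`) marking which
    hypotheses are rejected while controlling the FWER at level `alpha`.
    """
    m = len(p_values)
    # Indices of the hypotheses, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The (rank + 1)-th smallest p-value is compared with alpha / (m - rank).
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break  # step-down: stop at the first non-rejection
    return reject


if __name__ == "__main__":
    print(holm_reject([0.01, 0.04, 0.03, 0.005], alpha=0.05))
```

With discrete test statistics, the thresholds above tend to be conservative, which is the motivation the abstract gives for procedures that exploit the known null distributions of the p-values.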
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Electroencephalogram (EEG) is used to monitor a child's brain during coma by recording data on the electrical neural activity of the brain. Signals are captured by multiple electrodes, called channels, located over the scalp. Statistical analyses of EEG data include classification and prediction using arrays of EEG features, but few models for the underlying stochastic processes have been proposed. For this purpose, a new strictly stationary strong mixing diffusion model with a marginal multimodal (three-peak) distribution (MixGGDiff) and an exponentially decaying autocorrelation function was proposed for modeling increments of EEG data. The increments were treated as discrete-time observations, and a diffusion process was constructed whose stationary distribution is viewed as a mixture of three non-central generalized Gaussian distributions (MixGGD).

The probability density function of a mixed generalized Gaussian distribution (MixGGD) consists of three components and is described using a total of 12 parameters: $\mu_k$, the location parameter of each component; $s_k$, the shape parameter of each component; $\sigma^2_k$, a parameter related to the scale of each component; and $w_k$, the weight of each component, where $k \in \{1, 2, 3\}$ is the index of the component of the MixGGD. The parameters of this distribution were estimated using the expectation-maximization algorithm, where the added shape parameter is estimated using a higher-order statistics approach based on an analytical relationship between the shape parameter and kurtosis.

To illustrate an application of the MixGGDiff to real data, an analysis was performed of EEG data collected in Uganda between 2008 and 2015 from 78 children, aged 18 months to 12 years, who were in coma due to cerebral malaria. EEG were recorded using the International 10–20 system with a sampling rate of 500 Hz and an average record duration of 30 min. The EEG signal for every child was the result of a recording from 19 channels. MixGGD was fitted to each channel of every child's recording separately, hence for each channel a total of 12 parameter estimates were obtained.

The data are presented in matrix form (dimension 79 × 228) in .csv format and consist of 79 rows, where the first row is a header row containing the names of the variables and each of the subsequent 78 rows contains the parameter estimates of one instance (i.e. one child, without identifiers that could be related back to a specific child). There are a total of 228 columns (19 channels times 12 parameter estimates), where each column represents one parameter estimate of one component of the MixGGD, in the order of the channels. Thus columns 1 to 12 refer to parameter estimates on the first channel, columns 13 to 24 refer to parameter estimates on the second channel, and so on. Each variable name starts with "chi", where "ch" is an abbreviation of "channel" and i refers to the order of the channel in the EEG recording. The rest of the characters in a variable name refer to the parameter estimate names of the components of a MixGGD; for example, "ch3sigmasq1" refers to the parameter estimate of $\sigma^2$ of the first component of the MixGGD obtained from EEG increments on the third channel. Parameter estimates contained in the .csv file are all real numbers within the range -671.11 to 259326.96.

Research results based upon these data are published at https://doi.org/10.1007/s00477-023-02524-y
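The column layout described above can be navigated with a small helper. The sketch below relies only on what the description states: 12 consecutive columns per channel, and names of the form "ch", channel number, parameter label, component index (as in "ch3sigmasq1"). The function names are illustrative, and parameter labels other than "sigmasq" are not listed in the description, so the parser does not assume them.

```python
import re

PARAMS_PER_CHANNEL = 12  # 4 parameters x 3 mixture components, per the description

# "ch" + channel number + alphabetic parameter label + component index
_NAME = re.compile(r"ch(\d+)([A-Za-z]+)(\d+)")


def channel_of_column(col):
    """Map a 1-based data column (1..228) to its 1-based channel number."""
    return (col - 1) // PARAMS_PER_CHANNEL + 1


def parse_variable_name(name):
    """Split a variable name such as 'ch3sigmasq1' into (channel, label, component)."""
    match = _NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"unexpected variable name: {name!r}")
    channel, label, component = match.groups()
    return int(channel), label, int(component)


if __name__ == "__main__":
    print(channel_of_column(13))
    print(parse_variable_name("ch3sigmasq1"))
```

Parsing the header names rather than relying on column position alone gives a check that the file matches the documented layout before any analysis is run.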
Statistics Canada has developed a Web Data Service that provides access to data and metadata that we release each business day. This is a good option for users who want to consume a discrete amount of data points updates to Statistics Canada data. To obtain information on how to use and consume our Web Data Service, please read the Web Data Service User Guide.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset of 418 primary school teachers' preferences on ICT-based teaching characteristics, analyzed using Discrete Choice Models, specifically McFadden's conditional logit model. The data includes variables such as subject area, grade level, and interactivity of digital resources. Each multivariate response is represented by three successive rows.
This data set describes the holding temperatures and developmental stages of larvae of red and blue king crab (Paralithodes camtschaticus and P. platypus) reared to the first crab stage at the Kodiak Laboratory.
The English Business Survey (EBS) will provide ministers and officials with information about the current economic and business conditions across England. By providing timely and robust information on a regular and geographically detailed basis, the survey will enhance officials’ understanding of how businesses are being affected throughout England and improve policy making by making it more responsive to changes in economic circumstances.
BIS has selected TNS-BMRB, an independent survey provider, to conduct the survey, covering approximately 3,000 businesses across England each month. BIS are conscious of burdens on business and therefore the survey is as light-touch as possible: it is both voluntary and telephone-based, requires only 11 to 12 minutes, and has been designed not to require reference to any detailed information.
The survey will provide qualitative information across a range of important variables (eg output, capacity, employment, labour costs, output prices and investment), compared with three months ago and expectations for 3 months ahead.
The outputs of the survey should also be useful to businesses, providing valuable intelligence about local economic and business conditions.
The EBS is still in its infancy and therefore full quality assurance of the data is not yet possible. Estimates from the survey have therefore been designated as Experimental Official Statistics. Results should be interpreted with this in mind.
EBS statistics are published on a monthly and quarterly basis:
Detailed results are available from the English Business Survey Reporting tool, see ‘Detailed results’ section, below. The latest statistical releases and monthly statistics are available below, with historic releases and data available from the EBS archive page (http://webarchive.nationalarchives.gov.uk/20121017180846/http://www.bis.gov.uk/analysis/statistics/sub-national-statistics/ebsurvey/ebsurvey-archive).
Data from the English Business Survey are published on a monthly and quarterly basis. The exact publication date will be announced four weeks in advance. We are working towards a regular publication cycle, however, due to the experimental nature of the data, the publication date for each month may vary. Future publication dates will be added to the National Statistics Publication Hub (http://www.statistics.gov.uk/hub/release-calendar/index.html?newquery=*&title=English+Business+Survey&source-agency=Business%2C+Innovation+and+Skills&pagetype=calendar-entry&lday=&lmonth=&lyear=&uday=&umonth=&uyear).
Detailed results providing the full range of English Business Survey statistics are available from the Reporting Tool (http://dservuk.tns-global.com/English-Business-Survey-Reporting-Tool). Quarterly (Discrete & Cumulative) data are available for the full range of geographies:
The latest EBS data will be added to the tool on a quarterly basis and cumulative monthly data will be available from the Reporting Tool (http://dservuk.tns-global.com/English-Business-Survey-Reporting-Tool) by early 2013.
If you have any questions on the EBS please send us an email at: ebsurvey@bis.gsi.gov.uk
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Experimental data can broadly be divided into discrete and continuous data. Continuous data are obtained from measurements that are performed as a function of another quantitative variable, e.g., time, length, concentration, or wavelength. The results from these types of experiments are often used to generate plots that visualize the measured variable on a continuous, quantitative scale. To simplify state-of-the-art data visualization and annotation of data from such experiments, an open-source tool was created with R/shiny that does not require coding skills to operate. The freely available web app accepts wide (spreadsheet) and tidy data and offers a range of options to normalize the data. The data from individual objects can be shown in 3 different ways: (1) lines with unique colors, (2) small multiples, and (3) heatmap-style display. In addition, the mean can be displayed with a 95% confidence interval for the visual comparison of different conditions. Several color-blind-friendly palettes are available to label the data and/or statistics. The plots can be annotated with graphical features and/or text to indicate any perturbations that are relevant. All user-defined settings can be stored for reproducibility of the data visualization. The app is dubbed PlotTwist and runs locally or online: https://huygens.science.uva.nl/PlotTwist
https://www.statsndata.org/how-to-order
The Discrete and Power Devices market plays a pivotal role in the modern electronics landscape, encompassing a wide range of components used to manage and control electrical power. These devices, which include transistors, diodes, rectifiers, and thyristors, serve essential functions in various applications, such as
Open Government Licence - Canada 2.0: https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
Statistics Canada has developed a Web Data Service that provides access to data and metadata that we release each business day. This is a good option for users who want to consume a discrete amount of data points updates to Statistics Canada data. To obtain information on how to use and consume our Web Data Service, please read the Web Data Service User Guide.
https://spdx.org/licenses/CC0-1.0.html
There is a growing appreciation of the multiple social and nonsocial factors influencing the foraging behavior of social animals, but little understanding of how these factors depend on habitat characteristics or individual traits. This partly reflects the difficulties inherent in using conventional statistical techniques to analyze multi-factor, multi-context foraging decisions. Discrete choice models provide a way to do so, and we demonstrate this by using them to investigate patch preference in a wild population of social foragers (chacma baboons, Papio ursinus). Data were collected from 29 adults across two social groups encompassing 683 foraging decisions over a six-month period, and the results interpreted using an information theoretic approach. Baboon foraging decisions were influenced by multiple nonsocial and social factors, and were often contingent on the characteristics of the habitat or individual. Differences in decision-making between habitats were consistent with changes in interference competition costs but not changes in social foraging benefits. Individual differences in decision-making were suggestive of a trade-off between dominance rank and social capital. Our findings emphasize that taking a multi-factor, multi-context approach is important to fully understand animal decision-making. We also demonstrate how discrete choice models can be used to achieve this.
https://www.statsndata.org/how-to-order
The Power Discrete market plays a pivotal role in the broader semiconductor industry, catering to the increasing demand for efficient power management solutions across various sectors, including automotive, consumer electronics, industrial applications, and renewable energy. Power Discrete devices, such as transisto
https://www.statsndata.org/how-to-order
The Discrete Devices and Power Modules market plays a pivotal role in various industries, providing essential components that drive the functionality and efficiency of electronic systems. Discrete devices, such as diodes, transistors, and resistors, are fundamental in managing electrical energy, while power modules
We begin by reviewing the statistical framework of information theory as applicable to neuroimaging data analysis. A major factor hindering wider adoption of this framework in neuroimaging is the difficulty of estimating information theoretic quantities in practice. We present a novel estimation technique that combines the statistical theory of copulas with the closed form solution for the entropy of Gaussian variables. This results in a general, computationally efficient, flexible, and robust multivariate statistical framework that provides effect sizes on a common meaningful scale, allows for unified treatment of discrete, continuous, unidimensional and multidimensional variables, and enables direct comparisons of representations from behavioral and brain responses across any recording modality. We validate the use of this estimate as a statistical test within a neuroimaging context, considering both discrete stimulus classes and continuous stimulus features. We also present examples ...
This table provides statistics on Discrete Patients by Practitioner Type and Alberta Health Services Geographic Zone Service Location and Recipient Location under the Alberta Health Care Insurance Plan (AHCIP). This table is an Excel version of a table in the "Alberta Health Care Insurance Statistical Supplement" report published annually by Alberta Health.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A DGGS is a type of spatial reference system that partitions the globe into many individual, evenly spaced, and well-aligned cells to encode location. We calculated normalized area and compactness of cell geometries for 5 open-source DGGS implementations - Uber H3, Google S2, RiskAware OpenEAGGR, rHEALPix by Landcare Research New Zealand, HEALPix by NASA Jet Propulsion Labs, and DGGRID by Southern Oregon University - to evaluate their suitability for a global-level statistical data cube.
This repository contains all generated data and statistics.
Kmoch et al. (2022). Area and Shape Distortions in Open-Source Discrete Global Grid Systems. Big Earth Data.
This table provides statistics on the Distribution of Discrete Patients by Payment Range for Services Provided by Physicians, based on fee-for-service payments under the Alberta Health Care Insurance Plan (AHCIP). This table is an Excel version of a table in the “Alberta Health Care Insurance Plan Statistical Supplement” report published annually by Alberta Health.