Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Observed phenotypic responses to selection in the wild often differ from predictions based on measurements of selection and genetic variance. An overlooked hypothesis to explain this paradox of stasis is that a skewed phenotypic distribution affects natural selection and evolution. We show through mathematical modelling that, when a trait selected for an optimum phenotype has a skewed distribution, directional selection is detected even at evolutionary equilibrium, where it causes no change in the mean phenotype. When environmental effects are skewed, Lande and Arnold’s (1983) directional gradient is in the direction opposite to the skew. In contrast, skewed breeding values can displace the mean phenotype from the optimum, causing directional selection in the direction of the skew. These effects can be partitioned out using alternative selection estimates based on average derivatives of individual relative fitness, or additive genetic covariances between relative fitness and trait (Robertson-Price identity). We assess the validity of these predictions using simulations of selection estimation under moderate samples size. Ecologically relevant traits may commonly have skewed distributions, as we here exemplify with avian laying date – repeatedly described as more evolutionarily stable than expected –, so this skewness should be accounted for when investigating evolutionary dynamics in the wild.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results based on 250 replications of skew generalized t-link samples (probit and skew-probit fits).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Maximum likelihood fits of probit, skew-probit, Generalized T (GT)-link and Skew Generalized T (SGT) -link models to the respiratory infection data.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This data used for this analysis contains personal fitness tracker from thirty fitbit users. Approximately thirty eligible Fitbit users consented to the submission of personal tracker data, including minute-level output for physical activity, heart rate, and sleep monitoring. It includes information about daily activity, steps, and heart rate that can be used to explore users' habits.
There are several limitations to this data set that may skew or cause our analysis to not be completely conclusive. These limitations include the following:
About this Data: This dataset was generated by respondents to a distributed survey via Amazon Mechanical Turk between 03.12.2016-05.12.2016. Variation between output represents use of different types of Fitbit trackers and individual tracking behaviors/preferences. Per the Amazon Mechanical Turk Website: "Amazon Mechanical Turk is a forum where Requesters post work as Human Intelligence Tasks (HITs). Workers complete HITs in exchange for a reward. You write, test, and publish your HIT using the Mechanical Turk developer sandbox, Amazon Mechanical Turk APIs, and AWS SDKs."
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Effective conservation and species management requires an understanding of the causes of poor population growth. Conservation physiology uses biomarkers to identify factors that contribute to low individual fitness and population declines. Building on this, macrophysiology can use the same markers to assess how individual physiology varies with different ecological or demographic factors over large temporal and spatial scales. Here, we use a macrophysiological approach to identify the ecological and demographic correlates of poor population growth rates in the Cape mountain zebra metapopulation. We use two non-invasive biomarkers: faecal glucocorticoids as a measure of chronic stress, and faecal androgens as an indicator of male physiological status. We found that faecal glucocorticoid concentrations were highest in the spring prior to summer rainfall, and were elevated in individuals from populations associated with low quality habitat (lower grass abundance). In addition, faecal androgen concentrations were higher in populations with a high proportion of non-breeding stallions (where male:female adult sex ratios exceed 2:1) suggesting sex ratio imbalances may intensify male competition. Finally, population growth rate was negatively associated with faecal glucocorticoid concentrations and female fecundity was negatively associated with faecal androgens, indicating a relationship between hormone profiles and fitness. Together, our results provide cross population evidence for how poor population growth rates in Cape mountain zebra can be linked to individual physiological biomarkers. More broadly, we advocate physiological biomarkers as indicators of population viability, and as a way to evaluate the impact of variable ecological and demographic factors. In addition, conservation physiology can be used to assess the efficacy of management interventions for this subspecies, and this approach could inform models of species’ responses to future environmental change.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset is an engineered version of the original Ames Housing dataset from the "House Prices: Advanced Regression Techniques" Kaggle competition. The goal of this engineering was to clean the data, handle missing values, encode categorical features, scale numeric features, manage outliers, reduce skewness, select useful features, and create new features to improve model performance for house price prediction.
The original dataset contains information on 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, with the target variable being SalePrice. This engineered version has undergone several preprocessing steps to make it ready for machine learning models.
PoolQC) were filled with "None". Numeric columns were filled with median, and other categorical columns with mode.SalePrice were removed.The final dataset has fewer columns than the original (reduced from 81 to approximately 250 after one-hot encoding, then further reduced by feature selection), with improved quality for modeling.
To add more predictive power, the following new features were created based on domain knowledge:
1. HouseAge: Age of the house at the time of sale. Calculated as YrSold - YearBuilt. This captures how old the house is, which can negatively affect price due to depreciation.
- Example: A house built in 2000 and sold in 2008 has HouseAge = 8.
2. Quality_x_Size: Interaction term between overall quality and living area. Calculated as OverallQual * GrLivArea. This combines quality and size to capture the value of high-quality large homes.
- Example: A house with OverallQual = 7 and GrLivArea = 1500 has Quality_x_Size = 10500.
3. TotalSF: Total square footage of the house. Calculated as GrLivArea + TotalBsmtSF + 1stFlrSF + 2ndFlrSF (if available). This aggregates area features into a single metric for better price prediction.
- Example: If GrLivArea = 1500 and TotalBsmtSF = 1000, TotalSF = 2500.
4. Log_LotArea: Log-transformed lot area to reduce skewness. Calculated as np.log1p(LotArea). This makes the distribution of lot sizes more normal, helping models handle extreme values.
- Example: A lot area of 10000 becomes Log_LotArea ≈ 9.21.
These new features were created using the original (unscaled) values to maintain interpretability, then scaled with RobustScaler to match the rest of the dataset.
SalePrice, such as:
OverallQual: Material and finish quality (scaled, 1-10).GrLivArea: Above grade (ground) living area square feet (scaled).GarageCars: Size of garage in car capacity (scaled).TotalBsmtSF: Total square feet of basement area (scaled).FullBath, YearBuilt, etc. (see the code for the full list).ExterQual: Exterior material quality (encoded as 0=Po to 4=Ex).BsmtQual: Basement quality (encoded as 0=None to 5=Ex).MSZoning_RL: 1 if residential low density, 0 otherwise.Neighborhood_NAmes: 1 if in NAmes neighborhood, 0 otherwise.HouseAge: Age of the house (scaled).Quality_x_Size: Overall quality times living area (scaled).TotalSF: Total square footage (scaled).Log_LotArea: Log-transformed lot area (scaled).SalePrice - The property's sale price in dollars (not scaled, as it's the target).Total columns: Approximately 200-250 (after one-hot encoding and feature selection).
This dataset is derived from the Ames Housing...
Facebook
Twitter(1) The ability of dispersing individuals to adjust their behaviour to changing conditions is instrumental in overcoming challenges and reducing dispersal costs, consequently increasing overall dispersal success. Understanding how dispersers’ behaviour and physiology change during the dispersal process, and how they differ from resident individuals, can shed light on the mechanisms by which dispersers increase survival and maximise reproduction. (2) By analysing individual behaviour and concentrations of faecal glucocorticoid metabolites (fGCM), a stress-associated biomarker, we sought to identify the proximate causes behind differences in survival and reproduction between dispersing and resident meerkats (Suricata suricatta). (3) We used data collected on 67 dispersing and 108 resident females to investigate (i) which individual, social, and environmental factors are correlated to foraging and vigilance, and whether the role of such factors differs among dispersal phases, and between d...
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Annual vegetation aboveground net primary productivity (ANPP) exhibits a nonlinear dependence on annual precipitation. A common pattern of nonlinearity, called asymmetry, arises when productivity responses in wet years are larger than declines in dry years. To-date, ANPP asymmetry has been attributed primarily to vegetation water stress, an internal ecosystem response to precipitation and soil water availability. However, when quantified via the asymmetry index (AI) estimated from productivity measurements, the asymmetry can be a sampling artifact that arises from a positively skewed annual precipitation distribution. In this paper, we aimed to separate the sampling effect (from external precipitation variability) from the nonlinear response of the system (the internal ecosystem dynamics). We constructed a probabilistic model that integrates the precipitation distribution with the precipitation-productivity response curve (PPT-ANPP curve), derived using empirical formulae and a process-based soil water balance model. The model was used to derive the probability density function of AI and to attribute its shape to the PPT distribution and the PPT-ANPP response curve. The models were compared to data from 47 grasslands. Results demonstrated that positively skewed precipitation produces a positive AI as a statistical artifact. The nonlinear ecosystem PPT-ANPP dependence can further enhance or dampen this statistical artifact. In all sites, the precipitation skew highly affected the probability of correctly identifying asymmetry using AI. Observed negative asymmetry arises from a larger soil water holding capacity, and positive asymmetry from plant water stress. More robust statistical indicators of nonlinear ecological responses to climate variability are needed to improve ecosystem forecasts.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The rapid advancement of additive manufacturing (AM) requires researchers to keep up with these advancements by continually improving the AM processes. Improving manufacturing processes involves evaluating the process outputs and their conformity to the required specifications. Process capability indices, calculated using critical quality characteristics (QCs), have long been used in the evaluation process due to their proven effectiveness. AM processes typically involve multi-correlated critical QCs, indicating the need to develop a multivariate process capability index (MPCI) rather than a univariate capability index, which may lead to misleading results. In this regard, this study proposes a general methodological framework for evaluating AM processes using MPCI. The proposed framework starts by identifying the AM process and product design. Fused Deposition Modeling (FDM) is chosen for this investigation. Then, the specification limits associated with critical QCs are established. To ensure that the MPCI assumptions are met, the critical QCs data are examined for normality, stability, and correlation. Additionally, the MPCI is estimated by simulating a large sample using the properties of the collected QCs data and determining the percent of nonconforming (PNC). Furthermore, the FDM process and its capable tolerance limits are then assessed using the proposed MPCI. Finally, the study presents a sensitivity analysis of the FDM process and suggestions for improvement based on the analysis of assignable causes of variation. The results revealed that the considered process mean is shifted for all QCs, and the most variation is associated with part diameter data. Moreover, the process data are not normally distributed, and the proposed transformation algorithm performs well in reducing data skewness. Also, the performance of the FDM process according to different designations of specification limits was estimated. The results showed that the FDM process is incapable of different designs except with very coarse specifications.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Results based on 250 replications of probit samples: Probit and skew-probit fits.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The rapid advancement of additive manufacturing (AM) requires researchers to keep up with these advancements by continually improving the AM processes. Improving manufacturing processes involves evaluating the process outputs and their conformity to the required specifications. Process capability indices, calculated using critical quality characteristics (QCs), have long been used in the evaluation process due to their proven effectiveness. AM processes typically involve multi-correlated critical QCs, indicating the need to develop a multivariate process capability index (MPCI) rather than a univariate capability index, which may lead to misleading results. In this regard, this study proposes a general methodological framework for evaluating AM processes using MPCI. The proposed framework starts by identifying the AM process and product design. Fused Deposition Modeling (FDM) is chosen for this investigation. Then, the specification limits associated with critical QCs are established. To ensure that the MPCI assumptions are met, the critical QCs data are examined for normality, stability, and correlation. Additionally, the MPCI is estimated by simulating a large sample using the properties of the collected QCs data and determining the percent of nonconforming (PNC). Furthermore, the FDM process and its capable tolerance limits are then assessed using the proposed MPCI. Finally, the study presents a sensitivity analysis of the FDM process and suggestions for improvement based on the analysis of assignable causes of variation. The results revealed that the considered process mean is shifted for all QCs, and the most variation is associated with part diameter data. Moreover, the process data are not normally distributed, and the proposed transformation algorithm performs well in reducing data skewness. Also, the performance of the FDM process according to different designations of specification limits was estimated. The results showed that the FDM process is incapable of different designs except with very coarse specifications.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Impressions of a place are partly formed by smell. The urban waterfronts often leave a rather poor impression due to odor pollution, resulting in recurring complaints. The nature of such complaints can be subjective and vague, so there is a growing interest in quantitative measurements of emissions to explore the causes of malodorous influence. In the present work, an air quality monitor with an H2S sensor was employed to continuously measure emissions of malodors at 1-min resolution. H2S is often considered to be the predominant odorous substance from sludge and water bodies as it is readily perceptible. The integrated means of concentration from in situ measurements were combined with the AERMOD dispersion model to reveal the spatial distribution of odor concentrations and estimate the extent of odor-prone areas at a daily time step. Year-long observations showed that the diurnal profile exhibits a positively skewed distribution. Meteorology plays a vital role in odor dispersion; the degree of dispersion was explored on a case-by-case basis. There is a greater likelihood of capturing the concentration peaks at night (21:00 to 6:00) as the air is more stable then with less tendency for vertical mixing but favors a horizontal spread. This study indicates that malodors are changeable in time and space and establishes a new approach to using H2S sensor data and resolves a long-standing question about odor in Hong Kong. Implications: this study establishes a new approach combining dispersion model with novel H2S sensor data to understand the characteristics and pattern of odor emanated from the urban waterfront in Hong Kong. The sensor has dynamic concentration range to detect the episodic level of H2S and low level at background conditions. It provides more complete information in relation to odor annoyance, as well as quantitative information useful for odor regulation
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Median and IQR of skewed data for CRP.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Observed phenotypic responses to selection in the wild often differ from predictions based on measurements of selection and genetic variance. An overlooked hypothesis to explain this paradox of stasis is that a skewed phenotypic distribution affects natural selection and evolution. We show through mathematical modelling that, when a trait selected for an optimum phenotype has a skewed distribution, directional selection is detected even at evolutionary equilibrium, where it causes no change in the mean phenotype. When environmental effects are skewed, Lande and Arnold’s (1983) directional gradient is in the direction opposite to the skew. In contrast, skewed breeding values can displace the mean phenotype from the optimum, causing directional selection in the direction of the skew. These effects can be partitioned out using alternative selection estimates based on average derivatives of individual relative fitness, or additive genetic covariances between relative fitness and trait (Robertson-Price identity). We assess the validity of these predictions using simulations of selection estimation under moderate samples size. Ecologically relevant traits may commonly have skewed distributions, as we here exemplify with avian laying date – repeatedly described as more evolutionarily stable than expected –, so this skewness should be accounted for when investigating evolutionary dynamics in the wild.