Facebook
TwitterThis dataset is part of a series of datasets, where batteries are continuously cycled with randomly generated current profiles. Reference charging and discharging cycles are also performed after a fixed interval of randomized usage to provide reference benchmarks for battery state of health. In this dataset, four 18650 Li-ion batteries (Identified as RW17, RW18, RW19 and RW20) were continuously operated by repeatedly charging them to 4.2V and then discharging them to 3.2V using a randomized sequence of discharging currents between 0.5A and 5A. This type of discharging profile is referred to here as random walk (RW) discharging. A customized probability distribution is used in this experiment to select a new load setpoint every 1 minute during RW discharging operation. The custom probability distribution was designed to be skewed towards selecting higher currents.
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Genome-wide analysis of gene expression or protein binding patterns using different array or sequencing based technologies is now routinely performed to compare different populations, such as treatment and reference groups. It is often necessary to normalize the data obtained to remove technical variation introduced in the course of conducting experimental work, but standard normalization techniques are not capable of eliminating technical bias in cases where the distribution of the truly altered variables is skewed, i.e. when a large fraction of the variables are either positively or negatively affected by the treatment. However, several experiments are likely to generate such skewed distributions, including ChIP-chip experiments for the study of chromatin, gene expression experiments for the study of apoptosis, and SNP-studies of copy number variation in normal and tumour tissues. A preliminary study using spike-in array data established that the capacity of an experiment to identify altered variables and generate unbiased estimates of the fold change decreases as the fraction of altered variables and the skewness increases. We propose the following work-flow for analyzing high-dimensional experiments with regions of altered variables: (1) Pre-process raw data using one of the standard normalization techniques. (2) Investigate if the distribution of the altered variables is skewed. (3) If the distribution is not believed to be skewed, no additional normalization is needed. Otherwise, re-normalize the data using a novel HMM-assisted normalization procedure. (4) Perform downstream analysis. Here, ChIP-chip data and simulated data were used to evaluate the performance of the work-flow. It was found that skewed distributions can be detected by using the novel DSE-test (Detection of Skewed Experiments). Furthermore, applying the HMM-assisted normalization to experiments where the distribution of the truly altered variables is skewed results in considerably higher sensitivity and lower bias than can be attained using standard and invariant normalization methods.
Facebook
Twitterhttps://www.ycharts.com/termshttps://www.ycharts.com/terms
View market daily updates and historical trends for SKEW. from United States. Source: Chicago Board Options Exchange. Track economic data with YCharts ana…
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Blockchain data query: ethereum.logs topic0 skew
Facebook
TwitterIn the fixed-effects stochastic frontier model an efficiency measure relative to the best firm in the sample is universally employed. This paper considers a new measure relative to the worst firm in the sample. We find that estimates of this measure have smaller bias than those of the traditional measure when the sample consists of many firms near the efficient frontier. Moreover, a two-sided measure relative to both the best and the worst firms is proposed. Simulations suggest that the new measures may be preferred depending on the skewness of the inefficiency distribution and the scale of efficiency differences.
Facebook
TwitterTo improve flood-frequency estimates at rural streams in Mississippi, annual exceedance probability (AEP) flows at gaged streams in Mississippi and regional-regression equations, used to estimate annual exceedance probability flows for ungaged streams in Mississippi, were developed by using current geospatial data, additional statistical methods, and annual peak-flow data through the 2013 water year. The regional-regression equations were derived from statistical analyses of peak-flow data, basin characteristics associated with 281 streamgages, the generalized skew from Bulletin 17B (Interagency Advisory Committee on Water Data, 1982), and a newly developed study-specific skew for select four-digit hydrologic unit code (HUC4) watersheds in Mississippi. Four flood regions were identified based on residuals from the regional-regression analyses. No analysis was conducted for streams in the Mississippi Alluvial Plain flood region because of a lack of long-term streamflow data and poorly defined basin characteristics. Flood regions containing sites with similar basin and climatic characteristics yielded better regional-regression equations with lower error percentages. The generalized least squares method was used to develop the final regression models for each flood region for annual exceedance probability flows. The peak-flow statistics were estimated by fitting a log-Pearson type III distribution to records of annual peak flows and then applying two additional statistical methods: (1) the expected moments algorithm to help describe uncertainty in annual peak flows and to better represent missing and historical record; and (2) the generalized multiple Grubbs-Beck test to screen out potentially influential low outliers and to better fit the upper end of the peak-flow distribution. Standard errors of prediction of the generalized least-squares models ranged from 28 to 46 percent. Pseudo coefficients of determination of the models ranged from 91 to 96 percent. Flood Region A, located in north-central Mississippi, contained 27 streamgages with drainage areas that ranged from 1.41 to 612 square miles. The 1% annual exceedance probability had a standard error of prediction of 31 percent which was lower than the prediction errors in Flood Regions B and C.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Supplementary Table 2: Samples from the validation and application cohorts that were predicted to be skewed using NGS data with and without the inclusion of escape genes consisting of significantly skewed variant positions.
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Quantitative-genetic models of differentiation under migration-selection balance often rely on the assumption of normally distributed genotypic and phenotypic values. When a population is subdivided into demes with selection toward different local optima, migration between demes may result in asymmetric, or skewed, local distributions. Using a simplified two-habitat model, we derive formulas without a priori assuming a Gaussian distribution of genotypic values, and we find expressions that naturally incorporate higher moments, such as skew. These formulas yield predictions of the expected divergence under migration-selection balance that are more accurate than models assuming Gaussian distributions, which illustrates the importance of incorporating these higher moments to assess the response to selection in heterogeneous environments. We further show with simulations that traits with loci of large effect display the largest skew in their distribution at migration-selection balance.
Facebook
TwitterThis dataset is part of a series of datasets, where batteries are continuously cycled with randomly generated current profiles. Reference charging and discharging cycles are also performed after a fixed interval of randomized usage to provide reference benchmarks for battery state of health. In this dataset, four 18650 Li-ion batteries (Identified as RW25, RW26, RW27 and RW28) were continuously operated by repeatedly charging them to 4.2V and then discharging them to 3.2V using a randomized sequence of discharging currents between 0.5A and 5A. This type of discharging profile is referred to here as random walk (RW) discharging. A customized probability distribution is used in this experiment to select a new load setpoint every 1 minute during RW discharging operation. The custom probability distribution was designed to be skewed towards selecting higher currents. The ambient temperature at which the batteries are cycled was held at approximately 40C for these experiments.
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Remarkable variation exists in the distribution of reproduction (skew) among members of cooperatively breeding groups, both within and between species. Reproductive skew theory has provided an important framework for understanding this variation. In the primitively eusocial Hymenoptera, two models have been routinely tested: concessions models, which assume complete control of reproduction by a dominant individual, and tug-of-war models, which assume on-going competition among group members over reproduction. Current data provide little support for either model, but uncertainty about the ability of individuals to detect genetic relatedness and difficulties in identifying traits conferring competitive ability mean that the relative importance of concessions versus tug-of-war remains unresolved. Here, we suggest that the use of social parasitism to generate meaningful variation in key social variables represents a valuable opportunity to explore the mechanisms underpinning reproductive skew within the social Hymenoptera. We present a direct test of concessions and tug-of-war models in the paper wasp Polistes dominulus by exploiting pronounced changes in relatedness and power structures that occur following replacement of the dominant by a congeneric social parasite. Comparisons of skew in parasitized and unparasitized colonies are consistent with a tug-of-war over reproduction within P. dominulus groups, but provide no evidence for reproductive concessions.
Facebook
Twitter**Student Mental Health Survey: Scaled Data on IT Students' Academic and Emotional Well-being ** **Overview **This dataset contains survey responses from IT students, focusing on academic stress, mental health, and lifestyle factors. It includes two files that capture different stages of data preparation to suit various analytical needs.
Files Included MentalHealthSurvey.csv:
Description: Contains the original survey data with raw categorical and numerical variables. Usefulness: Ideal for initial data exploration and understanding the unprocessed patterns before any data transformation. MentalHealthSurvey_Cleaned.csv:
Description: This file contains cleaned and preprocessed data with scaled numerical variables. The data was scaled using standard scaling techniques, which adjust the values so that each variable has a mean of 0 and a standard deviation of 1. Why Scaling is Useful: Scaling ensures that all numerical variables contribute equally to statistical models, particularly in factor analysis, where varying scales can skew the results. Scaled data improves model performance, stability, and interpretability, making it especially valuable for advanced analyses like predictive modeling and machine learning. Applications Initial Data Exploration: Use the raw data to explore variable distributions, correlations, and identify potential data quality issues. Advanced Analysis: The cleaned and scaled data is optimal for statistical analysis, helping to uncover meaningful patterns and insights into the factors affecting students' mental health and academic performance. Both files offer a complete view of the dataset, from raw data exploration to scaled data ready for rigorous analysis.
Facebook
TwitterWe construct a copula from the skew t distribution of Sahu et al. (2003). This copula can capture asymmetric and extreme dependence between variables, and is one of the few copulas that can do so and still be used in high dimensions effectively. However, it is difficult to estimate the copula model by maximum likelihood when the multivariate dimension is high, or when some or all of the marginal distributions are discrete-valued, or when the parameters in the marginal distributions and copula are estimated jointly. We therefore propose a Bayesian approach that overcomes all these problems. The computations are undertaken using a Markov chain Monte Carlo simulation method which exploits the conditionally Gaussian representation of the skew t distribution. We employ the approach in two contemporary econometric studies. The first is the modelling of regional spot prices in the Australian electricity market. Here, we observe complex non-Gaussian margins and nonlinear inter-regional dependence. Accurate characterization of this dependence is important for the study of market integration and risk management purposes. The second is the modelling of ordinal exposure measures for 15 major websites. Dependence between websites is important when measuring the impact of multi-site advertising campaigns. In both cases the skew t copula substantially outperforms symmetric elliptical copula alternatives, demonstrating that the skew t copula is a powerful modelling tool when coupled with Bayesian inference.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Effective conservation and species management requires an understanding of the causes of poor population growth. Conservation physiology uses biomarkers to identify factors that contribute to low individual fitness and population declines. Building on this, macrophysiology can use the same markers to assess how individual physiology varies with different ecological or demographic factors over large temporal and spatial scales. Here, we use a macrophysiological approach to identify the ecological and demographic correlates of poor population growth rates in the Cape mountain zebra metapopulation. We use two non-invasive biomarkers: faecal glucocorticoids as a measure of chronic stress, and faecal androgens as an indicator of male physiological status. We found that faecal glucocorticoid concentrations were highest in the spring prior to summer rainfall, and were elevated in individuals from populations associated with low quality habitat (lower grass abundance). In addition, faecal androgen concentrations were higher in populations with a high proportion of non-breeding stallions (where male:female adult sex ratios exceed 2:1) suggesting sex ratio imbalances may intensify male competition. Finally, population growth rate was negatively associated with faecal glucocorticoid concentrations and female fecundity was negatively associated with faecal androgens, indicating a relationship between hormone profiles and fitness. Together, our results provide cross population evidence for how poor population growth rates in Cape mountain zebra can be linked to individual physiological biomarkers. More broadly, we advocate physiological biomarkers as indicators of population viability, and as a way to evaluate the impact of variable ecological and demographic factors. In addition, conservation physiology can be used to assess the efficacy of management interventions for this subspecies, and this approach could inform models of species’ responses to future environmental change.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
An iterative Jacobi-like algorithm is described for transforming a skew-symmetric complex matrix A of even dimension into Murnaghan’s normal form. The decomposition allows to determine the singular values of A and to solve the system of linear equations Ax = b in a least square sense without accidentally destroying the skew-symmetry. Complex skew-symmetric matrices arise in the context of the time-dependent variational principle (TDVP), that maps a quantum mechanical system to a Hamiltonian system in a high-dimensional curved phase space. When the skew-symmetric phase space metric becomes singular because of parameter redundancies, the equations of motion have to be solved in a least square sense. The presented algorithm ensures that the symplectic structure is retained also in the least square solution. As a test case it is applied to studying the deflection of a photoelectron by a hydrogen atom using the TDVP. A Fortran implementation of the skew-Jacobi algorithm is provided as supplementary material.
Facebook
TwitterDryad_growth_ratesData on growth rates of herring gull chicksDryad_egg_measurementsHerring gull egg measurementsDryad_nest_summaryInformation on herring gull nest success or failureDryad_HERG_Sex_DataResults of genetic work (sex assignments) of herring gull chicks.
Facebook
TwitterMonogamy results in high genetic relatedness among offspring and thus it is generally assumed to be favoured by kin selection. Female multiple mating (polyandry) has nevertheless evolved several times in the social Hymenoptera (ants, bees and wasps), and a substantial amount of work has been conducted to understand its costs and benefits. Relatedness and inclusive fitness benefits are, however, not only influenced by queen mating frequency but also by paternity skew, which is a quantitative measure of paternity biases among the offspring of polyandrous females. We performed a large scale phylogenetic analysis of paternity skew across polyandrous social Hymenoptera. We found a general and significant negative association between paternity frequency and paternity skew. High paternity skew, which increases relatedness amongst colony members and thus maximizes inclusive fitness gains, characterized species with low paternity frequency. However, species with highly polyandrous queens had low paternity skew, with paternity equalized amongst potential sires. Equal paternity shares among fathers are expected to maximize fitness benefits derived from genetic diversity among offspring. We discuss the potential for post-copulatory sexual selection to influence patterns of paternity in social insects, and suggest that sexual selection may have played a key, yet overlooked role in social evolution.,Jaffe et al-Interspecific sims Sall by superfamilySampling protocol and simulations used to construct the null hypothesis under random paternity allocation. Interspecific simulations based on the entire data set (Sall, n = 72 species), performed within ants, bees and wasps.Jaffe et al-Interspecific sims SallSampling protocol and simulations used to construct the null hypothesis under random paternity allocation. Interspecific simulations based on the entire data set (Sall, n = 72 species), performed accross all species.Jaffe et al-Interspecific sims Ssig by superfamilySampling protocol and simulations used to construct the null hypothesis under random paternity allocation. Interspecific simulations based on the species with skew values differing from the random skew expectation (Ssig, n = 56 species), performed within ants, bees and wasps.Jaffe et al-Interspecific sims SsigSampling protocol and simulations used to construct the null hypothesis under random paternity allocation. Interspecific simulations based on the species with skew values differing from the random skew expectation (Ssig, n = 56 species), performed across all species.Jaffe et al-Intraspecific simulationsSampling protocol and simulations used to construct the null hypothesis under random paternity allocation. Intraspecific simulations.Jaffe et al-null hypothesis filesReadMe file,
Facebook
TwitterAuthors: Tyler Linderoth, Lauren Deaner, Nancy Chen, Reed Bowman, Raoul K. Boughton, Sarah W. Fitzpatrick Year: 2025
Contact: Tyler Linderoth, lindero1@msu.edu
Code used to analyze data are available at https://github.com/tplinderoth/M4_FSJ_translocations or the archived release of this GitHub repository: https://doi.org/10.5281/zenodo.14606489
The following is a list of the different data types by file suffix included in this repository along with brief descriptions for how to work with them. Many examples for how these data were used can be found in the "Linderoth_etal_Mosaic_FSJ_translocations_study_code.txt" document from the GitHub repository referenced above.
Descriptions of FASTA, BED, ...
Facebook
Twitterhttps://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Trade-offs are a fundamental concept in evolutionary biology because they are thought to explain much of nature’s biological diversity, from variation in life-histories to differences in metabolism. Despite the predicted importance of trade-offs, they are notoriously difficult to detect. Here we contribute to the existing rich theoretical literature on trade-offs by examining how the shape of the distribution of resources or metabolites acquired in an allocation pathway influences the strength of trade-offs between traits. We further explore how variation in resource distribution interacts with two aspects of pathway complexity (i.e., the number of branches and hierarchical structure) affects tradeoffs. We simulate variation in the shape of the distribution of a resource by sampling 106 individuals from a beta distribution with varying parameters to alter the resource shape. In a simple “Y-model” allocation of resources to two traits, any variation in a resource leads to slopes less than -1, with left skewed and symmetrical distributions leading to negative relationships between traits, and highly right skewed distributions associated with positive relationships between traits. Adding more branches further weakens negative and positive relationships between traits, and the hierarchical structure of pathways typically weakens relationships between traits, although in some contexts hierarchical complexity can strengthen positive relationships between traits. Our results further illuminate how variation in the acquisition and allocation of resources, and particularly the shape of a resource distribution and how it interacts with pathway complexity, makes it challenging to detect trade-offs. We offer several practical suggestions on how to detect trade-offs given these challenges. Methods Overview of Flux Simulations To study the strength and direction of trade-offs within a population, we developed a simulation of flux in a simple metabolic pathway, where a precursor metabolite emerging from node A may either be converted to metabolic products B1 or B2 (Fig. 1). This conception of a pathway is similar to De Jong and Van Noordwijk’s Y-model (Van Noordwijk & De Jong, 1986; De Jong & Van Noordwijk, 1992), but we used simulation instead of analytical statistical models to allow us to consider greater complexity in the distribution of variables and pathways. For a simple pathway (Fig. 1), the total flux Jtotal (i.e., the flux at node A, denoted as JA) for each individual (N = 106) was first sampled from a predetermined beta distribution as described below. The flux at node B1 (JB1) was then randomly sampled from this distribution with max = Jtotal = JA and min = 0. The flux at the remaining node, B2, was then simply the remaining flux (JB2 = JA - JB1). Simulations of more complex pathways followed the same basic approach as described above, with increased numbers of branches and hierarchical levels added to the pathway as described below under Question 2. The metabolic pathways were simulated using Python (v. 3.8.2) (Van Rossum & Drake Jr., 2009) where we could control the underlying distribution of metabolite allocation. The output flux at nodes B1 and B2 was plotted using R (v. 4.2.1) (Team, 2022) with the resulting trade-off visualized as a linear regression using the ggplot2 R package (v. 3.4.2) (Wickham, 2016). While we have conceptualized the pathway as the flux of metabolites, it could be thought of as any resource being allocated to different traits. Question 1: How does variation in resource distribution within a population affect the strength and direction of trade-offs? We first simulated the simplest scenario where all individuals had the same total flux Jtotal = 1, in which case the phenotypic trade-off is expected to be most easily detected. We then modified this initial scenario to explore how variation in the distribution of resource acquisition (Jtotal) affected the strength and direction of trade-offs. Specifically, the resource distribution was systematically varied by sampling n = 103 total flux levels from a beta distribution, which has two parameters alpha and beta that control the size and shape of the distribution (Miller & Miller, 1999). When alpha is large and beta is small, the distribution is left skewed, whereas for small alpha and large beta, the distribution is right skewed. Likewise, for alpha = beta, the curve is symmetrical and approximately normal when the parameters are sufficiently large (>2). We can thus systematically vary the underlying resource distribution of a population by iterating through values of alpha and beta from 0.5 to 5 (in increments of 0.5), which was done using the NumPy Python package (v. 1.19.1) (Harris et al., 2020). The resulting slope of each linear regression of the flux at B1 and B2 (i.e., the two branching nodes) was then calculated using the lm function in R and plotted as a contour map using the latticeExtra Rpackage (v. 0.6-30) (Sarkar, 2008). Question 2: How does the complexity of the pathway used to produce traits affect the strength and direction of trade-offs? Metabolic pathways are typically more complex than what is described above. Most pathways consist of multiple branch points and multiple hierarchical levels. To understand how complexity affects the ability to detect trade-offs when combined with variation in the distribution of total flux we systematically manipulated the number of branch points and hierarchical levels within pathways (Fig. 1). We first explored the effect of adding branches to the pathway from the same node, such that instead of only branching off to nodes B1 and B2, the pathway branched to nodes B1 through to Bn (Fig. 1B), where n is the total number of branches (maximum n = 10 branches). Flux at a node was calculated as previously described, and the remaining flux was evenly distributed amongst the remaining nodes (i.e., nodes B2 through to Bnwould each receive J2-n = (Jtotal - JB1)/(n - 1) flux). For each pathway, we simulated flux using a beta distribution of Jtotalwith alpha = 5, beta = 0.5 to simulate a left skewed distribution, alpha = beta = 5 to simulate a normal distribution, and with alpha = 0.5, beta = 5 to simulate a right skewed distribution, as well as the simplest case where all individuals have total flux Jtotal = 1. We next considered how adding hierarchical levels to a metabolic pathway affected trade-offs. We modified our initial pathway with node A branching to nodes B1 and B2, and then node B2 further branched to nodes C1 and C2 (Fig. 1C). To compute the flux at the two new nodes C1 and C2, we simply repeated the same calculation as before, but using the flux at node B2, JB2, as the total flux. That is, the flux at node C1 was obtained by randomly sampling from the distribution at B2 with max = JB and min = 0, and the flux at node C2 is the remaining flux (JC = JB2 - JC1). Much like in the previous scenario with multiple branch points, we used three beta distributions (with the same parameters as before) to represent left, normal, and right skewed resource distributions, as well as the simplest case where Jtotal = 1 for all individuals. Quantile Regressions We performed quantile regression to understand whether this approach could help to detect trade-offs. Quantile regression is a form of statistical analysis that fits a curve through upper or lower quantiles of the data to assess whether an independent variable potentially sets a lower or upper limit to a response variable (Cade et al., 1999). This type of analysis is particularly useful when it is thought that an independent variable places a constraint on a response variable, yet variation in the response variable is influenced by many additional factors that add “noise” to the data, making a simple bivariate relationship difficult to detect (Thomson et al., 1996). Quantile regression is an extension of ordinary least squares regression, which regresses the best fitting line through the 50th percentile of the data. In addition to performing ordinary least squares regression for each pairwise comparison between the four nodes (B1, B2, C1, C2), we performed a series of quantile regressions using the ggplot2 R package (v. 3.4.2), where only the qth quantile was used for the regression (q = 0.99 and 0.95 to 0.5 in increments of 0.05, see Fig. S1) (Cade et al., 1999).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
IntroductionThere are Challenges in statistically modelling immune responses to longitudinal HIV viral load exposure as a function of covariates. We define Bayesian Markov Chain Monte Carlo mixed effects models to incorporate priors and examine the effect of different distributional assumptions. We prospectively fit these models to an as-yet-unpublished data from the Tshwane District Hospital HIV treatment clinic in South Africa, to determine if cumulative log viral load, an indicator of long-term viral exposure, is a valid predictor of immune response.MethodsModels are defined, to express ‘slope’, i.e. mean annual increase in CD4 counts, and ‘asymptote’, i.e. the odds of having a CD4 count ≥500 cells/μL during antiretroviral treatment, as a function of covariates and random-effects. We compare the effect of using informative versus non-informative prior distributions on model parameters. Models with cubic splines or Skew-normal distributions are also compared using the conditional Deviance Information Criterion.ResultsThe data of 750 patients are analyzed. Overall, models adjusting for cumulative log viral load provide a significantly better fit than those that do not. An increase in cumulative log viral load is associated with a decrease in CD4 count slope (19.6 cells/μL (95% credible interval: 28.26, 10.93)) and a reduction in the odds of achieving a CD4 counts ≥500 cells/μL (0.42 (95% CI: 0.236, 0.730)) during 5 years of therapy. Using informative priors improves the cumulative log viral load estimate, and a skew-normal distribution for the random-intercept and measurement error results is a better fit compared to using classical Gaussian distributions.DiscussionWe demonstrate in an unpublished South African cohort that cumulative log viral load is a strong and significant predictor of both CD4 count slope and asymptote. We argue that Bayesian methods should be used more frequently for such data, given their flexibility to incorporate prior information and non-Gaussian distributions.
Facebook
TwitterThis dataset is part of a series of datasets, where batteries are continuously cycled with randomly generated current profiles. Reference charging and discharging cycles are also performed after a fixed interval of randomized usage to provide reference benchmarks for battery state of health. In this dataset, four 18650 Li-ion batteries (Identified as RW17, RW18, RW19 and RW20) were continuously operated by repeatedly charging them to 4.2V and then discharging them to 3.2V using a randomized sequence of discharging currents between 0.5A and 5A. This type of discharging profile is referred to here as random walk (RW) discharging. A customized probability distribution is used in this experiment to select a new load setpoint every 1 minute during RW discharging operation. The custom probability distribution was designed to be skewed towards selecting higher currents.