Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains 1000 rows and 8 columns, representing students’ population details and academic performance. The columns include Components like gender, race, Guardian level of education, lunch type, and test preparation course, along with their math, reading, and writing scores. It provides a detailed view of how background and preparation affect student performance.
Gender: 518 females, 482 males
Race/Ethnicity: Group C (319), Group D (262), Group B (190), Group E (140), Group A (89)
Parental Education: Some college (226), Associate’s degree (222), High school (196), Some high school (179), Bachelor’s (118), Master’s (59)
Lunch: Standard (645), Free/Reduced (355)
Test Preparation: None (642), Completed (358)
Math Score: Min 0, Max 100, Mean ≈ 66.1, Median 66
Reading Score: Min 17, Max 100, Mean ≈ 69.2, Median 70
Writing Score: Min 10, Max 100, Mean ≈ 68.1, Median 69
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We control for common variance from repeated measures using random intercepts for participants.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
We control for common variance from repeated measures using random intercepts for participants.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This file contains datasets supporting the findings of this study. (XLSX)
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundTo date, there is a lack of sufficient evidence on the type of clusters in which severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is most likely to spread. Notably, the differences between cluster-level and population-level outbreaks in epidemiological characteristics and transmissibility remain unclear. Identifying the characteristics of these two levels, including epidemiology and transmission dynamics, allows us to develop better surveillance and control strategies following the current removal of suppression measures in China.MethodsWe described the epidemiological characteristics of SARS-CoV-2 and calculated its transmissibility by taking a Chinese city as an example. We used descriptive analysis to characterize epidemiological features for coronavirus disease 2019 (COVID-19) incidence database from 1 Jan 2020 to 2 March 2020 in Chaoyang District, Beijing City, China. The susceptible-exposed-infected-asymptomatic-recovered (SEIAR) model was fitted with the dataset, and the effective reproduction number (Reff) was calculated as the transmissibility of a single population. Also, the basic reproduction number (R0) was calculated by definition for three clusters, such as household, factory and community, as the transmissibility of subgroups.ResultsThe epidemic curve in Chaoyang District was divided into three stages. We included nine clusters (subgroups), which comprised of seven household-level and one factory-level and one community-level cluster, with sizes ranging from 2 to 17 cases. For the nine clusters, the median incubation period was 17.0 days [Interquartile range (IQR): 8.4–24.0 days (d)], and the average interval between date of onset (report date) and diagnosis date was 1.9 d (IQR: 1.7 to 6.4 d). At the population level, the transmissibility of the virus was high in the early stage of the epidemic (Reff = 4.81). The transmissibility was higher in factory-level clusters (R0 = 16) than in community-level clusters (R0 = 3), and household-level clusters (R0 = 1).ConclusionsIn Chaoyang District, the epidemiological features of SARS-CoV-2 showed multi-stage pattern. Many clusters were reported to occur indoors, mostly from households and factories, and few from the community. The risk of transmission varies by setting, with indoor settings being more severe than outdoor settings. Reported household clusters were the predominant type, but the population size of the different types of clusters limited transmission. The transmissibility of SARS-CoV-2 was different between a single population and its subgroups, with cluster-level transmissibility higher than population-level transmissibility.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The values based on an S sampling scheme under non-response utilizing Population I
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Extensive research work has been done for the estimation of population mean using bivariate auxiliary information based on conventional measures. Conventional measures of the auxiliary variables provide suspicious results in the presence of outliers/extreme values. However, non-conventional measures of the auxiliary variables include quartile deviation, mid-range, inter-quartile range, quartile average, tri-mean, Hodge-Lehmann estimator etc. give efficient results in case of extreme values. Unfortunately, non-conventional measures are not used by survey practitioners to enhance the estimation of unknown population parameters using bivariate auxiliary information. In this article, difference-cum-exponential-type estimators for population mean utilizing bivariate auxiliary information based on non-conventional measures under simple and stratified random sampling schemes have been suggested. Mathematical properties such as bias and mean squared error are derived. To support theoretical findings, various real-life applications are used to confirm the superiority of the suggested estimators as compared to the competing estimators under study.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This compressed directory contains the two data sets discussed in ‘Appendix: Data Analysis’ (above) and the scripts used to generate Figs 1 and 2. (ZIP)
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This study focuses on estimating a finite population cumulative distribution function (CDF) using two-stage and three-stage cluster sampling under non-response. This work is then extended to estimate the finite population CDF under non-response using stratified two-stage and three-stage cluster sampling. We propose two distinct families of CDF estimators, specifically designed for these complex surveys, namely classical ratio/product-type and exponential ratio/product-type. Furthermore, we introduce a difference estimator for the CDF under non-response, utilizing ancillary information about the variances and covariances of the estimators under these complex schemes. We provide mathematical expressions for the biases and mean squared errors of the proposed CDF estimators, based on first-order approximation. To evaluate the performance of the proposed estimators, we conduct extensive simulations and assess their efficiency. The simulation results demonstrate that the proposed families of estimators perform well under different sampling scenarios. Our findings indicate that difference CDF estimators are more explicit than the other estimators discussed. We support our theoretical claims by analyzing real datasets.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Mathematical details of the computation of the means, variances, and covariance of the nonstationary Gompertz process.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Empirically observed overdispersion of TE load.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Several members of the suggested families of CDF estimators under non-response
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Deasphalting bitumen using paraffinic solvent injection is a commonly used technique to reduce both its viscosity and density and ease its flow through pipelines. Common modeling approaches for asphaltene precipitation prediction such as population balance model (PBM) contains complex mathematical relation and require conducting precise experiments to define initial and boundary conditions. Machine learning (ML) approach is considered as a robust, fast, and reliable alternative modeling approach. The main objective of this research work was to model the effect of paraffinic solvent injection on the amount of asphaltene precipitation using ML and PBM approaches. Five hundred and ninety (590) experimental data were collected from the literature for model development. The gathered data was processed using box plot, data scaling, and data splitting. Data pre-processing led to the use of 517 data points for modeling. Then, multilayer perceptron, random forest, decision tree, support vector machine, committee machine intelligent system optimized by annealing, and random search techniques were used for modeling. Precipitant molecular weight, injection rate, API gravity, pressure, C5 asphaltene content, and temperature were determined as the most relevant features for the process. Although the results of the PBM model are precise, the AI/ML model (CMIS) is the preferred model due to its robustness, reliability, and relative accuracy. The committee machine intelligent system is the superior model among the developed smart models with an RMSE of 1.7% for the testing dataset and prediction of asphaltene precipitation during bitumen recovery.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Mean and variance of TE load in the absence of selection.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Multivariate model prediction accuracy on the test dataset (RMSE mean and standard deviation for 30 experimental runs across 4 prediction horizons).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The paper presents an innovative computational framework for predictive solutions for simulating the spread of malaria. The structure incorporates sophisticated computing methods to improve the reliability of predicting malaria outbreaks. The study strives to provide a strong and effective tool for forecasting the propagation of malaria via the use of an AI-based recurrent neural network (RNN). The model is classified into two groups, consisting of humans and mosquitoes. To develop the model, the traditional Ross-Macdonald model is expanded upon, allowing for a more comprehensive analysis of the intricate dynamics at play. To gain a deeper understanding of the extended Ross model, we employ RNN, treating it as an initial value problem involving a system of first-order ordinary differential equations, each representing one of the seven profiles. This method enables us to obtain valuable insights and elucidate the complexities inherent in the propagation of malaria. Mosquitoes and humans constitute the two cohorts encompassed within the exposition of the mathematical dynamical model. Human dynamics are comprised of individuals who are susceptible, exposed, infectious, and in recovery. The mosquito population, on the other hand, is divided into three categories: susceptible, exposed, and infected. For RNN, we used the input of 0 to 300 days with an interval length of 3 days. The evaluation of the precision and accuracy of the methodology is conducted by superimposing the estimated solution onto the numerical solution. In addition, the outcomes obtained from the RNN are examined, including regression analysis, assessment of error autocorrelation, examination of time series response plots, mean square error, error histogram, and absolute error. A reduced mean square error signifies that the model’s estimates are more accurate. The result is consistent with acquiring an approximate absolute error close to zero, revealing the efficacy of the suggested strategy. This research presents a novel approach to solving the malaria propagation model using recurrent neural networks. Additionally, it examines the behavior of various profiles under varying initial conditions of the malaria propagation model, which consists of a system of ordinary differential equations.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Because SARS-CoV-2 constantly mutates to escape from the immune response, there is a reduction of neutralizing capacity of antibodies initially targeting the historical strain against emerging Variants of Concern (VoC)s. That is why the measure of the protection conferred by vaccination cannot solely rely on the antibody levels, but also requires to measure their neutralization capacity. Here we used a mathematical model to follow the humoral response in 26 individuals that received up to three vaccination doses of Bnt162b2 vaccine, and for whom both anti-S IgG and neutralization capacity was measured longitudinally against all main VoCs. Our model could identify two independent mechanisms that led to a marked increase in measured humoral response over the successive vaccination doses. In addition to the already known increase in IgG levels after each dose, we identified that the neutralization capacity was significantly increased after the third vaccine administration against all VoCs, despite large inter-individual variability. Consequently, the model projects that the mean duration of detectable neutralizing capacity against non-Omicron VoC is between 348 days (Beta variant, 95% Prediction Intervals PI [307; 389]) and 587 days (Alpha variant, 95% PI [537; 636]). Despite the low neutralization levels after three doses, the mean duration of detectable neutralizing capacity against Omicron variants varies between 173 days (BA.5 variant, 95% PI [142; 200]) and 256 days (BA.1 variant, 95% PI [227; 286]). Our model shows the benefit of incorporating the neutralization capacity in the follow-up of patients to better inform on their level of protection against the different SARS-CoV-2 variants.Trial registration: This clinical trial is registered with ClinicalTrials.gov, Trial IDs NCT04750720 and NCT05315583.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Inclusion and exclusion criteria used for title and abstract, stage 1 and stage 2 screening.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Parameter definitions, values and priors.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains 1000 rows and 8 columns, representing students’ population details and academic performance. The columns include Components like gender, race, Guardian level of education, lunch type, and test preparation course, along with their math, reading, and writing scores. It provides a detailed view of how background and preparation affect student performance.
Gender: 518 females, 482 males
Race/Ethnicity: Group C (319), Group D (262), Group B (190), Group E (140), Group A (89)
Parental Education: Some college (226), Associate’s degree (222), High school (196), Some high school (179), Bachelor’s (118), Master’s (59)
Lunch: Standard (645), Free/Reduced (355)
Test Preparation: None (642), Completed (358)
Math Score: Min 0, Max 100, Mean ≈ 66.1, Median 66
Reading Score: Min 17, Max 100, Mean ≈ 69.2, Median 70
Writing Score: Min 10, Max 100, Mean ≈ 68.1, Median 69