The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
True values are η = (0.5, 1.0, 3.5) and p = (0.2, 0.3, 0.5).
Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Parameters are shown separately for the three different datasets (1, 2, pathological gamblers [PG]).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains the prioritization provided by a panel of 15 experts to a set of 28 barriers categories for 8 different roles of the future energy system. A Delphi method was followed and the scores provided in the three rounds carried out are included. The dataset also contains the scripts used to assess the results and the output of this assessment.
A list of the information contained in this file is:
data folder: this folders includes the scores given by the 15 experts in the 3 rounds. Every round is in an individual folder. There is a file per expert that has the scores between -5 (not relevant at all) to 5 (completely relevant) per barrier (rows) and actor (columns). There is also a file with the description of the experts in terms of their position in the company, the type of company and the country.
fig folder: this folder includes the figures created to assess the information provided by the experts. For each round, the following figures are created (in each respective folder):
Boxplot with the distribution of scores per barriers and roles.
Heatmap with the mean scores per barriers and roles.
Boxplots with the comparison of the different distributions provided by the experts of each group (depending on the keywords) per barrier and role.
Heatmap with the mean score per barrier and use case and with the prioritization per barrier and use case.
Finally, bar plots with the mean scores differences between rounds and boxplot with comparisons of the scores distributions are also provided.
stat folder: this folder includes the files with the results of the different statistical assessment carried out. For each round, the following figures are created (in each respective folder):
The statistics used to assess the scores (Intraclass correlation coefficient, Inter-rater agreement, Inter-rater agreement p-value, Homogeneity of Variances, Average interquartile range, Standard Deviation of interquartile ranges, Friedman test p-value Average power post hoc) per barrier and per role.
The results of the post hoc of the Friedman Test per berries and per roles.
The average score per barrier and per role.
The mean value of the scores provided by the experts grouped by the keywords per barrier and role. P-value of the comparison of these two values.
The end prioritization of the barrier for the use case (averaging the scores or merging the critical sets)
Finally, the differences between the mean and standard deviations of the scores between two consecutive rounds are provided.
This dataset includes elevation-based probability and depth statistics for estimating inundation under various sea-level rise and high tide flooding scenarios in and around the National Park Service’s Dry Tortugas National Park. These datasets were developed using digital elevation model (DEM) from National Oceanic and Atmospheric Administration (NOAA). This data release includes results from analyses of two local sea-level rise scenarios for two-time steps — the Intermediate-Low and Intermediate-High for 2050 and 2100 from Sweet and others (2022). Additionally, this data release includes maps of inundation probability under the minor, moderate, and major high tide flooding thresholds defined by NOAA. We estimated the probability of an area being inundated under a given scenario using Monte Carlo simulations with 1,000 iterations. For an individual iteration, each pixel of the DEM was randomly propagated based on the lidar data uncertainty, while the sea-level rise and high tide flooding water level estimates were also propagated based on uncertainty in the sea-level rise estimate (Sweet and others, 2022) and tidal datum transformation, respectively. Moreover, the probability of a pixel being inundated was calculated by summing the binary simulation outputs and dividing by 1,000. Following, probability was binned into the following classes: 1) Unlikely, probability ≤0.33; 2) Likely as not, probability >0.33 and ≤0.66; and 3) Likely, probability >0.66. Finally, depth statistics were only recorded when depth was equal to or greater than 0. We calculated the median depth, 25th percentile, 75th percentile, and interquartile range using all the pixels that met this criterion. When utilizing the depth statistics, it is important to also consider the probability of this pixel being flooded. In other words, the depth layers may show some depth returns, but the pixel may have rarely been inundated for the 1,000 iterations.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Abbreviations: GFR, glomerular filtration rate; MDRD, Modification of Diet in Renal Disease; CKD-EPI, Chronic Kidney Disease Epidemiology Collaboration; CI, confidence interval; IQR, interquartile range.Performance of bias, precision and accuracy between measured GFR and estimated GFR in the validation data set.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Median (interquartile range) of accuracy measures for optimal profile generation methods.
This dataset includes elevation-based probability and depth statistics for estimating inundation under various sea-level rise and high tide flooding scenarios in and around the National Park Service’s San Juan National Historic Site. These datasets were developed using 1-m digital elevation model (DEM) from the 3D Elevation program. This data release includes results from analyses of two local sea-level rise scenarios for two-time steps — the Intermediate-Low and Intermediate-High for 2050 and 2100 from Sweet and others (2022). Additionally, this data release includes maps of inundation probability under the minor, moderate, and major high tide flooding thresholds defined by the National Oceanic and Atmospheric Administration (NOAA). We estimated the probability of an area being inundated under a given scenario using Monte Carlo simulations with 1,000 iterations. For an individual iteration, each pixel of the DEM was randomly propagated based on the lidar data uncertainty, while the sea-level rise and high tide flooding water level estimates were also propagated based on uncertainty in the sea-level rise estimate (Sweet and others, 2022) and tidal datum transformation, respectively. Moreover, the probability of a pixel being inundated was calculated by summing the binary simulation outputs and dividing by 1,000. Following, probability was binned into the following classes: 1) Unlikely, probability ≤0.33; 2) Likely as not, probability >0.33 and ≤0.66; and 3) Likely, probability >0.66. Finally, depth statistics were only recorded when depth was equal to or greater than 0. We calculated the median depth, 25th percentile, 75th percentile, and interquartile range using all the pixels that met this criterion. When utilizing the depth statistics, it is important to also consider the probability of this pixel being flooded. In other words, the depth layers may show some depth returns, but the pixel may have rarely been inundated for the 1,000 iterations.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Median and interquartile range for R2 and l and s parameters from three two-parameter models, fitted to data on group median (i.e., fit to median IP) and individual level in physical effort conditions.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Displays the descriptive statistics, including the minimum, maximum, median and interquartile range (IQR) values, for the variables analyzed.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Estimated normality statistics.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Abbreviations: GFR, glomerular filtration rate; CKD-EPI, Chronic Kidney Disease Epidemiology Collaboration; IQR, interquartile range.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundRheumatic and musculoskeletal disorders (RMDs) are associated with cardiovascular diseases (CVDs), with hypertension being the most common. We aimed to determine the prevalence of high blood pressure (HBP), awareness, treatment, and blood pressure control among patients with RMDs seen in a Rheumatology clinic in Uganda.MethodsWe conducted a cross-sectional study at the Rheumatology Clinic of Mulago National Referral Hospital (MNRH), Kampala, Uganda. Socio-demographic, clinical characteristics and anthropometric data were collected. Multivariable logistic regression was performed using STATA 16 to determine factors associated with HBP in patients with RMDs.ResultsA total of 100 participants were enrolled. Of these, majority were female (84%, n = 84) with mean age of 52.1 (standard deviation: 13.8) years and median body mass index of 28 kg/m2 (interquartile range (IQR): 24.8 kg/m2–32.9 kg/m2). The prevalence of HBP was 61% (n = 61, 95% CI: 51.5–70.5), with the majority (77%, n = 47, 95% CI: 66.5–87.6) being aware they had HTN. The prevalence of HTN was 47% (n = 47, 37.2–56.8), and none had it under control. Factors independently associated with HBP were age 46-55years (adjusted prevalence ratio (aPR): 2.5, 95% confidence interval (CI): 1.06–5.95), 56–65 years (aPR: 2.6, 95% CI: 1.09–6.15), >65 years (aPR: 2.5, 95% CI: 1.02–6.00), obesity (aPR: 3.7, 95% CI: 1.79–7.52), overweight (aPR: 2.7, 95% CI: 1.29–5.77).ConclusionThere was a high burden of HBP among people with RMDs in Uganda with poor blood pressure control, associated with high BMI and increasing age. There is a need for further assessment of the RMD specific drivers of HBP and meticulous follow up of patients with RMDs.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Baseline characteristics of PLHIV enrolled during the two study periods, pre and during UTT.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Guardian’s response to postprocedural questionnaires.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Extensive research work has been done for the estimation of population mean using bivariate auxiliary information based on conventional measures. Conventional measures of the auxiliary variables provide suspicious results in the presence of outliers/extreme values. However, non-conventional measures of the auxiliary variables include quartile deviation, mid-range, inter-quartile range, quartile average, tri-mean, Hodge-Lehmann estimator etc. give efficient results in case of extreme values. Unfortunately, non-conventional measures are not used by survey practitioners to enhance the estimation of unknown population parameters using bivariate auxiliary information. In this article, difference-cum-exponential-type estimators for population mean utilizing bivariate auxiliary information based on non-conventional measures under simple and stratified random sampling schemes have been suggested. Mathematical properties such as bias and mean squared error are derived. To support theoretical findings, various real-life applications are used to confirm the superiority of the suggested estimators as compared to the competing estimators under study.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundHypertension is a risk factor for cardiovascular events. Inflammation plays an important role in the development of essential hypertension. Studies assessing the association between complete blood count-based inflammatory scores (CBCIS) and hypertension are scarce. Therefore, this study aimed to determine the relationship between CBCIS and hypertension among individuals with and without human immunodeficiency virus (HIV).MethodThis was a cross-sectional study among 344 participants at Serenje District Hospital and Serenje Urban Clinic. We used structured questionnaires to collect sociodemographic, clinical and laboratory characteristics. CBCIS included lymphocyte-monocyte ratio (LMR), neutrophil-lymphocyte ratio (NLR), platelet-lymphocyte ratio (PLR), derived neutrophil-lymphocyte ratio (d-NLR), and differential white blood cells. The primary outcome variable was hypertension defined as systolic and diastolic blood pressure higher than or equal to 140/90 mmHg. Logistic regression was used to estimate the association between hypertension and CBCIS in statistical package for social science (SPSS) version 22.0.ResultsThe participants had a median age of 32 years (interquartile range (IQR) 24–42) and 65.1% (n = 224) were female. The prevalence of hypertension was 10.5% (n = 36). Among those with hypertension, 55.6% (n = 20) were female and 44.4% (n = 16) were male. The CBCIS significantly associated with hypertension in people living with HIV (PLWH) was PLR (adjusted odds ratio (AOR) 0.98; 95% confidence interval (CI) 0.97–0.99, p = 0.01) while in people without HIV, AMC (AOR 15.40 95%CI 3.75–63.26), ANC (AOR 1.88 95%CI 1.05–3.36), WBC (AOR 0.52 95%CI 0.31–0.87) and PLR (AOR 0.98 95%CI 0.97–0.99) were the factors associated with hypertension. Compared to people without HIV, only WBC, ANC, NLR, and d-NLR were good predictors of hypertension among PLWH.ConclusionOur study indicates a notable HIV-status driven association between CBCIS and hypertension, suggesting the use of CBICS as potential biomarkers for hypertension risk with substantial implications for early detection and preventive measures.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
The Precipitation Estimation from Remotely Sensed Information using an Artificial Neural Network-Climate Data Record (PERSIANN-CDR) is a satellite-based precipitation dataset for hydrological and climate studies, spanning from 1983 to present. It is the longest satellite-based precipitation record available, with daily data at 0.25° resolution for the 60°S–60°N latitude band.PERSIANN rain rate estimates are generated at 0.25° resolution and calibrated to a monthly merged in-situ and satellite product from the Global Precipitation Climatology Project (GPCP). The model uses Gridded Satellite (GridSat-B1) infrared data at 3-hourly time steps, with the raw output (PERSIANN-B1) bias-corrected and accumulated to produce the daily PERSIANN-CDR.The maps show 31 years (1984–2014) of annual and seasonal median and interquartile range (IQR) data. The median represents the 50th percentile of precipitation, and the IQR reflects the range between the 75th and 25th percentiles, showing data variability. Median and IQR are preferred over mean and standard deviation as they are less influenced by extreme values and better represent non-normally distributed data, such as precipitation, which is skewed and zero-limited.Data and Metadata: NCEIThis is a component of the Gulf Data Atlas (V1.0) for the Physical topic area.