17 datasets found

f
Apriori algorithm-based association rules.
plos.figshare.com
bin
Updated Aug 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang (2023). Apriori algorithm-based association rules. [Dataset]. http://doi.org/10.1371/journal.pone.0289749.t001
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0289749.t001
Dataset updated
Aug 8, 2023
Dataset provided by
PLOS ONE
Authors
Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In recent years, the prevalence of T2DM has been increasing annually, in particular, the personal and socioeconomic burden caused by multiple complications has become increasingly serious. This study aimed to screen out the high-risk complication combination of T2DM through various data mining methods, establish and evaluate a risk prediction model of the complication combination in patients with T2DM. Questionnaire surveys, physical examinations, and biochemical tests were conducted on 4,937 patients with T2DM, and 810 cases of sample data with complications were retained. The high-risk complication combination was screened by association rules based on the Apriori algorithm. Risk factors were screened using the LASSO regression model, random forest model, and support vector machine. A risk prediction model was established using logistic regression analysis, and a dynamic nomogram was constructed. Receiver operating characteristic (ROC) curves, harrell’s concordance index (C-Index), calibration curves, decision curve analysis (DCA), and internal validation were used to evaluate the differentiation, calibration, and clinical applicability of the models. This study found that patients with T2DM had a high-risk combination of lower extremity vasculopathy, diabetic foot, and diabetic retinopathy. Based on this, body mass index, diastolic blood pressure, total cholesterol, triglyceride, 2-hour postprandial blood glucose and blood urea nitrogen levels were screened and used for the modeling analysis. The area under the ROC curves of the internal and external validations were 0.768 (95% CI, 0.744−0.792) and 0.745 (95% CI, 0.669−0.820), respectively, and the C-index and AUC value were consistent. The calibration plots showed good calibration, and the risk threshold for DCA was 30–54%. In this study, we developed and evaluated a predictive model for the development of a high-risk complication combination while uncovering the pattern of complications in patients with T2DM. This model has a practical guiding effect on the health management of patients with T2DM in community settings.
MOESM2 of Data mining combined to the multicriteria decision analysis for...
springernature.figshare.com
application/cdfv2
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar (2023). MOESM2 of Data mining combined to the multicriteria decision analysis for the improvement of road safety: case of France [Dataset]. http://doi.org/10.6084/m9.figshare.7660082.v1
Explore at:
application/cdfv2Available download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7660082.v1
Dataset updated
May 30, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
France
Description
Additional file 2. The integral table of transactions T.
MOESM3 of Data mining combined to the multicriteria decision analysis for...
springernature.figshare.com
figshare.com
xls
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar (2023). MOESM3 of Data mining combined to the multicriteria decision analysis for the improvement of road safety: case of France [Dataset]. http://doi.org/10.6084/m9.figshare.7660091.v1
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7660091.v1
Dataset updated
May 30, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
France
Description
Additional file 3. The integral matrix of concordance indices.
f
Data_Sheet_1_Combinations of scalp acupuncture location for the treatment of...
frontiersin.figshare.com
pdf
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yu-Fang Wang; Wei-Yi Chen; Chang-Ti Lee; Yi-Ying Shen; Chou-Chin Lan; Guan-Ting Liu; Chan-Yen Kuo; Mao-Liang Chen; Po-Chun Hsieh (2023). Data_Sheet_1_Combinations of scalp acupuncture location for the treatment of post-stroke hemiparesis: A systematic review and Apriori algorithm-based association rule analysis.pdf [Dataset]. http://doi.org/10.3389/fnins.2022.956854.s001
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fnins.2022.956854.s001
Dataset updated
Jun 2, 2023
Dataset provided by
Frontiers
Authors
Yu-Fang Wang; Wei-Yi Chen; Chang-Ti Lee; Yi-Ying Shen; Chou-Chin Lan; Guan-Ting Liu; Chan-Yen Kuo; Mao-Liang Chen; Po-Chun Hsieh
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BackgroundPost-stroke hemiparesis strongly affects stroke patients’ activities of daily living and health-related quality of life. Scalp acupuncture (SA) is reportedly beneficial for post-stroke hemiparesis. However, there is still no standard of SA for the treatment of post-stroke hemiparesis. Apriori algorithm-based association rule analysis is a kind of “if-then” rule-based machine learning method suitable for investigating the underlying rules of acupuncture point/location selections. This study aimed to investigate the core SA combinations for the treatment of post-stroke hemiparesis by using a systematic review and Apriori algorithm-based association rule analysis.MethodsWe conducted a systematic review to include relevant randomized controlled trial (RCT) studies investigating the effects of SA treatment in treating patients with post-stroke hemiparesis, assessed by the Fugl-Meyer Assessment (FMA) score. We excluded studies using herbal medicine or manual acupuncture.ResultsWe extracted 33 SA locations from the 35 included RCT studies. The following SA styles were noted: International Standard Scalp Acupuncture (ISSA), WHO Standard Acupuncture Point Locations (SAPL), Zhu’s style SA, Jiao’s style SA, and Lin’s style SA. Sixty-one association rules were investigated based on the integrated SA location data.ConclusionsSAPL_GV20 (Baihui), SAPL_GV24 (Shenting), ISSA_MS6_i (ISSA Anterior Oblique Line of Vertex-Temporal, lesion-ipsilateral), ISSA_MS7_i (ISSA Posterior Oblique Line of Vertex-Temporal, lesion-ipsilateral), ISSA_PR (ISSA Parietal region, comprised of ISSA_MS5, ISSA_MS6, ISSA_MS7, ISSA_MS8, and ISSA_MS9), and SAPL_Ex.HN3 (Yintang) can be considered the core SA location combination for the treatment of post-stroke hemiparesis. We recommend a core SA combination for further animal studies, clinical trials, and treatment strategies.
f
The result comparison of the different D.
plos.figshare.com
xls
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The result comparison of the different D. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t002
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0255684.t002
Dataset updated
May 30, 2023
Dataset provided by
PLOS ONE
Authors
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The result comparison of the different D.
f
DataSheet1_Uncovering Modern Clinical Applications of Fuzi and Fuzi-Based...
frontiersin.figshare.com
datasetcatalog.nlm.nih.gov
docx
Updated Jun 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chi-Jung Tai; Mohamed El-Shazly; Yi-Hong Tsai; Dezső Csupor; Judit Hohmann; Yang-Chang Wu; Tzyy-Guey Tseng; Fang-Rong Chang; Hui-Chun Wang (2023). DataSheet1_Uncovering Modern Clinical Applications of Fuzi and Fuzi-Based Formulas: A Nationwide Descriptive Study With Market Basket Analysis.docx [Dataset]. http://doi.org/10.3389/fphar.2021.641530.s001
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fphar.2021.641530.s001
Dataset updated
Jun 10, 2023
Dataset provided by
Frontiers
Authors
Chi-Jung Tai; Mohamed El-Shazly; Yi-Hong Tsai; Dezső Csupor; Judit Hohmann; Yang-Chang Wu; Tzyy-Guey Tseng; Fang-Rong Chang; Hui-Chun Wang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Background: As time evolved, traditional Chinese medicine (TCM) became integrated into the global medical system as complementary treatments. Some essential TCM herbs started to play a limited role in clinical practices because of Western medication development. For example, Fuzi (Aconiti Lateralis Radix Praeparata) is a toxic but indispensable TCM herb. Fuzi was mainly used in poor circulation and life-threatening conditions by history records. However, with various Western medication options for treating critical conditions currently, how is Fuzi used clinically and its indications in modern TCM are unclear. This study aimed to evaluate Fuzi and Fuzi-based formulas in modern clinical practices using artificial intelligence and data mining methods.Methods: This nationwide descriptive study with market basket analysis used a cohort selected from the Taiwan National Health Insurance database that contained one million national representatives between 2003 and 2010 used for our analysis. Descriptive statistics were performed to demonstrate the modern clinical indications of Fuzi. Market basket analysis was calculated by the Apriori algorithm to discover the association rules between Fuzi and other TCM herbs.Results: A total of 104,281 patients using 405,837 prescriptions of Fuzi and Fuzi-based formulas were identified. TCM doctors were found to use Fuzi in pulmonary (21.5%), gastrointestinal (17.3%), and rheumatologic (11.0%) diseases, but not commonly in cardiovascular diseases (7.4%). Long-term users of Fuzi and Fuzi-based formulas often had the following comorbidities diagnosed by Western doctors: osteoarthritis (31.0%), peptic ulcers (29.5%), hypertension (19.9%), and COPD (19.7%). Patients also used concurrent medications such as H2-receptor antagonists, nonsteroidal anti-inflammatory drugs, β-blockers, calcium channel blockers, and aspirin. Through market basket analysis, for the first time, we noticed many practical Fuzi-related herbal pairs such as Fuzi–Hsihsin (Asari Radix et Rhizoma)–Dahuang (Rhei Radix et Rhizoma) for neurologic diseases and headache.Conclusion: For the first time, big data analysis was applied to uncover the modern clinical indications of Fuzi in addition to traditional use. We provided necessary evidence on the scientific use of Fuzi in current TCM practices, and the Fuzi-related herbal pairs discovered in this study are helpful to the development of new botanical drugs.
f
The SAR difference of different confidence degree thresholds in D = 3.
plos.figshare.com
xls
Updated Jun 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The SAR difference of different confidence degree thresholds in D = 3. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t009
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0255684.t009
Dataset updated
Jun 9, 2023
Dataset provided by
PLOS ONE
Authors
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The SAR difference of different confidence degree thresholds in D = 3.
f
DataSheet2_An Interaction-Based Method for Refining Results From Gene Set...
frontiersin.figshare.com
xlsx
Updated Jun 3, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yishen Wang; Yiwen Hong; Shudi Mao; Yukang Jiang; Yamei Cui; Jianying Pan; Yan Luo (2023). DataSheet2_An Interaction-Based Method for Refining Results From Gene Set Enrichment Analysis.xlsx [Dataset]. http://doi.org/10.3389/fgene.2022.890672.s002
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/fgene.2022.890672.s002
Dataset updated
Jun 3, 2023
Dataset provided by
Frontiers
Authors
Yishen Wang; Yiwen Hong; Shudi Mao; Yukang Jiang; Yamei Cui; Jianying Pan; Yan Luo
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Purpose: To demonstrate an interaction-based method for the refinement of Gene Set Enrichment Analysis (GSEA) results.Method: Intravitreal injection of miR-124-3p antagomir was used to knockdown the expression of miR-124-3p in mouse retina at postnatal day 3 (P3). Whole retinal RNA was extracted for mRNA transcriptome sequencing at P9. After preprocessing the dataset, GSEA was performed, and the leading-edge subsets were obtained. The Apriori algorithm was used to identify the frequent genes or gene sets from the union of the leading-edge subsets. A new statistic d was introduced to evaluate the frequent genes or gene sets. Reverse transcription quantitative PCR (RT-qPCR) was performed to validate the expression trend of candidate genes after the knockdown of miR-124-3p.Results: A total of 115,140 assembled transcript sequences were obtained from the clean data. With GSEA, the NOD-like receptor signaling pathway, C-type-like lectin receptor signaling pathway, phagosome, necroptosis, JAK-STAT signaling pathway, Toll-like receptor signaling pathway, leukocyte transendothelial migration, chemokine signaling pathway, NF-kappa B signaling pathway and RIG-I-like signaling pathway were identified as the top 10 enriched pathways, and their leading-edge subsets were obtained. After being refined by the Apriori algorithm and sorted by the value of the modulus of d, Prkcd, Irf9, Stat3, Cxcl12, Stat1, Stat2, Isg15, Eif2ak2, Il6st, Pdgfra, Socs4 and Csf2ra had the significant number of interactions and the greatest value of d to downstream genes among all frequent transactions. Results of RT-qPCR validation for the expression of candidate genes after the knockdown of miR-124-3p showed a similar trend to the RNA-Seq results.Conclusion: This study indicated that using the Apriori algorithm and defining the statistic d was a novel way to refine the GSEA results. We hope to convey the intricacies from the computational results to the low-throughput experiments, and to plan experimental investigations specifically.
f
The discretization results of D = 3.
plos.figshare.com
xls
Updated Jun 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The discretization results of D = 3. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0255684.t003
Dataset updated
Jun 9, 2023
Dataset provided by
PLOS ONE
Authors
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The discretization results of D = 3.
f
The discretization results of D = 4.
plos.figshare.com
xls
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The discretization results of D = 4. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t004
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0255684.t004
Dataset updated
Jun 3, 2023
Dataset provided by
PLOS ONE
Authors
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The discretization results of D = 4.
f
The two-item SAR of D = 3.
plos.figshare.com
xls
Updated Jun 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The two-item SAR of D = 3. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t006
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0255684.t006
Dataset updated
Jun 8, 2023
Dataset provided by
PLOS ONE
Authors
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The two-item SAR of D = 3.
f
The one-item SAR of D = 4.
plos.figshare.com
xls
Updated Jun 9, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The one-item SAR of D = 4. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t007
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0255684.t007
Dataset updated
Jun 9, 2023
Dataset provided by
PLOS ONE
Authors
Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The one-item SAR of D = 4.
f
Table_2_Delayed Comparison and Apriori Algorithm (DCAA): A Tool for...
frontiersin.figshare.com
docx
Updated Jun 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lianhong Ding; Shaoshuai Xie; Shucui Zhang; Hangyu Shen; Huaqiang Zhong; Daoyuan Li; Peng Shi; Lianli Chi; Qunye Zhang (2023). Table_2_Delayed Comparison and Apriori Algorithm (DCAA): A Tool for Discovering Protein–Protein Interactions From Time-Series Phosphoproteomic Data.DOCX [Dataset]. http://doi.org/10.3389/fmolb.2020.606570.s002
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fmolb.2020.606570.s002
Dataset updated
Jun 13, 2023
Dataset provided by
Frontiers
Authors
Lianhong Ding; Shaoshuai Xie; Shucui Zhang; Hangyu Shen; Huaqiang Zhong; Daoyuan Li; Peng Shi; Lianli Chi; Qunye Zhang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of high-throughput omics data is one of the most important approaches for obtaining information regarding interactions between proteins/genes. Time-series omics data are a series of omics data points indexed in time order and normally contain more abundant information about the interactions between biological macromolecules than static omics data. In addition, phosphorylation is a key posttranslational modification (PTM) that is indicative of possible protein function changes in cellular processes. Analysis of time-series phosphoproteomic data should provide more meaningful information about protein interactions. However, although many algorithms, databases, and websites have been developed to analyze omics data, the tools dedicated to discovering molecular interactions from time-series omics data, especially from time-series phosphoproteomic data, are still scarce. Moreover, most reported tools ignore the lag between functional alterations and the corresponding changes in protein synthesis/PTM and are highly dependent on previous knowledge, resulting in high false-positive rates and difficulties in finding newly discovered protein–protein interactions (PPIs). Therefore, in the present study, we developed a new method to discover protein–protein interactions with the delayed comparison and Apriori algorithm (DCAA) to address the aforementioned problems. DCAA is based on the idea that there is a lag between functional alterations and the corresponding changes in protein synthesis/PTM. The Apriori algorithm was used to mine association rules from the relationships between items in a dataset and find PPIs based on time-series phosphoproteomic data. The advantage of DCAA is that it does not rely on previous knowledge and the PPI database. The analysis of actual time-series phosphoproteomic data showed that more than 68% of the protein interactions/regulatory relationships predicted by DCAA were accurate. As an analytical tool for PPIs that does not rely on a priori knowledge, DCAA should be useful to predict PPIs from time-series omics data, and this approach is not limited to phosphoproteomic data.
f
Quantitative data in the database for Example 1.
plos.figshare.com
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xiaohan Liao; Cunjin Xue; Fenzhen Su (2023). Quantitative data in the database for Example 1. [Dataset]. http://doi.org/10.1371/journal.pone.0177438.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0177438.t001
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS ONE
Authors
Xiaohan Liao; Cunjin Xue; Fenzhen Su
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Quantitative data in the database for Example 1.
f
Stunting final dataset.
plos.figshare.com
bin
Updated Jan 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alemu Birara Zemariam; Biruk Beletew Abate; Addis Wondmagegn Alamaw; Eyob shitie Lake; Gizachew Yilak; Mulat Ayele; Befkad Derese Tilahun; Habtamu Setegn Ngusie (2025). Stunting final dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0316452.s001
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0316452.s001
Dataset updated
Jan 24, 2025
Dataset provided by
PLOS ONE
Authors
Alemu Birara Zemariam; Biruk Beletew Abate; Addis Wondmagegn Alamaw; Eyob shitie Lake; Gizachew Yilak; Mulat Ayele; Befkad Derese Tilahun; Habtamu Setegn Ngusie
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BackgroundStunting is a vital indicator of chronic undernutrition that reveals a failure to reach linear growth. Investigating growth and nutrition status during adolescence, in addition to infancy and childhood is very crucial. However, the available studies in Ethiopia have been usually focused in early childhood and they used the traditional stastical methods. Therefore, this study aimed to employ multiple machine learning algorithms to identify the most effective model for the prediction of stunting among adolescent girls in Ethiopia.MethodsA total of 3156 weighted samples of adolescent girls aged 15–19 years were used from the 2016 Ethiopian Demographic and Health Survey dataset. The data was pre-processed, and 80% and 20% of the observations were used for training, and testing the model, respectively. Eight machine learning algorithms were included for consideration of model building and comparison. The performance of the predictive model was evaluated using evaluation metrics value through Python software. The synthetic minority oversampling technique was used for data balancing and Boruta algorithm was used to identify best features. Association rule mining using an Apriori algorithm was employed to generate the best rule for the association between the independent feature and the targeted feature using R software.ResultsThe random forest classifier (sensitivity = 81%, accuracy = 77%, precision = 75%, f1-score = 78%, AUC = 85%) outperformed in predicting stunting compared to other ML algorithms considered in this study. Region, poor wealth index, no formal education, unimproved toilet facility, rural residence, not used contraceptive method, religion, age, no media exposure, occupation, and having one or more children were the top attributes to predict stunting. Association rule mining was identified the top seven best rules that most frequently associated with stunting among adolescent girls in Ethiopia.ConclusionThe random forest classifier outperformed in predicting and identifying the relevant predictors of stunting. Results have shown that machine learning algorithms can accurately predict stunting, making them potentially valuable as decision-support tools for the relevant stakeholders and giving emphasis for the identified predictors could be an important intervention to halt stunting among adolescent girls.
f
An overview of the avocado content (grams avocado per 100 g food) at food...
figshare.com
xls
Updated Jun 6, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Vivienne X. Guan; Elizabeth P. Neale; Yasmine C. Probst (2023). An overview of the avocado content (grams avocado per 100 g food) at food group levelsb'*' in which avocado-containing products were found. [Dataset]. http://doi.org/10.1371/journal.pone.0279567.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0279567.t001
Dataset updated
Jun 6, 2023
Dataset provided by
PLOS ONE
Authors
Vivienne X. Guan; Elizabeth P. Neale; Yasmine C. Probst
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
An overview of the avocado content (grams avocado per 100 g food) at food group levelsb'*' in which avocado-containing products were found.
f
Table_1_Urban–Rural Differences in Patterns and Associated Factors of...
frontiersin.figshare.com
figshare.com
bin
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chichen Zhang; Shujuan Xiao; Lei Shi; Yaqing Xue; Xiao Zheng; Fang Dong; Jiachi Zhang; Benli Xue; Huang Lin; Ping Ouyang (2023). Table_1_Urban–Rural Differences in Patterns and Associated Factors of Multimorbidity Among Older Adults in China: A Cross-Sectional Study Based on Apriori Algorithm and Multinomial Logistic Regression.XLS [Dataset]. http://doi.org/10.3389/fpubh.2021.707062.s001
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.3389/fpubh.2021.707062.s001
Dataset updated
Jun 1, 2023
Dataset provided by
Frontiers
Authors
Chichen Zhang; Shujuan Xiao; Lei Shi; Yaqing Xue; Xiao Zheng; Fang Dong; Jiachi Zhang; Benli Xue; Huang Lin; Ping Ouyang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
China
Description
Introduction: Multimorbidity has become one of the key issues in the public health sector. This study aimed to explore the urban–rural differences in patterns and associated factors of multimorbidity in China and to provide scientific reference for the development of health management strategies to reduce health inequality between urban and rural areas.Methods: A cross-sectional study, which used a multi-stage random sampling method, was conducted effectively among 3,250 participants in the Shanxi province of China. The chi-square test was used to compare the prevalence of chronic diseases among older adults with different demographic characteristics. The Apriori algorithm and multinomial logistic regression were used to explore the patterns and associated factors of multimorbidity among older adults, respectively.Results: The findings showed that 30.3% of older adults reported multimorbidity, with significantly higher proportions in rural areas. Among urban older adults, 10 binary chronic disease combinations with strong association strength were obtained. In addition, 11 binary chronic disease combinations and three ternary chronic disease combinations with strong association strength were obtained among rural older adults. In rural and urban areas, there is a large gap in patterns and factors associated with multimorbidity.Conclusions: Multimorbidity was prevalent among older adults, which patterns mainly consisted of two or three chronic diseases. The patterns and associated factors of multimorbidity varied from urban to rural regions. Expanding the study of urban–rural differences in multimorbidity will help the country formulate more reasonable public health policies to maximize the benefits of medical services for all.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang (2023). Apriori algorithm-based association rules. [Dataset]. http://doi.org/10.1371/journal.pone.0289749.t001

Apriori algorithm-based association rules.

Explore at:

34 scholarly articles cite this dataset (View in Google Scholar)

binAvailable download formats

Unique identifier

https://doi.org/10.1371/journal.pone.0289749.t001

Dataset updated

Aug 8, 2023

Dataset provided by

PLOS ONE

Authors

Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

In recent years, the prevalence of T2DM has been increasing annually, in particular, the personal and socioeconomic burden caused by multiple complications has become increasingly serious. This study aimed to screen out the high-risk complication combination of T2DM through various data mining methods, establish and evaluate a risk prediction model of the complication combination in patients with T2DM. Questionnaire surveys, physical examinations, and biochemical tests were conducted on 4,937 patients with T2DM, and 810 cases of sample data with complications were retained. The high-risk complication combination was screened by association rules based on the Apriori algorithm. Risk factors were screened using the LASSO regression model, random forest model, and support vector machine. A risk prediction model was established using logistic regression analysis, and a dynamic nomogram was constructed. Receiver operating characteristic (ROC) curves, harrell’s concordance index (C-Index), calibration curves, decision curve analysis (DCA), and internal validation were used to evaluate the differentiation, calibration, and clinical applicability of the models. This study found that patients with T2DM had a high-risk combination of lower extremity vasculopathy, diabetic foot, and diabetic retinopathy. Based on this, body mass index, diastolic blood pressure, total cholesterol, triglyceride, 2-hour postprandial blood glucose and blood urea nitrogen levels were screened and used for the modeling analysis. The area under the ROC curves of the internal and external validations were 0.768 (95% CI, 0.744−0.792) and 0.745 (95% CI, 0.669−0.820), respectively, and the C-index and AUC value were consistent. The calibration plots showed good calibration, and the risk threshold for DCA was 30–54%. In this study, we developed and evaluated a predictive model for the development of a high-risk complication combination while uncovering the pattern of complications in patients with T2DM. This model has a practical guiding effect on the health management of patients with T2DM in community settings.

Clear search

Close search

Google apps

Main menu

Apriori algorithm-based association rules.

MOESM2 of Data mining combined to the multicriteria decision analysis for...

MOESM3 of Data mining combined to the multicriteria decision analysis for...

Data_Sheet_1_Combinations of scalp acupuncture location for the treatment of...

The result comparison of the different D.

DataSheet1_Uncovering Modern Clinical Applications of Fuzi and Fuzi-Based...

The SAR difference of different confidence degree thresholds in D = 3.

DataSheet2_An Interaction-Based Method for Refining Results From Gene Set...

The discretization results of D = 3.

The discretization results of D = 4.

The two-item SAR of D = 3.

The one-item SAR of D = 4.

Table_2_Delayed Comparison and Apriori Algorithm (DCAA): A Tool for...

Quantitative data in the database for Example 1.

Stunting final dataset.

An overview of the avocado content (grams avocado per 100 g food) at food...

Table_1_Urban–Rural Differences in Patterns and Associated Factors of...

Apriori algorithm-based association rules.