17 datasets found
  1. f

    Apriori algorithm-based association rules.

    • plos.figshare.com
    bin
    Updated Aug 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang (2023). Apriori algorithm-based association rules. [Dataset]. http://doi.org/10.1371/journal.pone.0289749.t001
    Explore at:
    binAvailable download formats
    Dataset updated
    Aug 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, the prevalence of T2DM has been increasing annually, in particular, the personal and socioeconomic burden caused by multiple complications has become increasingly serious. This study aimed to screen out the high-risk complication combination of T2DM through various data mining methods, establish and evaluate a risk prediction model of the complication combination in patients with T2DM. Questionnaire surveys, physical examinations, and biochemical tests were conducted on 4,937 patients with T2DM, and 810 cases of sample data with complications were retained. The high-risk complication combination was screened by association rules based on the Apriori algorithm. Risk factors were screened using the LASSO regression model, random forest model, and support vector machine. A risk prediction model was established using logistic regression analysis, and a dynamic nomogram was constructed. Receiver operating characteristic (ROC) curves, harrell’s concordance index (C-Index), calibration curves, decision curve analysis (DCA), and internal validation were used to evaluate the differentiation, calibration, and clinical applicability of the models. This study found that patients with T2DM had a high-risk combination of lower extremity vasculopathy, diabetic foot, and diabetic retinopathy. Based on this, body mass index, diastolic blood pressure, total cholesterol, triglyceride, 2-hour postprandial blood glucose and blood urea nitrogen levels were screened and used for the modeling analysis. The area under the ROC curves of the internal and external validations were 0.768 (95% CI, 0.744−0.792) and 0.745 (95% CI, 0.669−0.820), respectively, and the C-index and AUC value were consistent. The calibration plots showed good calibration, and the risk threshold for DCA was 30–54%. In this study, we developed and evaluated a predictive model for the development of a high-risk complication combination while uncovering the pattern of complications in patients with T2DM. This model has a practical guiding effect on the health management of patients with T2DM in community settings.

  2. MOESM2 of Data mining combined to the multicriteria decision analysis for...

    • springernature.figshare.com
    application/cdfv2
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar (2023). MOESM2 of Data mining combined to the multicriteria decision analysis for the improvement of road safety: case of France [Dataset]. http://doi.org/10.6084/m9.figshare.7660082.v1
    Explore at:
    application/cdfv2Available download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    France
    Description

    Additional file 2. The integral table of transactions T.

  3. MOESM3 of Data mining combined to the multicriteria decision analysis for...

    • springernature.figshare.com
    • figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar (2023). MOESM3 of Data mining combined to the multicriteria decision analysis for the improvement of road safety: case of France [Dataset]. http://doi.org/10.6084/m9.figshare.7660091.v1
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Fatima El Mazouri; Mohammed Chaouki Abounaima; Khalid Zenkouar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    France
    Description

    Additional file 3. The integral matrix of concordance indices.

  4. f

    Data_Sheet_1_Combinations of scalp acupuncture location for the treatment of...

    • frontiersin.figshare.com
    pdf
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yu-Fang Wang; Wei-Yi Chen; Chang-Ti Lee; Yi-Ying Shen; Chou-Chin Lan; Guan-Ting Liu; Chan-Yen Kuo; Mao-Liang Chen; Po-Chun Hsieh (2023). Data_Sheet_1_Combinations of scalp acupuncture location for the treatment of post-stroke hemiparesis: A systematic review and Apriori algorithm-based association rule analysis.pdf [Dataset]. http://doi.org/10.3389/fnins.2022.956854.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Frontiers
    Authors
    Yu-Fang Wang; Wei-Yi Chen; Chang-Ti Lee; Yi-Ying Shen; Chou-Chin Lan; Guan-Ting Liu; Chan-Yen Kuo; Mao-Liang Chen; Po-Chun Hsieh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundPost-stroke hemiparesis strongly affects stroke patients’ activities of daily living and health-related quality of life. Scalp acupuncture (SA) is reportedly beneficial for post-stroke hemiparesis. However, there is still no standard of SA for the treatment of post-stroke hemiparesis. Apriori algorithm-based association rule analysis is a kind of “if-then” rule-based machine learning method suitable for investigating the underlying rules of acupuncture point/location selections. This study aimed to investigate the core SA combinations for the treatment of post-stroke hemiparesis by using a systematic review and Apriori algorithm-based association rule analysis.MethodsWe conducted a systematic review to include relevant randomized controlled trial (RCT) studies investigating the effects of SA treatment in treating patients with post-stroke hemiparesis, assessed by the Fugl-Meyer Assessment (FMA) score. We excluded studies using herbal medicine or manual acupuncture.ResultsWe extracted 33 SA locations from the 35 included RCT studies. The following SA styles were noted: International Standard Scalp Acupuncture (ISSA), WHO Standard Acupuncture Point Locations (SAPL), Zhu’s style SA, Jiao’s style SA, and Lin’s style SA. Sixty-one association rules were investigated based on the integrated SA location data.ConclusionsSAPL_GV20 (Baihui), SAPL_GV24 (Shenting), ISSA_MS6_i (ISSA Anterior Oblique Line of Vertex-Temporal, lesion-ipsilateral), ISSA_MS7_i (ISSA Posterior Oblique Line of Vertex-Temporal, lesion-ipsilateral), ISSA_PR (ISSA Parietal region, comprised of ISSA_MS5, ISSA_MS6, ISSA_MS7, ISSA_MS8, and ISSA_MS9), and SAPL_Ex.HN3 (Yintang) can be considered the core SA location combination for the treatment of post-stroke hemiparesis. We recommend a core SA combination for further animal studies, clinical trials, and treatment strategies.

  5. f

    The result comparison of the different D.

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The result comparison of the different D. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The result comparison of the different D.

  6. f

    DataSheet1_Uncovering Modern Clinical Applications of Fuzi and Fuzi-Based...

    • frontiersin.figshare.com
    • datasetcatalog.nlm.nih.gov
    docx
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chi-Jung Tai; Mohamed El-Shazly; Yi-Hong Tsai; Dezső Csupor; Judit Hohmann; Yang-Chang Wu; Tzyy-Guey Tseng; Fang-Rong Chang; Hui-Chun Wang (2023). DataSheet1_Uncovering Modern Clinical Applications of Fuzi and Fuzi-Based Formulas: A Nationwide Descriptive Study With Market Basket Analysis.docx [Dataset]. http://doi.org/10.3389/fphar.2021.641530.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    Frontiers
    Authors
    Chi-Jung Tai; Mohamed El-Shazly; Yi-Hong Tsai; Dezső Csupor; Judit Hohmann; Yang-Chang Wu; Tzyy-Guey Tseng; Fang-Rong Chang; Hui-Chun Wang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Background: As time evolved, traditional Chinese medicine (TCM) became integrated into the global medical system as complementary treatments. Some essential TCM herbs started to play a limited role in clinical practices because of Western medication development. For example, Fuzi (Aconiti Lateralis Radix Praeparata) is a toxic but indispensable TCM herb. Fuzi was mainly used in poor circulation and life-threatening conditions by history records. However, with various Western medication options for treating critical conditions currently, how is Fuzi used clinically and its indications in modern TCM are unclear. This study aimed to evaluate Fuzi and Fuzi-based formulas in modern clinical practices using artificial intelligence and data mining methods.Methods: This nationwide descriptive study with market basket analysis used a cohort selected from the Taiwan National Health Insurance database that contained one million national representatives between 2003 and 2010 used for our analysis. Descriptive statistics were performed to demonstrate the modern clinical indications of Fuzi. Market basket analysis was calculated by the Apriori algorithm to discover the association rules between Fuzi and other TCM herbs.Results: A total of 104,281 patients using 405,837 prescriptions of Fuzi and Fuzi-based formulas were identified. TCM doctors were found to use Fuzi in pulmonary (21.5%), gastrointestinal (17.3%), and rheumatologic (11.0%) diseases, but not commonly in cardiovascular diseases (7.4%). Long-term users of Fuzi and Fuzi-based formulas often had the following comorbidities diagnosed by Western doctors: osteoarthritis (31.0%), peptic ulcers (29.5%), hypertension (19.9%), and COPD (19.7%). Patients also used concurrent medications such as H2-receptor antagonists, nonsteroidal anti-inflammatory drugs, β-blockers, calcium channel blockers, and aspirin. Through market basket analysis, for the first time, we noticed many practical Fuzi-related herbal pairs such as Fuzi–Hsihsin (Asari Radix et Rhizoma)–Dahuang (Rhei Radix et Rhizoma) for neurologic diseases and headache.Conclusion: For the first time, big data analysis was applied to uncover the modern clinical indications of Fuzi in addition to traditional use. We provided necessary evidence on the scientific use of Fuzi in current TCM practices, and the Fuzi-related herbal pairs discovered in this study are helpful to the development of new botanical drugs.

  7. f

    The SAR difference of different confidence degree thresholds in D = 3.

    • plos.figshare.com
    xls
    Updated Jun 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The SAR difference of different confidence degree thresholds in D = 3. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t009
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 9, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The SAR difference of different confidence degree thresholds in D = 3.

  8. f

    DataSheet2_An Interaction-Based Method for Refining Results From Gene Set...

    • frontiersin.figshare.com
    xlsx
    Updated Jun 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yishen Wang; Yiwen Hong; Shudi Mao; Yukang Jiang; Yamei Cui; Jianying Pan; Yan Luo (2023). DataSheet2_An Interaction-Based Method for Refining Results From Gene Set Enrichment Analysis.xlsx [Dataset]. http://doi.org/10.3389/fgene.2022.890672.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Frontiers
    Authors
    Yishen Wang; Yiwen Hong; Shudi Mao; Yukang Jiang; Yamei Cui; Jianying Pan; Yan Luo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Purpose: To demonstrate an interaction-based method for the refinement of Gene Set Enrichment Analysis (GSEA) results.Method: Intravitreal injection of miR-124-3p antagomir was used to knockdown the expression of miR-124-3p in mouse retina at postnatal day 3 (P3). Whole retinal RNA was extracted for mRNA transcriptome sequencing at P9. After preprocessing the dataset, GSEA was performed, and the leading-edge subsets were obtained. The Apriori algorithm was used to identify the frequent genes or gene sets from the union of the leading-edge subsets. A new statistic d was introduced to evaluate the frequent genes or gene sets. Reverse transcription quantitative PCR (RT-qPCR) was performed to validate the expression trend of candidate genes after the knockdown of miR-124-3p.Results: A total of 115,140 assembled transcript sequences were obtained from the clean data. With GSEA, the NOD-like receptor signaling pathway, C-type-like lectin receptor signaling pathway, phagosome, necroptosis, JAK-STAT signaling pathway, Toll-like receptor signaling pathway, leukocyte transendothelial migration, chemokine signaling pathway, NF-kappa B signaling pathway and RIG-I-like signaling pathway were identified as the top 10 enriched pathways, and their leading-edge subsets were obtained. After being refined by the Apriori algorithm and sorted by the value of the modulus of d, Prkcd, Irf9, Stat3, Cxcl12, Stat1, Stat2, Isg15, Eif2ak2, Il6st, Pdgfra, Socs4 and Csf2ra had the significant number of interactions and the greatest value of d to downstream genes among all frequent transactions. Results of RT-qPCR validation for the expression of candidate genes after the knockdown of miR-124-3p showed a similar trend to the RNA-Seq results.Conclusion: This study indicated that using the Apriori algorithm and defining the statistic d was a novel way to refine the GSEA results. We hope to convey the intricacies from the computational results to the low-throughput experiments, and to plan experimental investigations specifically.

  9. f

    The discretization results of D = 3.

    • plos.figshare.com
    xls
    Updated Jun 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The discretization results of D = 3. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 9, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The discretization results of D = 3.

  10. f

    The discretization results of D = 4.

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The discretization results of D = 4. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The discretization results of D = 4.

  11. f

    The two-item SAR of D = 3.

    • plos.figshare.com
    xls
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The two-item SAR of D = 3. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The two-item SAR of D = 3.

  12. f

    The one-item SAR of D = 4.

    • plos.figshare.com
    xls
    Updated Jun 9, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han (2023). The one-item SAR of D = 4. [Dataset]. http://doi.org/10.1371/journal.pone.0255684.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 9, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xin Liu; Xuefeng Sang; Jiaxuan Chang; Yang Zheng; Yuping Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The one-item SAR of D = 4.

  13. f

    Table_2_Delayed Comparison and Apriori Algorithm (DCAA): A Tool for...

    • frontiersin.figshare.com
    docx
    Updated Jun 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lianhong Ding; Shaoshuai Xie; Shucui Zhang; Hangyu Shen; Huaqiang Zhong; Daoyuan Li; Peng Shi; Lianli Chi; Qunye Zhang (2023). Table_2_Delayed Comparison and Apriori Algorithm (DCAA): A Tool for Discovering Protein–Protein Interactions From Time-Series Phosphoproteomic Data.DOCX [Dataset]. http://doi.org/10.3389/fmolb.2020.606570.s002
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 13, 2023
    Dataset provided by
    Frontiers
    Authors
    Lianhong Ding; Shaoshuai Xie; Shucui Zhang; Hangyu Shen; Huaqiang Zhong; Daoyuan Li; Peng Shi; Lianli Chi; Qunye Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of high-throughput omics data is one of the most important approaches for obtaining information regarding interactions between proteins/genes. Time-series omics data are a series of omics data points indexed in time order and normally contain more abundant information about the interactions between biological macromolecules than static omics data. In addition, phosphorylation is a key posttranslational modification (PTM) that is indicative of possible protein function changes in cellular processes. Analysis of time-series phosphoproteomic data should provide more meaningful information about protein interactions. However, although many algorithms, databases, and websites have been developed to analyze omics data, the tools dedicated to discovering molecular interactions from time-series omics data, especially from time-series phosphoproteomic data, are still scarce. Moreover, most reported tools ignore the lag between functional alterations and the corresponding changes in protein synthesis/PTM and are highly dependent on previous knowledge, resulting in high false-positive rates and difficulties in finding newly discovered protein–protein interactions (PPIs). Therefore, in the present study, we developed a new method to discover protein–protein interactions with the delayed comparison and Apriori algorithm (DCAA) to address the aforementioned problems. DCAA is based on the idea that there is a lag between functional alterations and the corresponding changes in protein synthesis/PTM. The Apriori algorithm was used to mine association rules from the relationships between items in a dataset and find PPIs based on time-series phosphoproteomic data. The advantage of DCAA is that it does not rely on previous knowledge and the PPI database. The analysis of actual time-series phosphoproteomic data showed that more than 68% of the protein interactions/regulatory relationships predicted by DCAA were accurate. As an analytical tool for PPIs that does not rely on a priori knowledge, DCAA should be useful to predict PPIs from time-series omics data, and this approach is not limited to phosphoproteomic data.

  14. f

    Quantitative data in the database for Example 1.

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaohan Liao; Cunjin Xue; Fenzhen Su (2023). Quantitative data in the database for Example 1. [Dataset]. http://doi.org/10.1371/journal.pone.0177438.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Xiaohan Liao; Cunjin Xue; Fenzhen Su
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Quantitative data in the database for Example 1.

  15. f

    Stunting final dataset.

    • plos.figshare.com
    bin
    Updated Jan 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alemu Birara Zemariam; Biruk Beletew Abate; Addis Wondmagegn Alamaw; Eyob shitie Lake; Gizachew Yilak; Mulat Ayele; Befkad Derese Tilahun; Habtamu Setegn Ngusie (2025). Stunting final dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0316452.s001
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 24, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Alemu Birara Zemariam; Biruk Beletew Abate; Addis Wondmagegn Alamaw; Eyob shitie Lake; Gizachew Yilak; Mulat Ayele; Befkad Derese Tilahun; Habtamu Setegn Ngusie
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundStunting is a vital indicator of chronic undernutrition that reveals a failure to reach linear growth. Investigating growth and nutrition status during adolescence, in addition to infancy and childhood is very crucial. However, the available studies in Ethiopia have been usually focused in early childhood and they used the traditional stastical methods. Therefore, this study aimed to employ multiple machine learning algorithms to identify the most effective model for the prediction of stunting among adolescent girls in Ethiopia.MethodsA total of 3156 weighted samples of adolescent girls aged 15–19 years were used from the 2016 Ethiopian Demographic and Health Survey dataset. The data was pre-processed, and 80% and 20% of the observations were used for training, and testing the model, respectively. Eight machine learning algorithms were included for consideration of model building and comparison. The performance of the predictive model was evaluated using evaluation metrics value through Python software. The synthetic minority oversampling technique was used for data balancing and Boruta algorithm was used to identify best features. Association rule mining using an Apriori algorithm was employed to generate the best rule for the association between the independent feature and the targeted feature using R software.ResultsThe random forest classifier (sensitivity = 81%, accuracy = 77%, precision = 75%, f1-score = 78%, AUC = 85%) outperformed in predicting stunting compared to other ML algorithms considered in this study. Region, poor wealth index, no formal education, unimproved toilet facility, rural residence, not used contraceptive method, religion, age, no media exposure, occupation, and having one or more children were the top attributes to predict stunting. Association rule mining was identified the top seven best rules that most frequently associated with stunting among adolescent girls in Ethiopia.ConclusionThe random forest classifier outperformed in predicting and identifying the relevant predictors of stunting. Results have shown that machine learning algorithms can accurately predict stunting, making them potentially valuable as decision-support tools for the relevant stakeholders and giving emphasis for the identified predictors could be an important intervention to halt stunting among adolescent girls.

  16. f

    An overview of the avocado content (grams avocado per 100 g food) at food...

    • figshare.com
    xls
    Updated Jun 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vivienne X. Guan; Elizabeth P. Neale; Yasmine C. Probst (2023). An overview of the avocado content (grams avocado per 100 g food) at food group levelsb'*' in which avocado-containing products were found. [Dataset]. http://doi.org/10.1371/journal.pone.0279567.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Vivienne X. Guan; Elizabeth P. Neale; Yasmine C. Probst
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    An overview of the avocado content (grams avocado per 100 g food) at food group levelsb'*' in which avocado-containing products were found.

  17. f

    Table_1_Urban–Rural Differences in Patterns and Associated Factors of...

    • frontiersin.figshare.com
    • figshare.com
    bin
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chichen Zhang; Shujuan Xiao; Lei Shi; Yaqing Xue; Xiao Zheng; Fang Dong; Jiachi Zhang; Benli Xue; Huang Lin; Ping Ouyang (2023). Table_1_Urban–Rural Differences in Patterns and Associated Factors of Multimorbidity Among Older Adults in China: A Cross-Sectional Study Based on Apriori Algorithm and Multinomial Logistic Regression.XLS [Dataset]. http://doi.org/10.3389/fpubh.2021.707062.s001
    Explore at:
    binAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Frontiers
    Authors
    Chichen Zhang; Shujuan Xiao; Lei Shi; Yaqing Xue; Xiao Zheng; Fang Dong; Jiachi Zhang; Benli Xue; Huang Lin; Ping Ouyang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    Introduction: Multimorbidity has become one of the key issues in the public health sector. This study aimed to explore the urban–rural differences in patterns and associated factors of multimorbidity in China and to provide scientific reference for the development of health management strategies to reduce health inequality between urban and rural areas.Methods: A cross-sectional study, which used a multi-stage random sampling method, was conducted effectively among 3,250 participants in the Shanxi province of China. The chi-square test was used to compare the prevalence of chronic diseases among older adults with different demographic characteristics. The Apriori algorithm and multinomial logistic regression were used to explore the patterns and associated factors of multimorbidity among older adults, respectively.Results: The findings showed that 30.3% of older adults reported multimorbidity, with significantly higher proportions in rural areas. Among urban older adults, 10 binary chronic disease combinations with strong association strength were obtained. In addition, 11 binary chronic disease combinations and three ternary chronic disease combinations with strong association strength were obtained among rural older adults. In rural and urban areas, there is a large gap in patterns and factors associated with multimorbidity.Conclusions: Multimorbidity was prevalent among older adults, which patterns mainly consisted of two or three chronic diseases. The patterns and associated factors of multimorbidity varied from urban to rural regions. Expanding the study of urban–rural differences in multimorbidity will help the country formulate more reasonable public health policies to maximize the benefits of medical services for all.

  18. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang (2023). Apriori algorithm-based association rules. [Dataset]. http://doi.org/10.1371/journal.pone.0289749.t001

Apriori algorithm-based association rules.

Related Article
Explore at:
34 scholarly articles cite this dataset (View in Google Scholar)
binAvailable download formats
Dataset updated
Aug 8, 2023
Dataset provided by
PLOS ONE
Authors
Xin Luo; Jijia Sun; Hong Pan; Dian Zhou; Ping Huang; Jingjing Tang; Rong Shi; Hong Ye; Ying Zhao; An Zhang
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

In recent years, the prevalence of T2DM has been increasing annually, in particular, the personal and socioeconomic burden caused by multiple complications has become increasingly serious. This study aimed to screen out the high-risk complication combination of T2DM through various data mining methods, establish and evaluate a risk prediction model of the complication combination in patients with T2DM. Questionnaire surveys, physical examinations, and biochemical tests were conducted on 4,937 patients with T2DM, and 810 cases of sample data with complications were retained. The high-risk complication combination was screened by association rules based on the Apriori algorithm. Risk factors were screened using the LASSO regression model, random forest model, and support vector machine. A risk prediction model was established using logistic regression analysis, and a dynamic nomogram was constructed. Receiver operating characteristic (ROC) curves, harrell’s concordance index (C-Index), calibration curves, decision curve analysis (DCA), and internal validation were used to evaluate the differentiation, calibration, and clinical applicability of the models. This study found that patients with T2DM had a high-risk combination of lower extremity vasculopathy, diabetic foot, and diabetic retinopathy. Based on this, body mass index, diastolic blood pressure, total cholesterol, triglyceride, 2-hour postprandial blood glucose and blood urea nitrogen levels were screened and used for the modeling analysis. The area under the ROC curves of the internal and external validations were 0.768 (95% CI, 0.744−0.792) and 0.745 (95% CI, 0.669−0.820), respectively, and the C-index and AUC value were consistent. The calibration plots showed good calibration, and the risk threshold for DCA was 30–54%. In this study, we developed and evaluated a predictive model for the development of a high-risk complication combination while uncovering the pattern of complications in patients with T2DM. This model has a practical guiding effect on the health management of patients with T2DM in community settings.

Search
Clear search
Close search
Google apps
Main menu