10 datasets found
  1. Dataset for Apriori and FP growth Algorithm

    • kaggle.com
    Updated May 4, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shashank (2020). Dataset for Apriori and FP growth Algorithm [Dataset]. https://www.kaggle.com/newshuntkannada/dataset-for-apriori-and-fp-growth-algorithm/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 4, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Shashank
    Description

    Dataset

    This dataset was created by Shashank

    Contents

  2. I

    Frequent pattern subject transactions from the University of Illinois...

    • databank.illinois.edu
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jim Hahn, Frequent pattern subject transactions from the University of Illinois Library (2016 - 2018) [Dataset]. http://doi.org/10.13012/B2IDB-9440404_V1
    Explore at:
    Authors
    Jim Hahn
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Illinois
    Dataset funded by
    Research and Publications Committee of the University of Illinois Library
    Description

    The data are provided to illustrate methods in evaluating systematic transactional data reuse in machine learning. A library account-based recommender system was developed using machine learning processing over transactional data of 383,828 transactions (or check-outs) sourced from a large multi-unit research library. The machine learning process utilized the FP-growth algorithm over the subject metadata associated with physical items that were checked-out together in the library. The purpose of this research is to evaluate the results of systematic transactional data reuse in machine learning. The analysis herein contains a large-scale network visualization of 180,441 subject association rules and corresponding node metrics.

  3. MOESM1 of Large-scale e-learning recommender system based on Spark and...

    • springernature.figshare.com
    c
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Karim Dahdouh; Ahmed Dakkak; Lahcen Oughdir; Abdelali Ibriz (2023). MOESM1 of Large-scale e-learning recommender system based on Spark and Hadoop [Dataset]. http://doi.org/10.6084/m9.figshare.7564160.v1
    Explore at:
    cAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Karim Dahdouh; Ahmed Dakkak; Lahcen Oughdir; Abdelali Ibriz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 1. Spark application of the e-learning recommender system.

  4. f

    Mined rules by FP-Growth algorithm.

    • plos.figshare.com
    xls
    Updated Jun 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samira Yousefinaghani; Rozita Dara; Zvonimir Poljak; Fei Song; Shayan Sharif (2023). Mined rules by FP-Growth algorithm. [Dataset]. http://doi.org/10.1371/journal.pone.0245116.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 12, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Samira Yousefinaghani; Rozita Dara; Zvonimir Poljak; Fei Song; Shayan Sharif
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mined rules by FP-Growth algorithm.

  5. f

    Co-Existence Features Derived from CICMalDroid 2020

    • figshare.com
    txt
    Updated Aug 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Habtamu Girum (2024). Co-Existence Features Derived from CICMalDroid 2020 [Dataset]. http://doi.org/10.6084/m9.figshare.26852755.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 31, 2024
    Dataset provided by
    figshare
    Authors
    Habtamu Girum
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset presents a collection of co-existence features extracted from the original CICMalDroid 2020 using the FP-Growth algorithm. The co-existence features are combinations of two features that frequently occur together within the dataset.

  6. f

    Additional file 1 of Network analysis of autistic disease comorbidities in...

    • springernature.figshare.com
    xlsx
    Updated Feb 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaojun Li; Guangjian Liu; Wenxiong Chen; Zhisheng Bi; Huiying Liang (2024). Additional file 1 of Network analysis of autistic disease comorbidities in Chinese children based on ICD-10 codes [Dataset]. http://doi.org/10.6084/m9.figshare.13107708.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 5, 2024
    Dataset provided by
    figshare
    Authors
    Xiaojun Li; Guangjian Liu; Wenxiong Chen; Zhisheng Bi; Huiying Liang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 1. Table 1. Disease generalization in ICD-10 codes. Table 2. Comparison among OMOP ID, Concept Code and the generlization ICD-10 codes. Table 3. The rules verified by literatures. Table 4. The rules discoveried by FP-growth algorithm.

  7. f

    DataSheet1_Search strategy and line association analysis of cascading...

    • figshare.com
    pdf
    Updated Nov 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xueting Cheng; Wenxu Liu; Yueshuang Bao; Xinyuan Liu (2023). DataSheet1_Search strategy and line association analysis of cascading failure accident chain in new energy power systems.PDF [Dataset]. http://doi.org/10.3389/fenrg.2023.1283436.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Nov 29, 2023
    Dataset provided by
    Frontiers
    Authors
    Xueting Cheng; Wenxu Liu; Yueshuang Bao; Xinyuan Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    As the penetration rate of new energy in the power system gradually increases and the complexity of cascading faults increases, it is of great significance for the power system to comprehensively explore the chain of cascading faults in the new energy power system and quickly determine the closely related lines in the cascading faults. In response to the lack of consideration in existing research of the changes in the importance of transmission lines after the introduction of new energy, this paper proposes a cascading failure prediction index that integrates the importance and operational status of transmission lines in new energy power systems and applies it to the search for cascading failures in new energy power systems. First, the development characteristics of cascading faults were analyzed, and the main factors influencing cascading faults were identified: the importance of the transmission line and operating status of the new energy power system. Based on these factors, a prediction index for cascading faults was established, and the accident chain was searched using this index. Then, the FP-growth algorithm was used to analyze the lines in the fault chain concentration, and based on the analysis results, the correlation relationship suitable for the cascading failure lines in the new energy power system was determined. Finally, a simulation was conducted on an IEEE 10 machine 39 node system containing new energy wind turbines, and the results verified the effectiveness of the proposed indicators and strategies.

  8. f

    Data_Sheet_1_Effect of the chronic medication use on outcome measures of...

    • frontiersin.figshare.com
    pdf
    Updated Jun 6, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammad-Reza Malekpour; Mohsen Abbasi-Kangevari; Ali Shojaee; Sahar Saeedi Moghaddam; Seyyed-Hadi Ghamari; Mohammad-Mahdi Rashidi; Alireza Namazi Shabestari; Mohammad Effatpanah; Mohammadmehdi Nasehi; Mehdi Rezaei; Farshad Farzadfar (2023). Data_Sheet_1_Effect of the chronic medication use on outcome measures of hospitalized COVID-19 patients: Evidence from big data.PDF [Dataset]. http://doi.org/10.3389/fpubh.2023.1061307.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 6, 2023
    Dataset provided by
    Frontiers
    Authors
    Mohammad-Reza Malekpour; Mohsen Abbasi-Kangevari; Ali Shojaee; Sahar Saeedi Moghaddam; Seyyed-Hadi Ghamari; Mohammad-Mahdi Rashidi; Alireza Namazi Shabestari; Mohammad Effatpanah; Mohammadmehdi Nasehi; Mehdi Rezaei; Farshad Farzadfar
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundConcerns about the role of chronically used medications in the clinical outcomes of the coronavirus disease 2019 (COVID-19) have remarkable potential for the breakdown of non-communicable diseases (NCDs) management by imposing ambivalence toward medication continuation. This study aimed to investigate the association of single or combinations of chronically used medications in NCDs with clinical outcomes of COVID-19.MethodsThis retrospective study was conducted on the intersection of two databases, the Iranian COVID-19 registry and Iran Health Insurance Organization. The primary outcome was death due to COVID-19 hospitalization, and secondary outcomes included length of hospital stay, Intensive Care Unit (ICU) admission, and ventilation therapy. The Anatomical Therapeutic Chemical (ATC) classification system was used for medication grouping. The frequent pattern growth algorithm was utilized to investigate the effect of medication combinations on COVID-19 outcomes.FindingsAspirin with chronic use in 10.8% of hospitalized COVID-19 patients was the most frequently used medication, followed by Atorvastatin (9.2%) and Losartan (8.0%). Adrenergics in combination with corticosteroids inhalants (ACIs) with an odds ratio (OR) of 0.79 (95% confidence interval: 0.68–0.92) were the most associated medications with less chance of ventilation therapy. Oxicams had the least OR of 0.80 (0.73–0.87) for COVID-19 death, followed by ACIs [0.85 (0.77–0.95)] and Biguanides [0.86 (0.82–0.91)].ConclusionThe chronic use of most frequently used medications for NCDs management was not associated with poor COVID-19 outcomes. Thus, when indicated, physicians need to discourage patients with NCDs from discontinuing their medications for fear of possible adverse effects on COVID-19 prognosis.

  9. f

    Table1_The influence of herbal medicine on serum motilin and its effect on...

    • frontiersin.figshare.com
    docx
    Updated Dec 14, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Min-Seok Cho; Jae-Woo Park; Jinsung Kim; Seok-Jae Ko (2023). Table1_The influence of herbal medicine on serum motilin and its effect on human and animal model: a systematic review.DOCX [Dataset]. http://doi.org/10.3389/fphar.2023.1286333.s002
    Explore at:
    docxAvailable download formats
    Dataset updated
    Dec 14, 2023
    Dataset provided by
    Frontiers
    Authors
    Min-Seok Cho; Jae-Woo Park; Jinsung Kim; Seok-Jae Ko
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Introduction: Motilin (MLN) is a gastrointestinal (GI) hormone produced in the upper small intestine. Its most well understood function is to participate in Phase III of the migrating myoelectric complex component of GI motility. Changes in MLN availability are associated with GI diseases such as gastroesophageal reflux disease and functional dyspepsia. Furthermore, herbal medicines have been used for several years to treat various GI disorders. We systematically reviewed clinical and animal studies on how herbal medicine affects the modulation of MLN and subsequently brings the therapeutic effects mainly focused on GI function.Methods: We searched the PubMed, Embase, Cochrane, and Web of Science databases to collect all articles published until 30 July 2023, that reported the measurement of plasma MLN levels in human randomized controlled trials and in vivo herbal medicine studies. The collected characteristics of the articles included the name and ingredients of the herbal medicine, physiological and symptomatic changes after administering the herbal medicine, changes in plasma MLN levels, key findings, and mechanisms of action. The frequency patterns (FPs) of botanical drug use and their correlations were investigated using an FP growth algorithm.Results: Nine clinical studies with 1,308 participants and 20 animal studies were included in the final analyses. Herbal medicines in clinical studies have shown therapeutic effects in association with increased levels of MLN, including GI motility regulation and symptom improvement. Herbal medicines have also shown anti-stress, anti-tumor, and anti-inflammatory effects in vivo. Various biochemical markers may correlate with MLN levels. Markers may have a positive correlation with plasma MLN levels included ghrelin, acetylcholine, and secretin, whereas a negative correlation included triglycerides and prostaglandin E2. Markers, such as gastrin and somatostatin, did not show any correlation with plasma MLN levels. Based on the FP growth algorithm, Glycyrrhiza uralensis and Paeonia japonica were the most frequently used species.Conclusion: Herbal medicine may have therapeutic effects mainly on GI symptoms with involvement of MLN regulation and may be considered as an alternative option for the treatment of GI diseases. Further studies with more solid evidence are needed to confirm the efficacy and mechanisms of action of herbal medicines.Systematic Review Registration:https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=443244, identifier CRD42023443244.

  10. f

    Threshold values of variables for Sudanian Zone.

    • plos.figshare.com
    xls
    Updated Feb 8, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sèton Calmette Ariane Houetohossou; Vinasetan Ratheil Houndji; Rachidatou Sikirou; Romain Glèlè Kakaï (2024). Threshold values of variables for Sudanian Zone. [Dataset]. http://doi.org/10.1371/journal.pone.0297983.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 8, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Sèton Calmette Ariane Houetohossou; Vinasetan Ratheil Houndji; Rachidatou Sikirou; Romain Glèlè Kakaï
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Tomato is one of the most appreciated vegetables in the world. Predicting its yield and optimizing its culture is important for global food security. This paper addresses the challenge of finding optimum climatic values for a high tomato yield. The Frequent Pattern Growth (FPG) algorithm was considered to establish the associations between six climate variables: minimum and maximum temperatures, maximum humidity, sunshine (Sun), rainfall, and evapotranspiration (ET), collected over 26 years in the three agro-ecological Zones of Benin. Monthly climate data were aggregated with yield data over the same period. After aggregation, the data were transformed into ‘low’, ‘medium’, and ‘high’ attributes using the threshold values defined. Then, the rules were generated using the minimum support set to 0.2 and the confidence to 0.8. Only the rules with the consequence ‘high yield’ were screened. The best yield patterns were observed in the Guinean Zone, followed by the Sudanian. The results indicated that high tomato yield was associated with low ET in all areas considered. Minimum and maximum temperatures, maximum humidity, and Sun were medium in every Zone. Moreover, rainfall was high in the Sudanian Zone, unlike the other regions where it remained medium. These results are useful in assessing climate variability’s impact on tomato production. Thus, they can help farmers make informed decisions on cultivation practices to optimize production in a changing environment. In addition, the findings of this study can be considered in other regions and adapted to other crops.

  11. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Shashank (2020). Dataset for Apriori and FP growth Algorithm [Dataset]. https://www.kaggle.com/newshuntkannada/dataset-for-apriori-and-fp-growth-algorithm/code
Organization logo

Dataset for Apriori and FP growth Algorithm

Association rules and Frequent pattern Problems

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 4, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shashank
Description

Dataset

This dataset was created by Shashank

Contents

Search
Clear search
Close search
Google apps
Main menu