9 datasets found
  1. f

    RTML with insulin dataset.

    • figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). RTML with insulin dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t010
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  2. f

    FHD dataset.

    • figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). FHD dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t009
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  3. f

    PIMA dataset.

    • figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). PIMA dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t008
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  4. f

    Comparative analysis table.

    • figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). Comparative analysis table. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  5. f

    Diabetes health tracer dataset description.

    • figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). Diabetes health tracer dataset description. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  6. Proposed dataset.

    • plos.figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). Proposed dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  7. Data from: Related work.

    • plos.figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). Related work. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  8. Features with number of missing values.

    • plos.figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). Features with number of missing values. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  9. Class distribution.

    • plos.figshare.com
    xls
    Updated Oct 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). Class distribution. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Oct 21, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

  10. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi (2025). RTML with insulin dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0327661.t010

RTML with insulin dataset.

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
Oct 21, 2025
Dataset provided by
PLOS ONE
Authors
Muhammad Noman; Maria Hanif; Abdul Hameed; Muhammad Babar; Basit Qureshi
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Diabetes mellitus presents a significant global health challenge, particularly in regions like Pakistan, India, and Bangladesh. Machine learning (ML) techniques offer promising solutions for diabetes prediction, surpassing traditional methods in reliability and efficiency. This research conducts a comparative analysis of ML algorithms including Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), K-nearest neighbors (KNN), Gradient Boosting (GB), RaSK_GraDe (Proposed Voting), and RaSK_GraDeL (Proposed Stacking). Evaluation is performed using datasets, such as PIMA Indian, Frankfurt Hospitals Diabetes, RTML with Insulin, and the proposed Diabetes Health Tracer (DHT) dataset comprising 2877 observations with nine features. Data pre-processing techniques address missing values, outliers, normalization, and class balancing (SMOTE), enhancing model robustness. Hyperparameter tuning via cross-validation and Random Search optimizes model performance. Additionally, ensemble methods—Voting Classifier (RaSK GraDe) and Stacking Model (RaSK GraDeL with Logistic Regression) are applied, achieving notable accuracies of 98.03% and 98.55%, respectively, on the DHT dataset. The study underscores ML’s potential in diabetes prediction, advocating for personalized treatment and healthcare management advancements.

Search
Clear search
Close search
Google apps
Main menu