4 datasets found
  1. f

    Pseudocode for machine learning models.

    • figshare.com
    xls
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang (2025). Pseudocode for machine learning models. [Dataset]. http://doi.org/10.1371/journal.pone.0310410.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    More than 90% of deaths due to breast cancer (BC) are due to metastasis-related complications, with invasive ductal carcinoma (IDC) of the breast being the most common pathologic type of breast cancer and highly susceptible to metastasis to distant organs. BC patients who develop cancer metastases are more likely to have a poor prognosis and poor quality of life, so it is extremely important to recognize and diagnose whether distant metastases have occurred in IDC as early as possible. In this study, we develop a non-invasive breast cancer classification system for detecting cancer metastasis. We used Anaconda-Jupyter notebooks to develop various Python programming modules for text mining, data processing, and machine learning (ML) methods. A risk prediction model was constructed based on four algorithms: Random Forest, XGBoost, Logistic Regression, and SVM. Additionally, we developed a hybrid model based on a voting mechanism using these four algorithms as the base models. The models were compared and evaluated by the following metrics: accuracy, precision, recall, F1-score, and area under the ROC curve (AUC) values. The experimental results show that the hybrid model based on the voting mechanism exhibits the best prediction performance (accuracy: 0.867, precision: 0.929, recall: 0.805, F1-score: 0.856, AUC: 0.94). This stable risk prediction model provides a valuable reference support for doctors in assessing and diagnosing the risk of IDC hematogenous metastasis. It also improves the work efficiency of doctors and strives to provide patients with increased chances of survival.

  2. f

    List of abbreviations.

    • figshare.com
    xls
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang (2025). List of abbreviations. [Dataset]. http://doi.org/10.1371/journal.pone.0310410.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    More than 90% of deaths due to breast cancer (BC) are due to metastasis-related complications, with invasive ductal carcinoma (IDC) of the breast being the most common pathologic type of breast cancer and highly susceptible to metastasis to distant organs. BC patients who develop cancer metastases are more likely to have a poor prognosis and poor quality of life, so it is extremely important to recognize and diagnose whether distant metastases have occurred in IDC as early as possible. In this study, we develop a non-invasive breast cancer classification system for detecting cancer metastasis. We used Anaconda-Jupyter notebooks to develop various Python programming modules for text mining, data processing, and machine learning (ML) methods. A risk prediction model was constructed based on four algorithms: Random Forest, XGBoost, Logistic Regression, and SVM. Additionally, we developed a hybrid model based on a voting mechanism using these four algorithms as the base models. The models were compared and evaluated by the following metrics: accuracy, precision, recall, F1-score, and area under the ROC curve (AUC) values. The experimental results show that the hybrid model based on the voting mechanism exhibits the best prediction performance (accuracy: 0.867, precision: 0.929, recall: 0.805, F1-score: 0.856, AUC: 0.94). This stable risk prediction model provides a valuable reference support for doctors in assessing and diagnosing the risk of IDC hematogenous metastasis. It also improves the work efficiency of doctors and strives to provide patients with increased chances of survival.

  3. f

    Base learner parameters.

    • figshare.com
    xls
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang (2025). Base learner parameters. [Dataset]. http://doi.org/10.1371/journal.pone.0310410.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    More than 90% of deaths due to breast cancer (BC) are due to metastasis-related complications, with invasive ductal carcinoma (IDC) of the breast being the most common pathologic type of breast cancer and highly susceptible to metastasis to distant organs. BC patients who develop cancer metastases are more likely to have a poor prognosis and poor quality of life, so it is extremely important to recognize and diagnose whether distant metastases have occurred in IDC as early as possible. In this study, we develop a non-invasive breast cancer classification system for detecting cancer metastasis. We used Anaconda-Jupyter notebooks to develop various Python programming modules for text mining, data processing, and machine learning (ML) methods. A risk prediction model was constructed based on four algorithms: Random Forest, XGBoost, Logistic Regression, and SVM. Additionally, we developed a hybrid model based on a voting mechanism using these four algorithms as the base models. The models were compared and evaluated by the following metrics: accuracy, precision, recall, F1-score, and area under the ROC curve (AUC) values. The experimental results show that the hybrid model based on the voting mechanism exhibits the best prediction performance (accuracy: 0.867, precision: 0.929, recall: 0.805, F1-score: 0.856, AUC: 0.94). This stable risk prediction model provides a valuable reference support for doctors in assessing and diagnosing the risk of IDC hematogenous metastasis. It also improves the work efficiency of doctors and strives to provide patients with increased chances of survival.

  4. f

    Experimental environment configuration table.

    • figshare.com
    xls
    Updated Feb 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang (2025). Experimental environment configuration table. [Dataset]. http://doi.org/10.1371/journal.pone.0310410.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    More than 90% of deaths due to breast cancer (BC) are due to metastasis-related complications, with invasive ductal carcinoma (IDC) of the breast being the most common pathologic type of breast cancer and highly susceptible to metastasis to distant organs. BC patients who develop cancer metastases are more likely to have a poor prognosis and poor quality of life, so it is extremely important to recognize and diagnose whether distant metastases have occurred in IDC as early as possible. In this study, we develop a non-invasive breast cancer classification system for detecting cancer metastasis. We used Anaconda-Jupyter notebooks to develop various Python programming modules for text mining, data processing, and machine learning (ML) methods. A risk prediction model was constructed based on four algorithms: Random Forest, XGBoost, Logistic Regression, and SVM. Additionally, we developed a hybrid model based on a voting mechanism using these four algorithms as the base models. The models were compared and evaluated by the following metrics: accuracy, precision, recall, F1-score, and area under the ROC curve (AUC) values. The experimental results show that the hybrid model based on the voting mechanism exhibits the best prediction performance (accuracy: 0.867, precision: 0.929, recall: 0.805, F1-score: 0.856, AUC: 0.94). This stable risk prediction model provides a valuable reference support for doctors in assessing and diagnosing the risk of IDC hematogenous metastasis. It also improves the work efficiency of doctors and strives to provide patients with increased chances of survival.

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang (2025). Pseudocode for machine learning models. [Dataset]. http://doi.org/10.1371/journal.pone.0310410.t005

Pseudocode for machine learning models.

Related Article
Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
xlsAvailable download formats
Dataset updated
Feb 26, 2025
Dataset provided by
PLOS ONE
Authors
Jingru Dong; Ruijiao Lei; Feiyang Ma; Lu Yu; Lanlan Wang; Shangzhi Xu; Yunhua Hu; Jialin Sun; Wenwen Zhang; Haixia Wang; Li Zhang
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

More than 90% of deaths due to breast cancer (BC) are due to metastasis-related complications, with invasive ductal carcinoma (IDC) of the breast being the most common pathologic type of breast cancer and highly susceptible to metastasis to distant organs. BC patients who develop cancer metastases are more likely to have a poor prognosis and poor quality of life, so it is extremely important to recognize and diagnose whether distant metastases have occurred in IDC as early as possible. In this study, we develop a non-invasive breast cancer classification system for detecting cancer metastasis. We used Anaconda-Jupyter notebooks to develop various Python programming modules for text mining, data processing, and machine learning (ML) methods. A risk prediction model was constructed based on four algorithms: Random Forest, XGBoost, Logistic Regression, and SVM. Additionally, we developed a hybrid model based on a voting mechanism using these four algorithms as the base models. The models were compared and evaluated by the following metrics: accuracy, precision, recall, F1-score, and area under the ROC curve (AUC) values. The experimental results show that the hybrid model based on the voting mechanism exhibits the best prediction performance (accuracy: 0.867, precision: 0.929, recall: 0.805, F1-score: 0.856, AUC: 0.94). This stable risk prediction model provides a valuable reference support for doctors in assessing and diagnosing the risk of IDC hematogenous metastasis. It also improves the work efficiency of doctors and strives to provide patients with increased chances of survival.

Search
Clear search
Close search
Google apps
Main menu