44 ensembles de données trouvés
  1. Breast Cancer Wisconsin - benign or malignant

    • kaggle.com
    Dernière mise à jour : Jul 20, 2020
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Ninja Coding (2020). Breast Cancer Wisconsin - benign or malignant [Dataset]. https://www.kaggle.com/datasets/ninjacoding/breast-cancer-wisconsin-benign-or-malignant
    Découvrir sur :
    CroissantCroissant est un format pour les ensembles de données de machine learning. Pour en savoir plus, consultez mlcommons.org/croissant.
    Ensemble de données mis à jour
    Jul 20, 2020
    Ensemble de données fourni par
    Kagglehttp://kaggle.com/
    Auteurs
    Ninja Coding
    Description

    Context

    It is quite common to find ML-based applications embedded with real-time patient data available from different healthcare systems in multiple countries, thereby increasing the efficacy of new treatment options which were unavailable before. This data set is all about predicting whether the cancer cells are benign or malignant.

    Content

    Information about attributes:

    There are total 10 attributes(int)- Sample code number: id number Clump Thickness: 1 - 10 Uniformity of Cell Size: 1 - 10 Uniformity of Cell Shape: 1 - 10 Marginal Adhesion: 1 - 10 Single Epithelial Cell Size: 1 - 10 Bare Nuclei: 1 - 10 Bland Chromatin: 1 - 10 Normal Nucleoli: 1 - 10 Mitoses: 1 - 10 Predicted class: 2 for benign and 4 for malignant

    Acknowledgements

    This data set(Original Wisconsin Breast Cancer Database) is taken from UCI Machine Learning Repository.

    Inspiration

    This is the first ever data set I am sharing in Kaggle. It would be a great pleasure if you find this data set useful to develop your own model. Hope this simple data set will help beginners to develop their own models for classification and learn how to make their model even better.

  2. d

    Breast Cancer WI (Diagnostic)

    • data.world
    csv, zip
    Dernière mise à jour : May 25, 2024
    + d'autres versions
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    UCI (2024). Breast Cancer WI (Diagnostic) [Dataset]. https://data.world/uci/breast-cancer-wi-diagnostic
    Découvrir sur :
    csv, zipFormats de téléchargement disponibles
    Ensemble de données mis à jour
    May 25, 2024
    Ensemble de données fourni par
    data.world, Inc.
    Auteurs
    UCI
    Zone géographique couverte
    Wisconsin
    Description

    Source:

    Creators:

    1. Dr. William H. Wolberg, General Surgery Dept. University of Wisconsin, Clinical Sciences Center Madison, WI 53792 wolberg '@' eagle.surgery.wisc.edu

    2. W. Nick Street, Computer Sciences Dept. University of Wisconsin, 1210 West Dayton St., Madison, WI 53706 street '@' cs.wisc.edu 608-262-6619

    3. Olvi L. Mangasarian, Computer Sciences Dept. University of Wisconsin, 1210 West Dayton St., Madison, WI 53706 olvi '@' cs.wisc.edu

    Donor:

    Nick Street

    Data Set Information:

    Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at [Web Link]

    Separating plane described above was obtained using Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree Construction Via Linear Programming." Proceedings of the 4th Midwest Artificial Intelligence and Cognitive Science Society, pp. 97-101, 1992], a classification method which uses linear programming to construct a decision tree. Relevant features were selected using an exhaustive search in the space of 1-4 features and 1-3 separating planes.

    The actual linear program used to obtain the separating plane in the 3-dimensional space is that described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34].

    This database is also available through the UW CS ftp server: ftp ftp.cs.wisc.edu cd math-prog/cpo-dataset/machine-learn/WDBC/

    Attribute Information:

    1) ID number 2) Diagnosis (M = malignant, B = benign) 3-32)

    Ten real-valued features are computed for each cell nucleus:

    a) radius (mean of distances from center to points on the perimeter) b) texture (standard deviation of gray-scale values) c) perimeter d) area e) smoothness (local variation in radius lengths) f) compactness (perimeter^2 / area - 1.0) g) concavity (severity of concave portions of the contour) h) concave points (number of concave portions of the contour) i) symmetry j) fractal dimension ("coastline approximation" - 1)

    For more information about **Relevant Papers and Papers That Cite This Data Set, please go here

    Citation Policy:

    If you publish material based on databases obtained from this repository, then, in your acknowledgements, please note the assistance you received by using this repository. This will help others to obtain the same data sets and replicate your experiments. We suggest the following pseudo-APA reference format for referring to this repository:

    Lichman, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

    Here is a BiBTeX citation as well:

    @misc{Lichman:2013 ,
    author = "M. Lichman",
    year = "2013",
    title = "{UCI} Machine Learning Repository",
    url = "http://archive.ics.uci.edu/ml",
    institution = "University of California, Irvine, School of Information and Computer Sciences" }

    A few UCI data sets have additional citation requests. These requests can be found on the bottom of each data set's web page.

    Source: UCI Machine Learning Repository

  3. Wisconsin Breast Cancer Dataset

    • kaggle.com
    Dernière mise à jour : Jun 18, 2018
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Ana M. (2018). Wisconsin Breast Cancer Dataset [Dataset]. https://www.kaggle.com/datasets/anacoder1/wisc-bc-data
    Découvrir sur :
    CroissantCroissant est un format pour les ensembles de données de machine learning. Pour en savoir plus, consultez mlcommons.org/croissant.
    Ensemble de données mis à jour
    Jun 18, 2018
    Ensemble de données fourni par
    Kagglehttp://kaggle.com/
    Auteurs
    Ana M.
    Zone géographique couverte
    Wisconsin
    Description

    This data was donated by researchers of the University of Wisconsin and includes the measurements from digitized images of fine-needle aspirate of a breast mass.

    You can find the dataset at https://github.com/dataspelunking/MLwR/blob/master/Machine%20Learning%20with%20R%20(2nd%20Ed.)/Chapter%2003/wisc_bc_data.csv.

    The breast cancer data includes 569 examples of cancer biopsies, each with 32 features. One feature is an identification number, another is the cancer diagnosis and 30 are numeric-valued laboratory measurements. The diagnosis is coded as "M" to indicate malignant or "B" to indicate benign.

    The other 30 numeric measurements comprise the mean, standard error and worst (i.e. largest) value for 10 different characteristics of the digitized cell nuclei, which are as follows:-

    • Radius
    • Texture
    • Perimeter
    • Area
    • Smoothness
    • Compactness
    • Concavity
    • Concave Points
    • Symmetry
    • Fractal dimension
  4. d

    Breast Cancer Wisconsin (Original)

    • data.world
    csv, zip
    Dernière mise à jour : Jun 2, 2024
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    UCI (2024). Breast Cancer Wisconsin (Original) [Dataset]. https://data.world/uci/breast-cancer-wisconsin-original
    Découvrir sur :
    zip, csvFormats de téléchargement disponibles
    Ensemble de données mis à jour
    Jun 2, 2024
    Ensemble de données fourni par
    data.world, Inc.
    Auteurs
    UCI
    Zone géographique couverte
    Wisconsin
    Description

    Source:

    Creator:
    Dr. WIlliam H. Wolberg (physician)
    University of Wisconsin Hospitals
    Madison, Wisconsin, USA

    Donor:
    Olvi Mangasarian (mangasarian '@' cs.wisc.edu)
    Received by David W. Aha (aha '@' cs.jhu.edu)

    Data Set Information:

    Samples arrive periodically as Dr. Wolberg reports his clinical cases. The database therefore reflects this chronological grouping of the data. This grouping information appears immediately below, having been removed from the data itself:
    Group 1: 367 instances (January 1989)
    Group 2: 70 instances (October 1989)
    Group 3: 31 instances (February 1990)
    Group 4: 17 instances (April 1990)
    Group 5: 48 instances (August 1990)
    Group 6: 49 instances (Updated January 1991)
    Group 7: 31 instances (June 1991)

    Group 8: 86 instances (November 1991)

    Total: 699 points (as of the donated datbase on 15 July 1992)

    Note that the results summarized above in Past Usage refer to a dataset of size 369, while Group 1 has only 367 instances. This is because it originally contained 369 instances; 2 were removed. The following statements summarizes changes to the original Group 1's set of data:

    Group 1 : 367 points: 200B 167M (January 1989)
    Revised Jan 10, 1991: Replaced zero bare nuclei in 1080185 & 1187805
    Revised Nov 22,1991: Removed 765878,4,5,9,7,10,10,10,3,8,1 no record
    : Removed 484201,2,7,8,8,4,3,10,3,4,1 zero epithelial
    : Changed 0 to 1 in field 6 of sample 1219406
    : Changed 0 to 1 in field 8 of following sample:
    : 1182404,2,3,1,1,1,2,0,1,1,1

    Attribute Information:

    1. Sample code number: id number
    2. Clump Thickness: 1 - 10
    3. Uniformity of Cell Size: 1 - 10
    4. Uniformity of Cell Shape: 1 - 10
    5. Marginal Adhesion: 1 - 10
    6. Single Epithelial Cell Size: 1 - 10
    7. Bare Nuclei: 1 - 10
    8. Bland Chromatin: 1 - 10
    9. Normal Nucleoli: 1 - 10
    10. Mitoses: 1 - 10
    11. Class: (2 for benign, 4 for malignant)

    Relevant Papers:

    Wolberg, W.H., & Mangasarian, O.L. (1990). Multisurface method of pattern separation for medical diagnosis applied to breast cytology. In Proceedings of the National Academy of Sciences, 87, 9193*9196.

    Zhang, J. (1992). Selecting typical instances in instance-based learning. In Proceedings of the Ninth International Machine Learning Conference (pp. 470*479). Aberdeen, Scotland: Morgan Kaufmann.

    Papers That Cite This Data Set:

    • Gavin Brown. Diversity in Neural Network Ensembles. The University of Birmingham. 2004.
    • Krzysztof Grabczewski and Wl/odzisl/aw Duch. Heterogeneous Forests of Decision Trees. ICANN. 2002.
    • András Antos and Balázs Kégl and Tamás Linder and Gábor Lugosi. Data-dependent margin-based generalization bounds for classification. Journal of Machine Learning Research, 3. 2002.
    • Kristin P. Bennett and Ayhan Demiriz and Richard Maclin. Exploiting unlabeled data in ensemble methods. KDD. 2002.
    • Hussein A. Abbass. An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine, 25. 2002.
    • Baback Moghaddam and Gregory Shakhnarovich. Boosted Dyadic Kernel Discriminants. NIPS. 2002.
    • Robert Burbidge and Matthew Trotter and Bernard F. Buxton and Sean B. Holden. STAR - Sparsity through Automated Rejection. IWANN (1). 2001.
    • Nikunj C. Oza and Stuart J. Russell. Experimental comparisons of online and batch versions of bagging and boosting. KDD. 2001.
    • Yuh-Jeng Lee. Smooth Support Vector Machines. Preliminary Thesis Proposal Computer Sciences Department University of Wisconsin. 2000.
    • Justin Bradley and Kristin P. Bennett and Bennett A. Demiriz. Constrained K-Means Clustering. Microsoft Research Dept. of Mathematical Sciences One Microsoft Way Dept. of Decision Sciences and Eng. Sys. 2000.
    • Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Improved Generalization Through Explicit Optimization of Margins. Machine Learning, 38. 2000.
    • P. S and Bradley K. P and Bennett A. Demiriz. Constrained K-Means Clustering. Microsoft Research Dept. of Mathematical Sciences One Microsoft Way Dept. of Decision Sciences and Eng. Sys. 2000.
    • Endre Boros and Peter Hammer and Toshihide Ibaraki and Alexander Kogan and Eddy Mayoraz and Ilya B. Muchnik. An Implementation of Logical Analysis of Data. IEEE Trans. Knowl. Data Eng, 12. 2000.
    • Chun-Nan Hsu and Hilmar Schuschel and Ya-Ting Yang. The ANNIGMA-Wrapper Approach to Neural Nets Feature Selection for Knowledge Discovery and Data Mining. Institute of Information Science. 1999.
    • Huan Liu and Hiroshi Motoda and Manoranjan Dash. A Monotonic Measure for Optimal Feature Selection. ECML. 1998.
    • Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Direct Optimization of Margins Improves Generalization in Combined Classifiers. NIPS. 1998.
    • W. Nick Street. A Neural Network Model for Prognostic Prediction. ICML. 1998.
    • Yk Huhtala and Juha Kärkkäinen and Pasi Porkka and Hannu Toivonen. Efficient Discovery of Functional and Approximate Dependencies Using Partitions. ICDE. 1998.
    • . Prototype Selection for Composite Nearest Neighbor Classifiers. Department of Computer Science University of Massachusetts. 1997.
    • Kristin P. Bennett and Erin J. Bredensteiner. A Parametric Optimization Method for Machine Learning. INFORMS Journal on Computing, 9. 1997.
    • Rudy Setiono and Huan Liu. NeuroLinear: From neural networks to oblique decision rules. Neurocomputing, 17. 1997.
    • Erin J. Bredensteiner and Kristin P. Bennett. Feature Minimization within Decision Trees. National Science Foundation. 1996.
    • Ismail Taha and Joydeep Ghosh. Characterization of the Wisconsin Breast cancer Database Using a Hybrid Symbolic-Connectionist System. Proceedings of ANNIE. 1996.
    • Jennifer A. Blue and Kristin P. Bennett. Hybrid Extreme Point Tabu Search. Department of Mathematical Sciences Rensselaer Polytechnic Institute. 1996.
    • Geoffrey I. Webb. OPUS: An Efficient Admissible Algorithm for Unordered Search. J. Artif. Intell. Res. (JAIR, 3. 1995.
    • Wl odzisl/aw Duch and Rudy Setiono and Jacek M. Zurada. Computational intelligence methods for rule-based data understanding.
    • Rafael S. Parpinelli and Heitor S. Lopes and Alex Alves Freitas. An Ant Colony Based System for Data Mining: Applications to Medical Data. CEFET-PR, CPGEI Av. Sete de Setembro, 3165.
    • Wl/odzisl/aw Duch and Rafal/ Adamczak Email:duchraad@phys. uni. torun. pl. Statistical methods for construction of neural networks. Department of Computer Methods, Nicholas Copernicus University.
    • Rafael S. Parpinelli and Heitor S. Lopes and Alex Alves Freitas. PART FOUR: ANT COLONY OPTIMIZATION AND IMMUNE SYSTEMS Chapter X An Ant Colony Algorithm for Classification Rule Discovery. CEFET-PR, Curitiba.
    • Adam H. Cannon and Lenore J. Cowen and Carey E. Priebe. Approximate Distance Classification. Department of Mathematical Sciences The Johns Hopkins University.
    • Andrew I. Schein and Lyle H. Ungar. A-Optimality for Active Learning of Logistic Regression Classifiers. Department of Computer and Information Science Levine Hall.
    • Bart Baesens and Stijn Viaene and Tony Van Gestel and J. A. K Suykens and Guido Dedene and Bart De Moor and Jan Vanthienen and Katholieke Universiteit Leuven. An Empirical Assessment of Kernel Type Performance for Least Squares Support Vector Machine Classifiers. Dept. Applied Economic Sciences.
    • Adil M. Bagirov and Alex Rubinov and A. N. Soukhojak and John Yearwood. Unsupervised and supervised data classification via nonsmooth and global optimization. School of Information Technology and Mathematical Sciences, The University of Ballarat.
    • Rudy Setiono and Huan Liu. Neural-Network Feature Selector. Department of Information Systems and Computer Science National University of Singapore.
    • Huan Liu. A Family of Efficient Rule Generators. Department of Information Systems and Computer Science National University of Singapore.
    • Rudy Setiono. Extracting M-of-N Rules from Trained Neural Networks. School of Computing National University of Singapore.
    • Jarkko Salojarvi and Samuel Kaski and Janne Sinkkonen. Discriminative clustering in Fisher metrics. Neural Networks Research Centre Helsinki University of Technology.
    • Wl odzisl and Rafal Adamczak and Krzysztof Grabczewski and Grzegorz Zal. A hybrid method for extraction of logical rules from data. Department of Computer Methods, Nicholas Copernicus University.
    • Charles Campbell and Nello Cristianini. Simple Learning Algorithms for Training Support Vector Machines. Dept. of Engineering Mathematics.
    • Chotirat Ann and Dimitrios Gunopulos. Scaling up the Naive Bayesian Classifier: Using Decision Trees for Feature Selection. Computer Science Department University of California.

    Citation Request:

    This breast cancer databases was obtained from the University of Wisconsin Hospitals, Madison from Dr. William H. Wolberg. If you publish results when using this database, then please include this information in your acknowledgements. Also, please cite one or more of:1. O. L. Mangasarian and W. H. Wolberg: "Cancer diagnosis via linear programming", SIAM News, Volume 23, Number 5, September 1990, pp 1 & 18.2. William H. Wolberg and O.L. Mangasarian: "Multisurface method of pattern separation for medical diagnosis applied to breast cytology", Proceedings of the National Academy of Sciences, U.S.A., Volume 87, December 1990, pp 9193-9196.3. O. L. Mangasarian, R. Setiono, and W.H. Wolberg: "Pattern recognition via linear programming: Theory and application to medical diagnosis", in: "Large-scale numerical optimization", Thomas F. Coleman and Yuying Li, editors, SIAM Publications, Philadelphia 1990, pp 22-30.4. K. P. Bennett & O. L. Mangasarian: "Robust linear programming discrimination of two linearly inseparable sets", Optimization Methods and Software 1, 1992, 23-34 (Gordon & Breach Science

  5. A

    ‘Breast Cancer Wisconsin (Diagnostic) Data Set’ analyzed by Analyst-2

    • analyst-2.ai
    Dernière mise à jour : Feb 1, 2001
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2001). ‘Breast Cancer Wisconsin (Diagnostic) Data Set’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-breast-cancer-wisconsin-diagnostic-data-set-2558/latest
    Découvrir sur :
    Ensemble de données mis à jour
    Feb 1, 2001
    Ensemble de données créé et fourni par
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Zone géographique couverte
    Wisconsin
    Description

    Analysis of ‘Breast Cancer Wisconsin (Diagnostic) Data Set’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/uciml/breast-cancer-wisconsin-data on 28 January 2022.

    --- Dataset description provided by original source is as follows ---

    Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. n the 3-dimensional space is that described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34].

    This database is also available through the UW CS ftp server: ftp ftp.cs.wisc.edu cd math-prog/cpo-dataset/machine-learn/WDBC/

    Also can be found on UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

    Attribute Information:

    1) ID number 2) Diagnosis (M = malignant, B = benign) 3-32)

    Ten real-valued features are computed for each cell nucleus:

    a) radius (mean of distances from center to points on the perimeter) b) texture (standard deviation of gray-scale values) c) perimeter d) area e) smoothness (local variation in radius lengths) f) compactness (perimeter^2 / area - 1.0) g) concavity (severity of concave portions of the contour) h) concave points (number of concave portions of the contour) i) symmetry j) fractal dimension ("coastline approximation" - 1)

    The mean, standard error and "worst" or largest (mean of the three largest values) of these features were computed for each image, resulting in 30 features. For instance, field 3 is Mean Radius, field 13 is Radius SE, field 23 is Worst Radius.

    All feature values are recoded with four significant digits.

    Missing attribute values: none

    Class distribution: 357 benign, 212 malignant

    --- Original source retains full ownership of the source dataset ---

  6. Data from: Breast-Cancer Wisconsin

    • kaggle.com
    zip
    Dernière mise à jour : Dec 14, 2017
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Thulani Tembo (2017). Breast-Cancer Wisconsin [Dataset]. https://www.kaggle.com/datasets/thulani96/breastcancer-wisconsin
    Découvrir sur :
    zip(6051 bytes)Formats de téléchargement disponibles
    Ensemble de données mis à jour
    Dec 14, 2017
    Auteurs
    Thulani Tembo
    Licence

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    Les informations sur la licence ont été dérivées automatiquement

    Zone géographique couverte
    Wisconsin
    Description

    Dataset

    This dataset was created by Thulani Tembo

    Released under CC0: Public Domain

    Contents

  7. Cancer classification

    • 583507255.xyz
    zip
    Dernière mise à jour : Apr 11, 2024
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Sahil Bajaj (2024). Cancer classification [Dataset]. https://583507255.xyz/datasets/sahilnbajaj/cancer-classification
    Découvrir sur :
    zip(53037 bytes)Formats de téléchargement disponibles
    Ensemble de données mis à jour
    Apr 11, 2024
    Auteurs
    Sahil Bajaj
    Licence

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    Les informations sur la licence ont été dérivées automatiquement

    Description

    Diagnostic Wisconsin Breast Cancer Database.Dataset Characteristics Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at http://www.cs.wisc.edu/~street/images/

    Separating plane described above was obtained using Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree Construction Via Linear Programming." Proceedings of the 4th Midwest Artificial Intelligence and Cognitive Science Society, pp. 97-101, 1992], a classification method which uses linear programming to construct a decision tree. Relevant features were selected using an exhaustive search in the space of 1-4 features and 1-3 separating planes.

    The actual linear program used to obtain the separating plane in the 3-dimensional space is that described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34].

    This database is also available through the UW CS ftp server: ftp ftp.cs.wisc.edu cd math-prog/cpo-dataset/machine-learn/WDBC/

  8. A

    ‘Wisconsin Diagnostic Breast Cancer (WDBC)’ analyzed by Analyst-2

    • analyst-2.ai
    Dernière mise à jour : Sep 30, 2021
    + d'autres versions
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Wisconsin Diagnostic Breast Cancer (WDBC)’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-wisconsin-diagnostic-breast-cancer-wdbc-b8cd/5b08ae03/?iid=009-999&v=presentation
    Découvrir sur :
    Ensemble de données mis à jour
    Sep 30, 2021
    Ensemble de données créé et fourni par
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Description

    Analysis of ‘Wisconsin Diagnostic Breast Cancer (WDBC)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/mohaiminul101/wisconsin-diagnostic-breast-cancer-wdbc on 30 September 2021.

    --- Dataset description provided by original source is as follows ---

    Context

    Breast cancer is a disease in which cells in the breast grow out of control. There are different kinds of breast cancer. The kind of breast cancer depends on which cells in the breast turn into cancer. Wisconsin Diagnostic Breast Cancer (WDBC) dataset obtained by the university of Wisconsin Hospital is used to classify tumors as benign or malignant.

    Content

    Attribute Information:

    1) ID number 2) Diagnosis (M = malignant, B = benign) 3-32)

    Ten real-valued features are computed for each cell nucleus:

    a) radius (mean of distances from center to points on the perimeter) b) texture (standard deviation of gray-scale values) c) perimeter d) area e) smoothness (local variation in radius lengths) f) compactness (perimeter^2 / area - 1.0) g) concavity (severity of concave portions of the contour) h) concave points (number of concave portions of the contour) i) symmetry j) fractal dimension ("coastline approximation" - 1)

    The mean, standard error and "worst" or largest (mean of the three largest values) of these features were computed for each image, resulting in 30 features. For instance, field 3 is Mean Radius, field 13 is Radius SE, field 23 is Worst Radius.

    All feature values are recoded with four significant digits.

    Missing attribute values: none

    Class distribution: 357 benign, 212 malignant

    Acknowledgements

    Creator: Dr. WIlliam H. Wolberg (physician) University of Wisconsin Hospitals Madison, Wisconsin, USA

    This database is also available through the UW CS ftp server: ftp ftp.cs.wisc.edu cd math-prog/cpo-dataset/machine-learn/WDBC/

    Also can be found on UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

    --- Original source retains full ownership of the source dataset ---

  9. A

    ‘Breast Cancer Wisconsin (Prognostic) Data Set’ analyzed by Analyst-2

    • analyst-2.ai
    Dernière mise à jour : Nov 12, 2021
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Breast Cancer Wisconsin (Prognostic) Data Set’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-breast-cancer-wisconsin-prognostic-data-set-804d/14ab563f/?iid=009-947&v=presentation
    Découvrir sur :
    Ensemble de données mis à jour
    Nov 12, 2021
    Ensemble de données créé et fourni par
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Zone géographique couverte
    Wisconsin
    Description

    Analysis of ‘Breast Cancer Wisconsin (Prognostic) Data Set’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/sarahvch/breast-cancer-wisconsin-prognostic-data-set on 29 August 2021.

    --- Dataset description provided by original source is as follows ---

    Context

    Data From: UCI Machine Learning Repository http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/wpbc.names

    Content

    "Each record represents follow-up data for one breast cancer case. These are consecutive patients seen by Dr. Wolberg since 1984, and include only those cases exhibiting invasive breast cancer and no evidence of distant metastases at the time of diagnosis.

    The first 30 features are computed from a digitized image of a
    fine needle aspirate (FNA) of a breast mass. They describe
    characteristics of the cell nuclei present in the image.
    A few of the images can be found at
    http://www.cs.wisc.edu/~street/images/
    
    The separation described above was obtained using
    Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree
    Construction Via Linear Programming." Proceedings of the 4th
    Midwest Artificial Intelligence and Cognitive Science Society,
    pp. 97-101, 1992], a classification method which uses linear
    programming to construct a decision tree. Relevant features
    were selected using an exhaustive search in the space of 1-4
    features and 1-3 separating planes.
    
    The actual linear program used to obtain the separating plane
    in the 3-dimensional space is that described in:
    [K. P. Bennett and O. L. Mangasarian: "Robust Linear
    Programming Discrimination of Two Linearly Inseparable Sets",
    Optimization Methods and Software 1, 1992, 23-34].
    
    The Recurrence Surface Approximation (RSA) method is a linear
    programming model which predicts Time To Recur using both
    recurrent and nonrecurrent cases. See references (i) and (ii)
    above for details of the RSA method. 
    
    This database is also available through the UW CS ftp server:
    
    ftp ftp.cs.wisc.edu
    cd math-prog/cpo-dataset/machine-learn/WPBC/
    

    1) ID number 2) Outcome (R = recur, N = nonrecur) 3) Time (recurrence time if field 2 = R, disease-free time if field 2 = N) 4-33) Ten real-valued features are computed for each cell nucleus:

    a) radius (mean of distances from center to points on the perimeter)
    b) texture (standard deviation of gray-scale values)
    c) perimeter
    d) area
    e) smoothness (local variation in radius lengths)
    f) compactness (perimeter^2 / area - 1.0)
    g) concavity (severity of concave portions of the contour)
    h) concave points (number of concave portions of the contour)
    i) symmetry 
    j) fractal dimension ("coastline approximation" - 1)"
    

    Acknowledgements

    Creators:

    Dr. William H. Wolberg, General Surgery Dept., University of
    Wisconsin, Clinical Sciences Center, Madison, WI 53792
    wolberg@eagle.surgery.wisc.edu
    
    W. Nick Street, Computer Sciences Dept., University of
    Wisconsin, 1210 West Dayton St., Madison, WI 53706
    street@cs.wisc.edu 608-262-6619
    
    Olvi L. Mangasarian, Computer Sciences Dept., University of
    Wisconsin, 1210 West Dayton St., Madison, WI 53706
    olvi@cs.wisc.edu 
    

    Inspiration

    I'm really interested in trying out various machine learning algorithms on some real life science data.

    --- Original source retains full ownership of the source dataset ---

  10. A

    ‘Breast Cancer Wisconsin - benign or malignant’ analyzed by Analyst-2

    • analyst-2.ai
    Dernière mise à jour : Sep 30, 2021
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Breast Cancer Wisconsin - benign or malignant’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-breast-cancer-wisconsin-benign-or-malignant-ca20/daff23d1/?iid=042-127&v=presentation
    Découvrir sur :
    Ensemble de données mis à jour
    Sep 30, 2021
    Ensemble de données créé et fourni par
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Description

    Analysis of ‘Breast Cancer Wisconsin - benign or malignant’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/ninjacoding/breast-cancer-wisconsin-benign-or-malignant on 30 September 2021.

    --- Dataset description provided by original source is as follows ---

    Context

    It is quite common to find ML-based applications embedded with real-time patient data available from different healthcare systems in multiple countries, thereby increasing the efficacy of new treatment options which were unavailable before. This data set is all about predicting whether the cancer cells are benign or malignant.

    Content

    Information about attributes:

    There are total 10 attributes(int)- Sample code number: id number Clump Thickness: 1 - 10 Uniformity of Cell Size: 1 - 10 Uniformity of Cell Shape: 1 - 10 Marginal Adhesion: 1 - 10 Single Epithelial Cell Size: 1 - 10 Bare Nuclei: 1 - 10 Bland Chromatin: 1 - 10 Normal Nucleoli: 1 - 10 Mitoses: 1 - 10 Predicted class: 2 for benign and 4 for malignant

    Acknowledgements

    This data set(Original Wisconsin Breast Cancer Database) is taken from UCI Machine Learning Repository.

    Inspiration

    This is the first ever data set I am sharing in Kaggle. It would be a great pleasure if you find this data set useful to develop your own model. Hope this simple data set will help beginners to develop their own models for classification and learn how to make their model even better.

    --- Original source retains full ownership of the source dataset ---

  11. d

    Breast Cancer Wisconsin (Diagnostic)

    • dataportal.asia
    data, json, names
    Dernière mise à jour : Sep 14, 2021
    + d'autres versions
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    scidm.nchc.org.tw (2021). Breast Cancer Wisconsin (Diagnostic) [Dataset]. https://dataportal.asia/dataset/212571056_breast-cancer-wisconsin-diagnostic
    Découvrir sur :
    names(5657), (21363), names(4708), data(44234), data(124103), json(5878), names(5671), data(19889), (326)Formats de téléchargement disponibles
    Ensemble de données mis à jour
    Sep 14, 2021
    Ensemble de données fourni par
    scidm.nchc.org.tw
    Zone géographique couverte
    Wisconsin
    Description

    Diagnostic Wisconsin Breast Cancer Database

  12. d

    Breast Cancer Wisconsin (Prognostic)

    • dataportal.asia
    data, json, names
    Dernière mise à jour : Sep 14, 2021
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    scidm.nchc.org.tw (2021). Breast Cancer Wisconsin (Prognostic) [Dataset]. https://dataportal.asia/dataset/212571056_breast-cancer-wisconsin-prognostic
    Découvrir sur :
    data(124103), names(5657), names(5671), data(19889), names(4708), (21363), data(44234), (326), json(6226)Formats de téléchargement disponibles
    Ensemble de données mis à jour
    Sep 14, 2021
    Ensemble de données fourni par
    scidm.nchc.org.tw
    Zone géographique couverte
    Wisconsin
    Description

    Prognostic Wisconsin Breast Cancer Database

  13. o

    Wisconsin-breast-cancer-cytology-features

    • openml.org
    Dernière mise à jour : Mar 24, 2022
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    (2022). Wisconsin-breast-cancer-cytology-features [Dataset]. https://www.openml.org/d/43611
    Découvrir sur :
    CroissantCroissant est un format pour les ensembles de données de machine learning. Pour en savoir plus, consultez mlcommons.org/croissant.
    Ensemble de données mis à jour
    Mar 24, 2022
    Licence

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    Les informations sur la licence ont été dérivées automatiquement

    Description

    Context Cytology features of breast cancer biopsy. It can be used to predict breast cancer from cytology features. The data was obtained from https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Original) Data description can be found at https://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.names Content Data contains cytology features of breast cancer biopsies - clump thickness, uniformity of cell size, uniformity of cell shape, marginal adhesion, single epithelial cell size, bare nuclei, bland chromatin, normal nuceloli, mitosis. The class variable denotes whether it was cancer or not. Cancer = 1 and not cancer = 0 Attribute Information:

    Sample code number: id number Clump Thickness: 1 - 10 Uniformity of Cell Size: 1 - 10 Uniformity of Cell Shape: 1 - 10 Marginal Adhesion: 1 - 10 Single Epithelial Cell Size: 1 - 10 Bare Nuclei: 1 - 10 Bland Chromatin: 1 - 10 Normal Nucleoli: 1 - 10 Mitoses: 1 - 10 Class: (0 for benign, 1 for malignant)

    Acknowledgements Data obtained from : UCI machine learning repository Dua, D. and Karra Taniskidou, E. (2017). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science. Picture courtesy: Photo by Pablo Heimplatz on Unsplash

  14. O

    Breast Cancer Wisconsin (Diagnostic)

    • opendatalab.com
    zip
    Dernière mise à jour : Apr 21, 2023
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    University of Wisconsin (2023). Breast Cancer Wisconsin (Diagnostic) [Dataset]. https://opendatalab.com/OpenDataLab/Breast_Cancer_Wisconsin_Diagnostic
    Découvrir sur :
    zipFormats de téléchargement disponibles
    Ensemble de données mis à jour
    Apr 21, 2023
    Ensemble de données fourni par
    University of Wisconsin
    Description

    UCI Breast Cancer Raw Dataset is a breast cancer dataset that contains three sets of breast cancer cytopathology image data. Features are calculated from digitized images of fine needle aspiration (FNA) of breast masses. They describe the image The characteristics of the nuclei appearing in . The original UCI Breast Cancer dataset was published in 1995 by Dr. William H. Wolberg, General Surgery Dept. W. Nick Street, Computer Sciences Dept. Olvi L. Mangasarian, Computer Sciences Dept. Related papers are Breast cancer diagnosis and prognosis via linear programming etc.

  15. UCI Wisconsin Breast Cancer

    • kaggle.com
    Dernière mise à jour : Jun 5, 2020
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Steve Coffie (2020). UCI Wisconsin Breast Cancer [Dataset]. https://www.kaggle.com/datasets/stevecoffie/uci-wisconsin-breast-cancer
    Découvrir sur :
    CroissantCroissant est un format pour les ensembles de données de machine learning. Pour en savoir plus, consultez mlcommons.org/croissant.
    Ensemble de données mis à jour
    Jun 5, 2020
    Ensemble de données fourni par
    Kagglehttp://kaggle.com/
    Auteurs
    Steve Coffie
    Zone géographique couverte
    Wisconsin
    Description

    Context

    Diagnostic Wisconsin Breast Cancer Database

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. n the 3-dimensional space is that described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34].

    This database is also available through the UW CS ftp server: ftp ftp.cs.wisc.edu cd math-prog/cpo-dataset/machine-learn/WDBC/

    Also can be found on UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29

    Attribute Information:

    1) ID number 2) Diagnosis (M = malignant, B = benign) 3-32)

    Ten real-valued features are computed for each cell nucleus:

    a) radius (mean of distances from center to points on the perimeter) b) texture (standard deviation of gray-scale values) c) perimeter d) area e) smoothness (local variation in radius lengths) f) compactness (perimeter^2 / area - 1.0) g) concavity (severity of concave portions of the contour) h) concave points (number of concave portions of the contour) i) symmetry j) fractal dimension ("coastline approximation" - 1)

    The mean, standard error and "worst" or largest (mean of the three largest values) of these features were computed for each image, resulting in 30 features. For instance, field 3 is Mean Radius, field 13 is Radius SE, field 23 is Worst Radius.

    All feature values are recoded with four significant digits.

    Missing attribute values: none

    Class distribution: 357 benign, 212 malignant

    Inspiration

    Using Machine Learning and AI to contribute in cancer treatment

  16. H

    Replication Data for: Wisconsin Breast Cancer Diagnostic

    • dataverse.harvard.edu
    tsv
    Dernière mise à jour : Apr 6, 2016
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Harvard Dataverse (2016). Replication Data for: Wisconsin Breast Cancer Diagnostic [Dataset]. http://doi.org/10.7910/DVN/SP6VXJ
    Découvrir sur :
    tsv(14610)Formats de téléchargement disponibles
    Ensemble de données mis à jour
    Apr 6, 2016
    Ensemble de données fourni par
    Harvard Dataverse
    Licence

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    Les informations sur la licence ont été dérivées automatiquement

    Zone géographique couverte
    Wisconsin
    Description

    Original data from: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic). Changes made: - 16 rows with '?' for Bare Nuclei removed, leaving 683 records # Attribute Domain -- ----------------------------------------- 0. Class: (-1 for benign, +1 for malignant) 1. Clump Thickness 1 - 10 2. Uniformity of Cell Size 1 - 10 3. Uniformity of Cell Shape 1 - 10 4. Marginal Adhesion 1 - 10 5. Single Epithelial Cell Size 1 - 10 6. Bare Nuclei 1 - 10 7. Bland Chromatin 1 - 10 8. Normal Nucleoli 1 - 10 9. Mitoses 1 - 10

  17. f

    breast cancer

    • figshare.com
    txt
    Dernière mise à jour : Jan 17, 2022
    + d'autres versions
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Deepchecks Data (2022). breast cancer [Dataset]. http://doi.org/10.6084/m9.figshare.18551381.v1
    Découvrir sur :
    txtFormats de téléchargement disponibles
    Ensemble de données mis à jour
    Jan 17, 2022
    Ensemble de données fourni par
    figshare
    Auteurs
    Deepchecks Data
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Description

    Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image.n the 3-dimensional space is that described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34].This database is also available through the UW CS ftp server:ftp ftp.cs.wisc.educd math-prog/cpo-dataset/machine-learn/WDBC/Also can be found on UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)

  18. w

    Breast-Cancer-Patients-Wisconsin-Biography

    • workwithdata.com
    Dernière mise à jour : Apr 11, 2024
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    Work With Data (2024). Breast-Cancer-Patients-Wisconsin-Biography [Dataset]. https://www.workwithdata.com/topic/breast-cancer-patients-wisconsin-biography
    Découvrir sur :
    Ensemble de données mis à jour
    Apr 11, 2024
    Ensemble de données créé et fourni par
    Work With Data
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Zone géographique couverte
    Wisconsin
    Description

    The book Cancer is a bitch, or, I'd rather be having a midlife crisis is about Breast-Cancer-Patients-Wisconsin-Biography and was written by Gail Konop Baker. It was published in 2008.

  19. d

    Breast Cancer

    • data.world
    csv, zip
    Dernière mise à jour : Jul 3, 2024
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    UCI (2024). Breast Cancer [Dataset]. https://data.world/uci/breast-cancer
    Découvrir sur :
    csv, zipFormats de téléchargement disponibles
    Ensemble de données mis à jour
    Jul 3, 2024
    Auteurs
    UCI
    Description

    Source:

    Creators: Matjaz Zwitter & Milan Soklic (physicians)
    Institute of Oncology University Medical Center
    Ljubljana, Yugoslavia

    Donors:
    Ming Tan and Jeff Schlimmer (Jeffrey.Schlimmer '@' a.gp.cs.cmu.edu)

    Data Set Information:

    This is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature. (See also lymphography and primary-tumor.)
    This data set includes 201 instances of one class and 85 instances of another class. The instances are described by 9 attributes, some of which are linear and some are nominal.

    Attribute Information:

    1. Class: no-recurrence-events, recurrence-events
    2. age: 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99.
    3. menopause: lt40, ge40, premeno.
    4. tumor-size: 0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, 45-49, 50-54, 55-59.
    5. inv-nodes: 0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26, 27-29, 30-32, 33-35, 36-39.
    6. node-caps: yes, no.
    7. deg-malig: 1, 2, 3.
    8. breast: left, right.
    9. breast-quad: left-up, left-low, right-up, right-low, central.
    10. irradiat: yes, no.

    Relevant Papers:

    Michalski,R.S., Mozetic,I., Hong,J., & Lavrac,N. (1986). The Multi-Purpose Incremental Learning System AQ15 and its Testing Application to Three Medical Domains. In Proceedings of the Fifth National Conference on Artificial Intelligence, 1041-1045, Philadelphia, PA: Morgan Kaufmann.
    Clark,P. & Niblett,T. (1987). Induction in Noisy Domains. In Progress in Machine Learning (from the Proceedings of the 2nd European Working Session on Learning), 11-30, Bled, Yugoslavia: Sigma Press.
    Tan, M., & Eshelman, L. (1988). Using weighted networks to represent classification knowledge in noisy domains. Proceedings of the Fifth International Conference on Machine Learning, 121-134, Ann Arbor, MI.
    Cestnik,G., Konenenko,I, & Bratko,I. (1987). Assistant-86: A Knowledge-Elicitation Tool for Sophisticated Users. In I.Bratko & N.Lavrac (Eds.) Progress in Machine Learning, 31-45, Sigma Press.

    Papers That Cite This Data Set1:

    • Igor Fischer and Jan Poland. Amplifying the Block Matrix Structure for Spectral Clustering. Telecommunications Lab. 2005.
      • Saher Esmeir and Shaul Markovitch. Lookahead-based algorithms for anytime induction of decision trees. ICML. 2004.
      • Gavin Brown. Diversity in Neural Network Ensembles. The University of Birmingham. 2004.
      • Kaizhu Huang and Haiqin Yang and Irwin King and Michael R. Lyu and Laiwan Chan. Biased Minimax Probability Machine for Medical Diagnosis. AMAI. 2004.
      • Qingping Tao Ph. D. MAKING EFFICIENT LEARNING ALGORITHMS WITH EXPONENTIALLY MANY FEATURES. Qingping Tao A DISSERTATION Faculty of The Graduate College University of Nebraska In Partial Fulfillment of Requirements. 2004.
      • Krzysztof Grabczewski and Wl/odzisl/aw Duch. Heterogeneous Forests of Decision Trees. ICANN. 2002.
      • Hussein A. Abbass. An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine, 25. 2002.
      • Fei Sha and Lawrence K. Saul and Daniel D. Lee. Multiplicative Updates for Nonnegative Quadratic Programming in Support Vector Machines. NIPS. 2002.
      • Kristin P. Bennett and Ayhan Demiriz and Richard Maclin. Exploiting unlabeled data in ensemble methods. KDD. 2002.
      • Baback Moghaddam and Gregory Shakhnarovich. Boosted Dyadic Kernel Discriminants. NIPS. 2002.
      • András Antos and Balázs Kégl and Tamás Linder and Gábor Lugosi. Data-dependent margin-based generalization bounds for classification. Journal of Machine Learning Research, 3. 2002.
      • Michael G. Madden. Evaluation of the Performance of the Markov Blanket Bayesian Classifier Algorithm. CoRR, csLG/0211003. 2002.
      • Yongmei Wang and Ian H. Witten. Modeling for Optimal Probability Prediction. ICML. 2002.
      • Remco R. Bouckaert. Accuracy bounds for ensembles under 0 { 1 loss. Xtal Mountain Information Technology & Computer Science Department, University of Waikato. 2002.
      • Nikunj C. Oza and Stuart J. Russell. Experimental comparisons of online and batch versions of bagging and boosting. KDD. 2001.
      • Bernhard Pfahringer and Geoffrey Holmes and Richard Kirkby. Optimizing the Induction of Alternating Decision Trees. PAKDD. 2001.
      • Robert Burbidge and Matthew Trotter and Bernard F. Buxton and Sean B. Holden. STAR - Sparsity through Automated Rejection. IWANN (1). 2001.
      • Bernhard Pfahringer and Geoffrey Holmes and Gabi Schmidberger. Wrapping Boosters against Noise. Australian Joint Conference on Artificial Intelligence. 2001.
      • W. Nick Street and Yoo-Hyon Kim. A streaming ensemble algorithm (SEA) for large-scale classification. KDD. 2001.
      • Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Improved Generalization Through Explicit Optimization of Margins. Machine Learning, 38. 2000.
      • Endre Boros and Peter Hammer and Toshihide Ibaraki and Alexander Kogan and Eddy Mayoraz and Ilya B. Muchnik. An Implementation of Logical Analysis of Data. IEEE Trans. Knowl. Data Eng, 12. 2000.
      • P. S and Bradley K. P and Bennett A. Demiriz. Constrained K-Means Clustering. Microsoft Research Dept. of Mathematical Sciences One Microsoft Way Dept. of Decision Sciences and Eng. Sys. 2000.
      • Sally A. Goldman and Yan Zhou. Enhancing Supervised Learning with Unlabeled Data. ICML. 2000.
      • Justin Bradley and Kristin P. Bennett and Bennett A. Demiriz. Constrained K-Means Clustering. Microsoft Research Dept. of Mathematical Sciences One Microsoft Way Dept. of Decision Sciences and Eng. Sys. 2000.
      • Yuh-Jeng Lee. Smooth Support Vector Machines. Preliminary Thesis Proposal Computer Sciences Department University of Wisconsin. 2000.
      • Petri Kontkanen and Petri Myllym and Tomi Silander and Henry Tirri and Peter Gr. On predictive distributions and Bayesian networks. Department of Computer Science, Stanford University. 2000.
      • Kristin P. Bennett and Ayhan Demiriz and John Shawe-Taylor. A Column Generation Algorithm For Boosting. ICML. 2000.
      • Matthew Mullin and Rahul Sukthankar. Complete Cross-Validation for Nearest Neighbor Classifiers. ICML. 2000.
      • David W. Opitz and Richard Maclin. Popular Ensemble Methods: An Empirical Study. J. Artif. Intell. Res. (JAIR, 11. 1999.
      • Chun-Nan Hsu and Hilmar Schuschel and Ya-Ting Yang. The ANNIGMA-Wrapper Approach to Neural Nets Feature Selection for Knowledge Discovery and Data Mining. Institute of Information Science. 1999.
      • David M J Tax and Robert P W Duin. Support vector domain description. Pattern Recognition Letters, 20. 1999.
      • Kai Ming Ting and Ian H. Witten. Issues in Stacked Generalization. J. Artif. Intell. Res. (JAIR, 10. 1999.
      • Ismail Taha and Joydeep Ghosh. Symbolic Interpretation of Artificial Neural Networks. IEEE Trans. Knowl. Data Eng, 11. 1999.
      • Lorne Mason and Jonathan Baxter and Peter L. Bartlett and Marcus Frean. Boosting Algorithms as Gradient Descent. NIPS. 1999.
      • Iñaki Inza and Pedro Larrañaga and Basilio Sierra and Ramon Etxeberria and Jose Antonio Lozano and Jos Manuel Peña. Representing the behaviour of supervised classification learning algorithms by Bayesian networks. Pattern Recognition Letters, 20. 1999.
      • Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Direct Optimization of Margins Improves Generalization in Combined Classifiers. NIPS. 1998.
      • Richard Maclin. Boosting Classifiers Regionally. AAAI/IAAI. 1998.
      • Huan Liu and Hiroshi Motoda and Manoranjan Dash. A Monotonic Measure for Optimal Feature Selection. ECML. 1998.
      • Yk Huhtala and Juha Kärkkäinen and Pasi Porkka and Hannu Toivonen. Efficient Discovery of Functional and Approximate Dependencies Using Partitions. ICDE. 1998.
      • W. Nick Street. A Neural Network Model for Prognostic Prediction. ICML. 1998.
      • Kristin P. Bennett and Erin J. Bredensteiner. A Parametric Optimization Method for Machine Learning. INFORMS Journal on Computing, 9. 1997.
      • Pedro Domingos. Control-Sensitive Feature Selection for Lazy Learners. Artif. Intell. Rev, 11. 1997.
      • Rudy Setiono and Huan Liu. NeuroLinear: From neural networks to oblique decision rules. Neurocomputing, 17. 1997.
      • . Prototype Selection for Composite Nearest Neighbor Classifiers. Department of Computer Science University of Massachusetts. 1997.
      • Erin J. Bredensteiner and Kristin P. Bennett. Feature Minimization within Decision Trees. National Science Foundation. 1996.
      • Ismail Taha and Joydeep Ghosh. Characterization of the Wisconsin Breast cancer Database Using a Hybrid Symbolic-Connectionist System. Proceedings of ANNIE. 1996.
      • Kamal Ali and Michael J. Pazzani. Error Reduction through Learning Multiple Descriptions. Machine Learning, 24. 1996.
      • Jennifer A. Blue and Kristin P. Bennett. Hybrid Extreme Point Tabu Search. Department of Mathematical Sciences Rensselaer Polytechnic Institute. 1996.
      • Pedro Domingos. Unifying Instance-Based and Rule-Based Induction. Machine Learning, 24. 1996.
      • Geoffrey I. Webb. OPUS: An Efficient Admissible Algorithm for Unordered Search. J. Artif. Intell. Res. (JAIR, 3. 1995.
      • Christophe Giraud and Tony Martinez and Christophe G. Giraud-Carrier. University of Bristol Department of Computer Science ILA: Combining Inductive Learning with Prior Knowledge and Reasoning. 1995.
      • Ron Kohavi. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. IJCAI. 1995.
      • M. A. Galway and Michael G. Madden. DEPARTMENT OF INFORMATION TECHNOLOGY technical report NUIG-IT-011002 Evaluation of the Performance of the Markov Blanket Bayesian Classifier Algorithm. Department of Information Technology National University of Ireland, Galway.
      • John G. Cleary and Leonard E. Trigg. Experiences with OB1, An Optimal Bayes Decision Tree Learner. Department of Computer Science University of Waikato.
      • Wl/odzisl/aw Duch and Rafal/ Adamczak Email:duchraad@phys. uni. torun. pl. Statistical methods for construction of neural networks. Department of Computer Methods, Nicholas Copernicus University.
      • Rong-En Fan and P. -H Chen
  20. d

    https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)...

    • b2find.dkrz.de
    Dernière mise à jour : Oct 23, 2023
    Partager
    FacebookFacebook
    TwitterTwitter
    Email
    Cliquer pour copier le lien
    Lien copié
    Close
    Citer
    (2023). https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic) - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/7548e859-9b6f-5aed-a2df-ead0d2853f91
    Découvrir sur :
    Ensemble de données mis à jour
    Oct 23, 2023
    Licence

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    Les informations sur la licence ont été dérivées automatiquement

    Description

    Breast Cancer Data-set is also used for collecting the data for constructing the patient record files. The numbers of attributes are Id number, cell size, and shape, class attributes such as a benign tumor or malicious tumor andso on. THIS DATASET IS ARCHIVED AT DANS/EASY, BUT NOT ACCESSIBLE HERE. TO VIEW A LIST OF FILES AND ACCESS THE FILES IN THIS DATASET CLICK ON THE DOI-LINK ABOVE

Partager
FacebookFacebook
TwitterTwitter
Email
Cliquer pour copier le lien
Lien copié
Close
Citer
Ninja Coding (2020). Breast Cancer Wisconsin - benign or malignant [Dataset]. https://www.kaggle.com/datasets/ninjacoding/breast-cancer-wisconsin-benign-or-malignant
Logo de l'organisation

Breast Cancer Wisconsin - benign or malignant

Predicting the stage of breast cancer

Découvrir sur :
3 articles Google Scholar citent cet ensemble de données (Afficher dans Google Scholar)
CroissantCroissant est un format pour les ensembles de données de machine learning. Pour en savoir plus, consultez mlcommons.org/croissant.
Ensemble de données mis à jour
Jul 20, 2020
Ensemble de données fourni par
Kagglehttp://kaggle.com/
Auteurs
Ninja Coding
Description

Context

It is quite common to find ML-based applications embedded with real-time patient data available from different healthcare systems in multiple countries, thereby increasing the efficacy of new treatment options which were unavailable before. This data set is all about predicting whether the cancer cells are benign or malignant.

Content

Information about attributes:

There are total 10 attributes(int)- Sample code number: id number Clump Thickness: 1 - 10 Uniformity of Cell Size: 1 - 10 Uniformity of Cell Shape: 1 - 10 Marginal Adhesion: 1 - 10 Single Epithelial Cell Size: 1 - 10 Bare Nuclei: 1 - 10 Bland Chromatin: 1 - 10 Normal Nucleoli: 1 - 10 Mitoses: 1 - 10 Predicted class: 2 for benign and 4 for malignant

Acknowledgements

This data set(Original Wisconsin Breast Cancer Database) is taken from UCI Machine Learning Repository.

Inspiration

This is the first ever data set I am sharing in Kaggle. It would be a great pleasure if you find this data set useful to develop your own model. Hope this simple data set will help beginners to develop their own models for classification and learn how to make their model even better.

Recherche
Effacer la recherche
Fermer le champ de recherche
Applications Google
Menu principal