21 datasets found

Iris_Classification
kaggle.com
Updated Sep 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
end_of_night.17j03 (2023). Iris_Classification [Dataset]. https://www.kaggle.com/datasets/endofnight17j03/iris-classification
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 14, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
end_of_night.17j03
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
The Iris Classification dataset is a well-known dataset in machine learning, commonly used for classification tasks. It contains measurements of various iris flowers, including sepal length, sepal width, petal length, and petal width, as well as the corresponding species label.

Columns Description:

1. Sepal Length (cm): Length of the sepals of the iris flower.

2. Sepal Width (cm): Width of the sepals of the iris flower.

3. Petal Length (cm): Length of the petals of the iris flower.

4. Petal Width (cm): Width of the petals of the iris flower.

5. Species: The species of the iris flower, which is the target variable to be predicted.

This dataset is commonly used for practicing classification algorithms and exploring data analysis techniques.
🌼 Unveiling the Iris Dataset 🌸
kaggle.com
Updated Jul 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HARISH KUMARdatalab (2023). 🌼 Unveiling the Iris Dataset 🌸 [Dataset]. http://doi.org/10.34740/kaggle/dsv/6209742
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/6209742
Dataset updated
Jul 28, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
HARISH KUMARdatalab
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context: 🌼 The Iris flower dataset, an iconic multivariate set, was first introduced by the renowned British statistician and biologist, Ronald Fisher in 1936 📝. Commonly known as Anderson's Iris dataset, it was curated by Edgar Anderson to measure the morphologic variation of three Iris species 🌸: Iris Setosa, Iris Virginica, and Iris Versicolor.

📊 The set comprises 100 samples from each species, with four features - sepal length, sepal width, petal length, and petal width, measured in centimetres.

🔬 This dataset has since served as a standard test case for various statistical classification techniques in machine learning, including the widely used support vector machines (SVM).

So, whether you're a newbie dipping your toes into the ML pond or a seasoned data scientist testing out a new classification method, the Iris dataset is a classic starting point! 🎯🚀

Columns:

Sepal Length: The length of the sepal of the iris flower, is measured in centimetres.

Sepal Width: The width of the sepal of the iris flower, measured in centimetres.

Petal Length: The length of the petal of the iris flower, is measured in centimetres.

Petal Width: The width of the petal of the iris flower, measured in centimetres.

Species:The specific species of the iris flower, categorized into Sentosa, Virginica, and Versicolor.

Problem Statement:

1.🎯 Classification Challenge: Can you accurately predict the species of an Iris flower based on the four given measurements: sepal length, sepal width, petal length, and petal width?

2.💡 Feature Importance: Which feature (sepal length, sepal width, petal length, or petal width) is the most significant in distinguishing between the species of Iris flowers?

3.📈 Data Scaling: How does standardization (or normalization) of the features affect the performance of your classification models?

4.🧪 Model Experimentation: Can simpler models such as Logistic Regression perform as well as more complex models like Support Vector Machines or Neural Networks on the Iris dataset? Compare the performance of various models.

5.🤖 AutoML Challenge: Use AutoML tools (like Google's AutoML or H2O's AutoML) to build a classification model. How does its performance compare with your handcrafted models?

Kindly, upvote if you find the dataset interesting
Iris Species Dataset and Database
kaggle.com
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ghanshyam Saini (2025). Iris Species Dataset and Database [Dataset]. https://www.kaggle.com/datasets/ghnshymsaini/iris-species-dataset-and-database
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 15, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ghanshyam Saini
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Iris Flower Dataset

This is a classic and very widely used dataset in machine learning and statistics, often serving as a first dataset for classification problems. Introduced by the British statistician and biologist Ronald Fisher in his 1936 paper "The use of multiple measurements in taxonomic problems," it is a foundational resource for learning classification algorithms.

Overview:

The dataset contains measurements for 150 samples of iris flowers. Each sample belongs to one of three species of iris:

Iris setosa

Iris versicolor

Iris virginica

For each flower, four features were measured:

Sepal length (in cm)

Sepal width (in cm)

Petal length (in cm)

Petal width (in cm)

The goal is typically to build a model that can classify iris flowers into their correct species based on these four features.

File Structure:

The dataset is usually provided as a single CSV (Comma Separated Values) file, often named iris.csv or similar. This file typically contains the following columns:

sepal_length (cm): Numerical. The length of the sepal of the iris flower.

sepal_width (cm): Numerical. The width of the sepal of the iris flower.

petal_length (cm): Numerical. The length of the petal of the iris flower.

petal_width (cm): Numerical. The width of the petal of the iris flower.

species: Categorical. The species of the iris flower (either 'setosa', 'versicolor', or 'virginica'). This is the target variable for classification.

Content of the Data:

The dataset contains an equal number of samples (50) for each of the three iris species. The measurements of the sepal and petal dimensions vary between the species, allowing for their differentiation using machine learning models.

How to Use This Dataset:

Download the iris.csv file.

Load the data using libraries like Pandas in Python.

Explore the data through visualization and statistical analysis to understand the relationships between the features and the different species.

Build classification models (e.g., Logistic Regression, Support Vector Machines, Decision Trees, K-Nearest Neighbors) using the sepal and petal measurements as features and the 'species' column as the target variable.

Evaluate the performance of your model using appropriate metrics (e.g., accuracy, precision, recall, F1-score).

The dataset is small and well-behaved, making it excellent for learning and experimenting with various classification techniques.

Citation:

When using the Iris dataset, it is common to cite Ronald Fisher's original work:

Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2), 179-188.

Data Contribution:

Thank you for providing this classic and fundamental dataset to the Kaggle community. The Iris dataset remains an invaluable resource for both beginners learning the basics of classification and experienced practitioners testing new algorithms. Its simplicity and clear class separation make it an ideal starting point for many data science projects.

If you find this dataset description helpful and the dataset itself useful for your learning or projects, please consider giving it an upvote after downloading. Your appreciation is valuable!
A
‘Iris Flower Dataset’ analyzed by Analyst-2
analyst-2.ai
Updated Jan 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘Iris Flower Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-iris-flower-dataset-bb8a/eb51f303/?iid=001-010&v=presentation
Explore at:
Dataset updated
Jan 28, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Iris Flower Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/arshid/iris-flower-dataset on 28 January 2022.

--- Dataset description provided by original source is as follows ---

Context

The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris Setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimeters.

This dataset became a typical test case for many statistical classification techniques in machine learning such as support vector machines

Content

The dataset contains a set of 150 records under 5 attributes - Petal Length, Petal Width, Sepal Length, Sepal width and Class(Species).

Acknowledgements

This dataset is free and is publicly available at the UCI Machine Learning Repository

--- Original source retains full ownership of the source dataset ---
h
iris-clase
huggingface.co
Updated Apr 5, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andrés Eduardo García Herrera (2025). iris-clase [Dataset]. https://huggingface.co/datasets/aegarciaherrera/iris-clase
Explore at:
Dataset updated
Apr 5, 2025
Authors
Andrés Eduardo García Herrera
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Dataset Card for "iris"

Dataset Summary

The Iris dataset is one of the most classic datasets in machine learning, often used for classification and clustering tasks. It contains 150 samples of iris flowers, each described by four features: sepal length, sepal width, petal length, and petal width. The task is to classify the samples into one of three species: Iris setosa, Iris versicolor, or Iris virginica. This dataset is especially useful for:

Supervised learning… See the full description on the dataset page: https://huggingface.co/datasets/aegarciaherrera/iris-clase.
O
iris
opendatalab.com
tensorflow.org
zip
Updated Sep 22, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University of Tsukuba (2022). iris [Dataset]. https://opendatalab.com/OpenDataLab/iris
Explore at:
zip(4551 bytes)Available download formats
Dataset updated
Sep 22, 2022
Dataset provided by
University of Tsukuba
Description
The Iris flower data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems. It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. The data set consists of 50 samples from each of three species of Iris (Iris Setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimeters.This dataset became a typical test case for many statistical classification techniques in machine learning such as support vector machines
c
Iris Species Dataset
cubig.ai
Updated May 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Iris Species Dataset [Dataset]. https://cubig.ai/store/products/387/iris-species-dataset
Explore at:
Dataset updated
May 29, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
Description
1) Data Introduction • The Iris Species Dataset is a classic multi-class classification data that collected a total of 150 samples, 50 for each of the three iris species (Setosa, Versicolor, Virginica), consisting of four numerical characteristics and species labels, including calyx length, width, petal length, and width.

2) Data Utilization (1) The Iris Species Dataset has characteristics that: • This dataset consists of a total of six columns and is labeled as one of three types, making it suitable for class division and basic statistical analysis. (2) The Iris Species Dataset can be used to: • Classification Algorithm Practice: You can easily practice various machine learning classification models such as logistic regression, SVM, and decision tree by inputting four characteristics: calyx and petal length and width. • Visualize data and analyze basic statistics: Visualize the distribution of characteristics by variety into scatterplots, boxplots, etc. to explore differences between classes and correlations between characteristics.
h
iris
huggingface.co
Updated Apr 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bernardo Ronquillo (2025). iris [Dataset]. https://huggingface.co/datasets/brjapon/iris
Explore at:
Dataset updated
Apr 3, 2025
Authors
Bernardo Ronquillo
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Iris Species Dataset

The Iris dataset is a classic dataset in machine learning, originally published by Ronald Fisher. It contains 150 instances of iris flowers, each described by four features (sepal length, sepal width, petal length, and petal width), along with the corresponding species label (setosa, versicolor, or virginica). It is commonly used as an introductory dataset for classification tasks and for demonstrating basic data exploration and model training workflows.… See the full description on the dataset page: https://huggingface.co/datasets/brjapon/iris.
h
iris
huggingface.co
Updated Sep 23, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
scikit-learn (2022). iris [Dataset]. https://huggingface.co/datasets/scikit-learn/iris
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 23, 2022
Dataset authored and provided by
scikit-learn
License
https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Description
Iris Species Dataset

The Iris dataset was used in R.A. Fisher's classic 1936 paper, The Use of Multiple Measurements in Taxonomic Problems, and can also be found on the UCI Machine Learning Repository. It includes three iris species with 50 samples each as well as some properties about each flower. One flower species is linearly separable from the other two, but the other two are not linearly separable from each other. The dataset is taken from UCI Machine Learning Repository's… See the full description on the dataset page: https://huggingface.co/datasets/scikit-learn/iris.
Classification Analysis Using Python
kaggle.com
Updated Jul 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nibedita Sahu (2023). Classification Analysis Using Python [Dataset]. https://www.kaggle.com/datasets/nibeditasahu/classification-analysis-using-python
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 3, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Nibedita Sahu
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
The Iris dataset is a classic and widely used dataset in machine learning for classification tasks. It consists of measurements of different iris flowers, including sepal length, sepal width, petal length, and petal width, along with their corresponding species. With a total of 150 samples, the dataset is balanced and serves as an excellent choice for understanding and implementing classification algorithms. This notebook explores the dataset, preprocesses the data, builds a decision tree classification model, and evaluates its performance, showcasing the effectiveness of decision trees in solving classification problems.
Data from: IrisDataset
kaggle.com
zip
Updated Apr 15, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sourav Bhattacharya (2019). IrisDataset [Dataset]. https://www.kaggle.com/souravbhattacharya10/irisdataset
Explore at:
zip(1039 bytes)Available download formats
Dataset updated
Apr 15, 2019
Authors
Sourav Bhattacharya
Description
The simple Iris dataset for Multiclass Classification (required dataset for Hello World program in Machine Learning). It has data for three species (Iris-setosa, Iris-versicolor and Iris-virginica) of Iris flower. It contains 4 features as input (petal_length, petal-width, sepal_length and sepal_width) and label as output. It contains 150 rows, 50 rows for each species of Iris.
i
NIR Iris images Under Alcohol Effect
ieee-dataport.org
Updated May 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
JUAN TAPIA (2025). NIR Iris images Under Alcohol Effect [Dataset]. https://ieee-dataport.org/documents/nir-iris-images-under-alcohol-effect
Explore at:
Dataset updated
May 21, 2025
Authors
JUAN TAPIA
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This research proposes a method to detect alcohol consumption from Near-Infra-Red (NIR) periocular eye images. The study focuses on determining the effect of external factors such as alcohol on the Central Nervous System (CNS). The goal is to analyse how this impacts on iris and pupil movements and if it is possible to capture these changes with a standard iris NIR camera. This paper proposes a novel Fused Capsule Network (F-CapsNet) to classify iris NIR images taken under alcohol consumption subjects.
h
iris-clase
huggingface.co
Updated Apr 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fatima Godoy Rodriguez (2025). iris-clase [Dataset]. https://huggingface.co/datasets/fatimag29/iris-clase
Explore at:
Dataset updated
Apr 4, 2025
Authors
Fatima Godoy Rodriguez
Description
language:

"en" pretty_name: "Iris Dataset" tags: tabular-data classification machine-learning license: "public-domain" task_categories: classification task_ids: multi-class-classification size_categories: small source_datasets: original annotations_creators: expert-generated dataset_creators: Edgar Anderson paperswithcode_id: iris multilinguality: "monolingual"
Iris Species Classifier using Flux.jl
kaggle.com
Updated Jun 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sadia Mazhar26 (2025). Iris Species Classifier using Flux.jl [Dataset]. https://www.kaggle.com/datasets/sadiamazhar26/iris-species-classifier-using-flux-jl
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 5, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sadia Mazhar26
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This project uses the Iris dataset from the RDatasets Julia package. It consists of 150 flower samples equally distributed across three species: Setosa, Versicolor, and Virginica. Each sample includes four numerical features: sepal length, sepal width, petal length, and petal width. The features are normalized for model input. The dataset is split into 80% training and 20% testing to evaluate a neural network model developed using Flux.jl for accurate species classification.
f
Table_1_Validation of the Relationship Between Iris Color and Uveal Melanoma...
frontiersin.figshare.com
datasetcatalog.nlm.nih.gov
+1more
xlsx
Updated May 31, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Haihan Zhang; Yueming Liu; Kai Zhang; Shiqi Hui; Yu Feng; Jingting Luo; Yang Li; Wenbin Wei (2023). Table_1_Validation of the Relationship Between Iris Color and Uveal Melanoma Using Artificial Intelligence With Multiple Paths in a Large Chinese Population.XLSX [Dataset]. http://doi.org/10.3389/fcell.2021.713209.s001
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/fcell.2021.713209.s001
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Haihan Zhang; Yueming Liu; Kai Zhang; Shiqi Hui; Yu Feng; Jingting Luo; Yang Li; Wenbin Wei
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Previous studies have shown that light iris color is a predisposing factor for the development of uveal melanoma (UM) in a population of Caucasian ancestry. However, in all these studies, a remarkably low percentage of patients have brown eyes, so we applied deep learning methods to investigate the correlation between iris color and the prevalence of UM in the Chinese population. All anterior segment photos were automatically segmented with U-NET, and only the iris regions were retained. Then the iris was analyzed with machine learning methods (random forests and convolutional neural networks) to obtain the corresponding iris color spectra (classification probability). We obtained satisfactory segmentation results with high consistency with those from experts. The iris color spectrum is consistent with the raters’ view, but there is no significant correlation with UM incidence.
Iris Species
kaggle.com
zip
Updated Sep 27, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UCI Machine Learning (2016). Iris Species [Dataset]. https://www.kaggle.com/uciml/iris
Explore at:
zip(3687 bytes)Available download formats
Dataset updated
Sep 27, 2016
Dataset authored and provided by
UCI Machine Learning
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The Iris dataset was used in R.A. Fisher's classic 1936 paper, The Use of Multiple Measurements in Taxonomic Problems, and can also be found on the UCI Machine Learning Repository.

It includes three iris species with 50 samples each as well as some properties about each flower. One flower species is linearly separable from the other two, but the other two are not linearly separable from each other.

The columns in this dataset are:

Id

SepalLengthCm

SepalWidthCm

PetalLengthCm

PetalWidthCm

Species
h
iris_clase
huggingface.co
Updated Apr 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Soriano (2025). iris_clase [Dataset]. https://huggingface.co/datasets/mariaasoriaano/iris_clase
Explore at:
Dataset updated
Apr 4, 2025
Authors
Soriano
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Card for Iris Dataset

Dataset Summary

The Iris dataset is a classic multivariate dataset introduced by Ronald Fisher in 1936. It contains 150 samples of iris flowers from three different species: Iris setosa, Iris versicolor, and Iris virginica. Each sample has four features: sepal length, sepal width, petal length, and petal width. This dataset is widely used for classification tasks, especially in machine learning tutorials and benchmarks.

Dataset… See the full description on the dataset page: https://huggingface.co/datasets/mariaasoriaano/iris_clase.
Iris Dataset (Dummy Version)
kaggle.com
Updated Feb 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rim El Guennouni (2025). Iris Dataset (Dummy Version) [Dataset]. https://www.kaggle.com/datasets/rimelguennouni/iris-dataset-dummy-version
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 27, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Rim El Guennouni
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This is a simulated version of the classic Iris dataset, commonly used for machine learning and data analysis practice. It includes measurements for sepal length, sepal width, petal length, and petal width, along with species classification. Ideal for beginners learning data science and machine learning.
f
Data_Sheet_1_Deep learning-based ultrasonographic classification of canine...
frontiersin.figshare.com
docx
Updated Sep 4, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Heejung Yu; In-Gyu Lee; Jun-Young Oh; Jaehwan Kim; Ji-Hoon Jeong; Kidong Eom (2024). Data_Sheet_1_Deep learning-based ultrasonographic classification of canine chronic kidney disease.docx [Dataset]. http://doi.org/10.3389/fvets.2024.1443234.s001
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fvets.2024.1443234.s001
Dataset updated
Sep 4, 2024
Dataset provided by
Frontiers
Authors
Heejung Yu; In-Gyu Lee; Jun-Young Oh; Jaehwan Kim; Ji-Hoon Jeong; Kidong Eom
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ObjectivesIn veterinary medicine, attempts to apply artificial intelligence (AI) to ultrasonography have rarely been reported, and few studies have investigated the value of AI in ultrasonographic diagnosis. This study aimed to develop a deep learning-based model for classifying the status of canine chronic kidney disease (CKD) using renal ultrasonographic images and assess its diagnostic performance in comparison with that of veterinary imaging specialists, thereby verifying its clinical utility.Materials and methodsIn this study, 883 ultrasonograms were obtained from 198 dogs, including those diagnosed with CKD according to the International Renal Interest Society (IRIS) guidelines and healthy dogs. After preprocessing and labeling each image with its corresponding IRIS stage, the renal regions were extracted and classified based on the IRIS stage using the convolutional neural network-based object detection algorithm You Only Look Once. The training scenarios consisted of multi-class classification, categorization of images into IRIS stages, and four binary classifications based on specific IRIS stages. To prevent model overfitting, we balanced the dataset, implemented early stopping, used lightweight models, and applied dropout techniques. Model performance was assessed using accuracy, recall, precision, F1 score, and receiver operating characteristic curve and compared with the diagnostic accuracy of four specialists. Inter- and intra-observer variabilities among specialists were also evaluated.ResultsThe developed model exhibited a low accuracy of 0.46 in multi-class classification. However, a significant performance improvement was observed in binary classifications, with the model designed to distinguish stage 3 or higher showing the highest accuracy of 0.85. In this classification, recall, precision, and F1 score values were all 0.85, and the area under the curve was 0.89. Compared with radiologists, whose accuracy ranged from 0.48 to 0.62 in this experimental scenario, the AI model exhibited superiority. Intra-observer reliability among radiologists was substantial, whereas inter-observer variability showed a moderate level of agreement.ConclusionsThis study developed a deep-learning framework capable of reliably classifying CKD IRIS stages 3 and 4 in dogs using ultrasonograms. The developed framework demonstrated higher accuracy than veterinary imaging specialists and provided more objective and consistent interpretations. Therefore, deep-learning-based ultrasound diagnostics are potentially valuable tools for diagnosing CKD in dogs.
B
Classificateurs bayésiens naïfs | Naive Bayes
borealisdata.ca
Updated Oct 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jarno Van der Kolk; Peter Darveau; Felicity Tayler (2024). Classificateurs bayésiens naïfs | Naive Bayes [Dataset]. http://doi.org/10.5683/SP3/QBRCTQ
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.5683/SP3/QBRCTQ
Dataset updated
Oct 22, 2024
Dataset provided by
Borealis
Authors
Jarno Van der Kolk; Peter Darveau; Felicity Tayler
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Nous verrons dans ce tutoriel comment détecter des modèles à l’aide du classificateur bayésien naïf, une technique d’apprentissage-machine efficace pour détecter certains modèles et prévoir les dépendances au sein de votre jeu de données. Nous réexaminerons dans la première partie de ce tutoriel le jeu de données Iris utilisé dans le tutoriel précédent pour apprendre à utiliser le classificateur bayésien naïf. Nous appliquerons par la suite vos nouvelles connaissances pour déceler les pourriels parmi vos messages textes (SMS), de manière à identifier les messages que vous ne désirerez pas lire. Le jeu de données que nous utiliserons s’agit d’un jeu de données de source libre du Référentiel d’apprentissage-machine UCI. Nous examinerons ensuite la classification multi-étiquettes via le jeu de données CMU que nous avons utilisé antérieurement pour le classificateur des plus proches voisins. Enfin, nous vous donnerons un exemple d’utilisation non aboutie du classificateur bayésien et vous expliquerons pourquoi cela n’a pas fonctionné. The tutorial revisits the Iris flower dataset to introduce the basic steps of working with the Naive Bayes Classifier. It then applies the classifier to detect spam in SMS messages using the SMS Spam collection dataset from the UCI Machine Learning Repository, and performs multi-label classification using the CMU book dataset. The tutorial also presents a scenario where the Naive Bayes Classifier fails, providing an explanation for the failure. By the end of this tutorial, participants will have a solid understanding of the Naive Bayes classifier, be able to split data into training and testing sets, make predictions, evaluate classifier performance, identify spam, classify books, train a Gaussian Naive Bayes classifier for single or multiple labels, and utilize imputation techniques for handling missing data.

Facebook

Twitter

Click to copy link

Link copied

Cite

end_of_night.17j03 (2023). Iris_Classification [Dataset]. https://www.kaggle.com/datasets/endofnight17j03/iris-classification

Iris_Classification

Predicting Iris Species Using Machine Learning

Explore at:

3 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Sep 14, 2023

Dataset provided by

Kagglehttp://kaggle.com/

Authors

end_of_night.17j03

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

The Iris Classification dataset is a well-known dataset in machine learning, commonly used for classification tasks. It contains measurements of various iris flowers, including sepal length, sepal width, petal length, and petal width, as well as the corresponding species label.

Columns Description:

1. Sepal Length (cm): Length of the sepals of the iris flower.

2. Sepal Width (cm): Width of the sepals of the iris flower.

3. Petal Length (cm): Length of the petals of the iris flower.

4. Petal Width (cm): Width of the petals of the iris flower.

5. Species: The species of the iris flower, which is the target variable to be predicted.

This dataset is commonly used for practicing classification algorithms and exploring data analysis techniques.

Clear search

Close search

Google apps

Main menu

Iris_Classification

🌼 Unveiling the Iris Dataset 🌸

Iris Species Dataset and Database

Iris Flower Dataset

‘Iris Flower Dataset’ analyzed by Analyst-2

Context

Content

Acknowledgements

iris-clase

iris

Iris Species Dataset

iris

iris

Classification Analysis Using Python

Data from: IrisDataset

NIR Iris images Under Alcohol Effect

iris-clase

Iris Species Classifier using Flux.jl

Table_1_Validation of the Relationship Between Iris Color and Uveal Melanoma...

Iris Species

iris_clase

Iris Dataset (Dummy Version)

Data_Sheet_1_Deep learning-based ultrasonographic classification of canine...

Classificateurs bayésiens naïfs | Naive Bayes

Iris_Classification

Predicting Iris Species Using Machine Learning