100+ datasets found
  1. Iris Species

    • kaggle.com
    zip
    Updated Sep 27, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UCI Machine Learning (2016). Iris Species [Dataset]. https://www.kaggle.com/datasets/uciml/iris
    Explore at:
    zip(3687 bytes)Available download formats
    Dataset updated
    Sep 27, 2016
    Dataset authored and provided by
    UCI Machine Learning
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The Iris dataset was used in R.A. Fisher's classic 1936 paper, The Use of Multiple Measurements in Taxonomic Problems, and can also be found on the UCI Machine Learning Repository.

    It includes three iris species with 50 samples each as well as some properties about each flower. One flower species is linearly separable from the other two, but the other two are not linearly separable from each other.

    The columns in this dataset are:

    • Id
    • SepalLengthCm
    • SepalWidthCm
    • PetalLengthCm
    • PetalWidthCm
    • Species

    Sepal Width vs. Sepal Length

  2. Clustering Iris Data Set

    • kaggle.com
    Updated Sep 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rifki Ilham (2023). Clustering Iris Data Set [Dataset]. https://www.kaggle.com/datasets/rifkiilham/clustering-iris-data-set
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 2, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rifki Ilham
    Description

    The Iris flower data set or Fisher's Iris data set is a multivariate data set used and made famous by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems as an example of linear discriminant analysis. Please use this data set to clustering the iris flowers data. You can use k-means clustering algorithm.

  3. Iris Dataset

    • kaggle.com
    Updated Aug 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MahtabLatifpour (2024). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/mslatifpour/iris-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 29, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    MahtabLatifpour
    Description

    The Iris dataset is one of the most famous datasets used in machine learning and statistics. It was introduced by the British biologist and statistician Ronald A. Fisher in 1936. The dataset consists of 150 observations of iris flowers, with each observation belonging to one of three species (classes) of the Iris flower. The dataset is widely used for classification purposes and is often used as a beginner's dataset for learning machine learning techniques.

    Features The Iris dataset contains four features (attributes), which are:

    Sepal Length: The length of the sepal in centimeters. Sepal Width: The width of the sepal in centimeters. Petal Length: The length of the petal in centimeters. Petal Width: The width of the petal in centimeters. Each of these features is a continuous numerical value. These features are measured for each of the 150 iris flowers in the dataset.

    Classes The dataset contains three classes, which correspond to three different species of the Iris flower:

    Iris Setosa: This class is often linearly separable from the other two classes, making it easy to classify. Iris Versicolor: This class is somewhat more challenging to distinguish from the Iris Virginica class. Iris Virginica: The third class, which can sometimes be difficult to distinguish from Iris Versicolor based on the given features. Each class has 50 observations, making the dataset balanced with equal representation of each class. The goal when using this dataset is typically to build a model that can predict the species of an iris flower based on its measurements.

    In summary, the Iris dataset is a small, well-structured dataset that includes:

    150 samples (observations) of iris flowers. 4 features (sepal length, sepal width, petal length, petal width). 3 classes (Iris Setosa, Iris Versicolor, Iris Virginica), each with 50 samples. The dataset's simplicity and clear structure make it ideal for demonstrating basic cl

  4. 🌼 Unveiling the Iris Dataset 🌸

    • kaggle.com
    Updated Jul 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HARISH KUMARdatalab (2023). 🌼 Unveiling the Iris Dataset 🌸 [Dataset]. http://doi.org/10.34740/kaggle/dsv/6209742
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 28, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    HARISH KUMARdatalab
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context: 🌼 The Iris flower dataset, an iconic multivariate set, was first introduced by the renowned British statistician and biologist, Ronald Fisher in 1936 πŸ“. Commonly known as Anderson's Iris dataset, it was curated by Edgar Anderson to measure the morphologic variation of three Iris species 🌸: Iris Setosa, Iris Virginica, and Iris Versicolor.

    πŸ“Š The set comprises 100 samples from each species, with four features - sepal length, sepal width, petal length, and petal width, measured in centimetres.

    πŸ”¬ This dataset has since served as a standard test case for various statistical classification techniques in machine learning, including the widely used support vector machines (SVM).

    So, whether you're a newbie dipping your toes into the ML pond or a seasoned data scientist testing out a new classification method, the Iris dataset is a classic starting point! πŸŽ―πŸš€

    Columns:

    1. Sepal Length: The length of the sepal of the iris flower, is measured in centimetres.
    2. Sepal Width: The width of the sepal of the iris flower, measured in centimetres.
    3. Petal Length: The length of the petal of the iris flower, is measured in centimetres.
    4. Petal Width: The width of the petal of the iris flower, measured in centimetres.
    5. Species:The specific species of the iris flower, categorized into Sentosa, Virginica, and Versicolor.

    Problem Statement:

    1.🎯 Classification Challenge: Can you accurately predict the species of an Iris flower based on the four given measurements: sepal length, sepal width, petal length, and petal width?

    2.πŸ’‘ Feature Importance: Which feature (sepal length, sepal width, petal length, or petal width) is the most significant in distinguishing between the species of Iris flowers?

    3.πŸ“ˆ Data Scaling: How does standardization (or normalization) of the features affect the performance of your classification models?

    4.πŸ§ͺ Model Experimentation: Can simpler models such as Logistic Regression perform as well as more complex models like Support Vector Machines or Neural Networks on the Iris dataset? Compare the performance of various models.

    5.πŸ€– AutoML Challenge: Use AutoML tools (like Google's AutoML or H2O's AutoML) to build a classification model. How does its performance compare with your handcrafted models?

    Kindly, upvote if you find the dataset interesting

  5. Iris Dataset

    • kaggle.com
    Updated Jun 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NotePub (2021). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/notepub/iris-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 4, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    NotePub
    Description

    Dataset

    This dataset was created by NotePub

    Contents

  6. Iris Dataset

    • kaggle.com
    Updated May 3, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Taylan Torres (2021). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/taylantorres/iris-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 3, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Taylan Torres
    Description

    Dataset

    This dataset was created by Taylan Torres

    Contents

  7. Refined Iris Dataset

    • kaggle.com
    Updated Jun 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GIRITHARAN MANI (2023). Refined Iris Dataset [Dataset]. https://www.kaggle.com/mystifoe77/iris-clean-dataset/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 23, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    GIRITHARAN MANI
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset provides a refined version of the popular Iris dataset, tailored for enhanced usability in machine learning and data science applications. Key improvements include:
    - Data Quality: Removal of duplicate and inconsistent entries.
    - Feature Consistency: Verified feature distributions to ensure better modeling accuracy.
    - Enhanced Labeling: Clear and intuitive labels for easier interpretability.

    This dataset is ideal for beginners and professionals alike, offering a robust foundation for testing classification algorithms and exploring supervised learning workflows.

    Tags

    Classification, Machine Learning, Data Cleaning, Iris, Clean Data, Data Analysis

    File Details

    File Name: Iris_clean_dataset.csv
    - Size: 5.11 KB
    - Rows: 150
    - Columns: 6
    - Columns:
    1. Id
    2. SepalLengthCm
    3. SepalWidthCm
    4. PetalLengthCm
    5. PetalWidthCm
    6. Species

    Each row corresponds to a single observation of Iris flower measurements, including species classifications (Iris-setosa, Iris-versicolor, Iris-virginica).

    Usability

    Usability Score: 1.76
    This score reflects the dataset's ease of use for various machine learning and data analysis tasks.

    License

    License Type: CC BY 4.0
    You are free to use, modify, and distribute this dataset, provided appropriate credit is given to the original author.

    Expected Update Frequency

    Frequency: This dataset will not receive regular updates. However, feedback is welcomed for future revisions.

    Provenance

    Source: Original Iris dataset with modifications.
    Methodology: Data cleaning involved removing anomalies, revalidating measurements, and restructuring for compatibility with modern ML workflows.

    Encourage interaction:
    "_Your engagement improves this dataset’s visibility. Feel free to comment or share your use case._"

    Example Use Cases

    • Learning: Ideal for beginners experimenting with machine learning models like Logistic Regression, Random Forest, and KNN.
    • Research: Test your novel classification techniques on this cleaned version.
    • Application: Use it in practical ML projects for training supervised learning models.

    Notes to Users

    If you find this dataset helpful, consider leaving feedback or sharing your implementation in the Kaggle discussions section. Collaboration and suggestions are always welcome!

    Let me know if you'd like further refinements or adjustments!

  8. Iris Dataset

    • kaggle.com
    Updated Jun 18, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pranav Joshi (2023). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/joshipranav/iris-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 18, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pranav Joshi
    Description

    Dataset

    This dataset was created by Pranav Joshi

    Contents

  9. Iris Dataset

    • kaggle.com
    Updated Apr 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jaymin151617 (2023). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/jaymin151617/iris-dataset/discussion?sort=undefined
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 19, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Jaymin151617
    Description

    Dataset

    This dataset was created by Jaymin151617

    Contents

  10. Iris Dataset

    • kaggle.com
    Updated Jun 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Zeeshan (2024). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/mzeeshan786/iris-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 23, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muhammad Zeeshan
    Description

    The Iris dataset originated from a seminal paper by British statistician and biologist Ronald Fisher titled "The Use of Multiple Measurements in Taxonomic Problems," published in 1936. Fisher collected and measured samples of iris flowers from three different species: Setosa, Versicolor, and Virginica.

    The dataset comprises 150 samples, with each sample having four features measured: 1. Sepal Length 2. Sepal Width 3. Petal Length 4. Petal Width

    Additionally, each sample is labeled with its corresponding species, making it a supervised learning problem with three target variables: 1. Setosa 2. Versicolor 3. Virginia

    The Iris dataset is often used as a benchmark in machine learning and pattern recognition for tasks like classification and clustering. Its simplicity and clarity make it an excellent starting point for learning various algorithms and techniques.

  11. Iris Dataset

    • kaggle.com
    Updated Jan 11, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Raman V (2020). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/vraman1806/iris-dataset/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 11, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Raman V
    Description

    Dataset

    This dataset was created by Raman V

    Contents

  12. iris dataset

    • kaggle.com
    Updated Aug 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    farida gaber (2024). iris dataset [Dataset]. https://www.kaggle.com/datasets/faridagaber/iris-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 7, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    farida gaber
    Description

    Dataset

    This dataset was created by farida gaber

    Contents

  13. Iris Dataset

    • kaggle.com
    Updated Oct 21, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    7pramod (2020). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/pramod7/iris-dataset/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 21, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    7pramod
    Description

    Dataset

    This dataset was created by 7pramod

    Contents

  14. Iris Dataset

    • kaggle.com
    Updated Apr 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhannad Khaled (2024). Iris Dataset [Dataset]. https://www.kaggle.com/datasets/muhannadkhaled/iris-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 20, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muhannad Khaled
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Muhannad Khaled

    Released under MIT

    Contents

  15. Iris dataset

    • kaggle.com
    Updated Feb 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Obaid Qadri (2024). Iris dataset [Dataset]. https://www.kaggle.com/datasets/muhammadobaidqadri/iris-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 14, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muhammad Obaid Qadri
    Description

    Dataset

    This dataset was created by Muhammad Obaid Qadri

    Contents

  16. iris dataset

    • kaggle.com
    Updated Jan 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Varun D (2024). iris dataset [Dataset]. https://www.kaggle.com/datasets/varund2003/iris-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 17, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Varun D
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Varun D

    Released under MIT

    Contents

  17. IRIS Dataset

    • kaggle.com
    Updated Aug 10, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaurav Dutta (2020). IRIS Dataset [Dataset]. https://www.kaggle.com/datasets/gauravduttakiit/iris-dataset/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 10, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gaurav Dutta
    Description

    Dataset

    This dataset was created by Gaurav Dutta

    Contents

  18. Iris DataSet

    • kaggle.com
    Updated Aug 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harsh_sf (2023). Iris DataSet [Dataset]. https://www.kaggle.com/datasets/harshsf/iris-dataset/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 16, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Harsh_sf
    Description

    Dataset

    This dataset was created by Harsh_sf

    Contents

  19. IRIS Dataset

    • kaggle.com
    Updated Jul 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Syed Touqeer (2020). IRIS Dataset [Dataset]. https://www.kaggle.com/datasets/syedtouqeer/iris-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 2, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Syed Touqeer
    Description

    Dataset

    This dataset was created by Syed Touqeer

    Contents

  20. iris dataset

    • kaggle.com
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chhavideora11@ (2024). iris dataset [Dataset]. https://www.kaggle.com/datasets/chhavideora11/iris-dataset/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 6, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Chhavideora11@
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Chhavideora11@

    Released under Apache 2.0

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
UCI Machine Learning (2016). Iris Species [Dataset]. https://www.kaggle.com/datasets/uciml/iris
Organization logo

Iris Species

Classify iris plants into three species in this classic dataset

Explore at:
zip(3687 bytes)Available download formats
Dataset updated
Sep 27, 2016
Dataset authored and provided by
UCI Machine Learning
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

The Iris dataset was used in R.A. Fisher's classic 1936 paper, The Use of Multiple Measurements in Taxonomic Problems, and can also be found on the UCI Machine Learning Repository.

It includes three iris species with 50 samples each as well as some properties about each flower. One flower species is linearly separable from the other two, but the other two are not linearly separable from each other.

The columns in this dataset are:

  • Id
  • SepalLengthCm
  • SepalWidthCm
  • PetalLengthCm
  • PetalWidthCm
  • Species

Sepal Width vs. Sepal Length

Search
Clear search
Close search
Google apps
Main menu