5 datasets found
  1. h

    food101

    • huggingface.co
    • paperswithcode.com
    • +3more
    Updated Jun 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ETH Zurich (2024). food101 [Dataset]. https://huggingface.co/datasets/ethz/food101
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 3, 2024
    Dataset authored and provided by
    ETH Zurich
    License

    https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

    Description

    Dataset Card for Food-101

      Dataset Summary
    

    This dataset consists of 101 food categories, with 101'000 images. For each class, 250 manually reviewed test images are provided as well as 750 training images. On purpose, the training images were not cleaned, and thus still contain some amount of noise. This comes mostly in the form of intense colors and sometimes wrong labels. All images were rescaled to have a maximum side length of 512 pixels.

      Supported Tasks and… See the full description on the dataset page: https://huggingface.co/datasets/ethz/food101.
    
  2. h

    Food121

    • huggingface.co
    Updated Nov 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit (2023). Food121 [Dataset]. https://huggingface.co/datasets/ItsNotRohit/Food121
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 2, 2023
    Authors
    Rohit
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Details

      Dataset Description
    

    This dataset is the combination of the Food101, Indian Food Classification and The-massive-Indian-Food-Dataset datasets. This Dataset aims to be a viable dataset for Image Classification of Foods with an added Indian context. This dataset has 121 classes with each class having 800 images in the train split and 200 images in the test split. Maximum resolution of images is 512*512. The Food121-224 dataset has all images downscaled to a… See the full description on the dataset page: https://huggingface.co/datasets/ItsNotRohit/Food121.

  3. i

    SEG-FOOD Semantic Food Segmentation Through Deep Learning

    • ieee-dataport.org
    Updated May 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chu Kiong Loo (2022). SEG-FOOD Semantic Food Segmentation Through Deep Learning [Dataset]. https://ieee-dataport.org/open-access/seg-food-semantic-food-segmentation-through-deep-learning
    Explore at:
    Dataset updated
    May 18, 2022
    Authors
    Chu Kiong Loo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PFID

  4. pizza_steak

    • kaggle.com
    Updated May 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SHUBHAM BALASAHEB RAGADE (2025). pizza_steak [Dataset]. https://www.kaggle.com/datasets/shubhamragade/pizza-steak/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 20, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    SHUBHAM BALASAHEB RAGADE
    Description

    We're going to work with are from the Food-101 dataset, a collection of 101 different categories of 101,000 (1000 images per category) real-world images of food dishes.

    • A train directory which contains all of the images in the training dataset with subdirectories each named after a certain class containing images of that class.
    • A test directory with the same structure as the train directory.
    Example of file structure
    
    pizza_steak <- top level folder
    └───train <- training images
    │  └───pizza
    │  │  │  1008104.jpg
    │  │  │  1638227.jpg
    │  │  │  ...   
    │  └───steak
    │    │  1000205.jpg
    │    │  1647351.jpg
    │    │  ...
    │  
    └───test <- testing images
    │  └───pizza
    │  │  │  1001116.jpg
    │  │  │  1507019.jpg
    │  │  │  ...   
    │  └───steak
    │    │  100274.jpg
    │    │  1653815.jpg
    │    │  ...  
     ```
    
    Let's inspect each of the directories we've downloaded.
    
    To so do, we can use the command `ls` which stands for list.
    
  5. Non and Biodegradable Material Dataset

    • kaggle.com
    Updated Jun 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rayhan Zamzamy (2021). Non and Biodegradable Material Dataset [Dataset]. https://www.kaggle.com/rayhanzamzamy/non-and-biodegradable-waste-dataset/tasks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 12, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Rayhan Zamzamy
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Motivation

    Waste. As we all know that waste has become commonplace in many countries in the world. There is nothing wrong, in terms of waste itself defined as the final product that can no longer be used (by humans); residue. The problem lies in 'how do we manage this waste, while we can't use it anymore?'. Several countries have issues related to waste because the rate of waste production is not comparable to its management efforts. These things can be a big problem for the ecosystem.

    With this dataset. I hope we can help waste management efforts with computer vision technology. With this technology, we can identify, track, sort and process it accordingly.

    Content

    This dataset contains approximately 256K images (156K original data) representing two classes, Biodegradable and Non-biodegradable. - Biodegradable, contains materials which can be decomposed naturally by microorganisms, such as foods, plants, fruits, etc. The waste from this material can be processed into compost. - Non-biodegradable, contains materials that cannot be decomposed naturally, for example plastics, metals, inorganic elements, etc. The waste from this material will be recycled into new materials.

    I add augmented data against imbalanced class. Augmented data made by manipulating original data. Image transformation used: horizontal flip, vertical flip, 90deg CW rotation, 90deg CCW rotation.

    In this dataset, I divide the data into two subsets, training set and evaluation set. The training set itself was splitted into 4 parts due to some technical constraints (my internet bandwidth). The thing to know is that the part of the training set don't have a good data distribution. So, don't pass each part directly to your model. Concatenate each part to single dataset. See Quickstart.

    Data files in this dataset have unique name to prevent them from overwritten theirself when concatenating. Below is filename reference. You will need this for filtering this dataset.

    SUBSET.PART_CLASS_CATEGORY_ID.EXT

    SUBSET, the subset where data belong in. Either TEST or TRAIN. PART, part number of subset. Only if the subset splitted into several parts. CLASS, the class of data. BIODEG for biodegradable, or NBIODEG for non-biodegradable. CATEGORY, category of data. ORI for original data, HFL for horizontal flip, VFL for vertical flip, CWR for clockwise rotation, CCW for counter clockwise rotation. ID, data identification number. EXT, data extension. Either .jpg or .jpeg.

    Acknowledgements

    In this part, i would like to give an attribution to several Kaggle's users because without their great work, this dataset would be incomplete. As i mentioned that this dataset's source consist of another Kaggle dataset. So, this is my responsibility to do this. - Food Images (Food-101) - (K Scott Mader) - Fruit and Vegetable Image Recognition - (Kritik Seth) - Waste Classification data - (Sashaank Sekar) - Waste Classification Data v2 - (sapal6) - waste_pictures - (且听风吟)

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
ETH Zurich (2024). food101 [Dataset]. https://huggingface.co/datasets/ethz/food101

food101

Food-101

ethz/food101

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 3, 2024
Dataset authored and provided by
ETH Zurich
License

https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

Description

Dataset Card for Food-101

  Dataset Summary

This dataset consists of 101 food categories, with 101'000 images. For each class, 250 manually reviewed test images are provided as well as 750 training images. On purpose, the training images were not cleaned, and thus still contain some amount of noise. This comes mostly in the form of intense colors and sometimes wrong labels. All images were rescaled to have a maximum side length of 512 pixels.

  Supported Tasks and… See the full description on the dataset page: https://huggingface.co/datasets/ethz/food101.
Search
Clear search
Close search
Google apps
Main menu