28 datasets found
  1. T

    stl10

    • tensorflow.org
    Updated Jan 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). stl10 [Dataset]. https://www.tensorflow.org/datasets/catalog/stl10
    Explore at:
    Dataset updated
    Jan 13, 2023
    Description

    The STL-10 dataset is an image recognition dataset for developing unsupervised feature learning, deep learning, self-taught learning algorithms. It is inspired by the CIFAR-10 dataset but with some modifications. In particular, each class has fewer labeled training examples than in CIFAR-10, but a very large set of unlabeled examples is provided to learn image models prior to supervised training. The primary challenge is to make use of the unlabeled data (which comes from a similar but different distribution from the labeled data) to build a useful prior. All images were acquired from labeled examples on ImageNet.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('stl10', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/stl10-1.0.0.png" alt="Visualization" width="500px">

  2. STL10-Labeled Image Recognition Dataset

    • kaggle.com
    Updated Aug 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semih Yagli (2025). STL10-Labeled Image Recognition Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/12688697
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Semih Yagli
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This public dataset contains labels for the unlabeled 100,000 pictures in the STL-10 dataset.

    The dataset is human labeled with AI aid through Etiqueta, the one and only gamified mobile data labeling application. stl10.py is a python script written by Martin Tutek to download the complete STL10 dataset. labels.json contains labels for the 100,000 previously unlabeled images in the STL10 dataset legend.json is a mapping of the labels used. stats.ipynb presents a few statistics regarding the 100,000 newly labeled images.

    If you use this dataset in your research please cite the following:

    @techreport{yagli2025etiqueta,
     author = {Semih Yagli},
     title = {Etiqueta: AI-Aided, Gamified Data Labeling to Label and Segment Data},
     year = {2025},
     number = {TR-2025-0001},
     address = {NJ, USA},
     month = Apr.,
     url = {https://www.aidatalabel.com/technical_reports/aidatalabel_tr_2025_0001.pdf},
     institution = {AI Data Label},
    }
    
    @inproceedings{coates2011analysis,
      title = {An analysis of single-layer networks in unsupervised feature learning},
      author = {Coates, Adam and Ng, Andrew and Lee, Honglak},
      booktitle = {Proceedings of the fourteenth international conference on artificial intelligence and statistics},
      pages = {215--223},
      year = {2011},
      organization = {JMLR Workshop and Conference Proceedings}
    }
    

    Note: The dataset is imported to Kaggle from: https://github.com/semihyagli/STL10-Labeled See also: https://github.com/semihyagli/STL10_Segmentation

    If you have comments and questions about Etiqueta or about this dataset, please reach us out at contact@aidatalabel.com

  3. a

    Stanford STL-10 Image Dataset

    • academictorrents.com
    bittorrent
    Updated Nov 26, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adam Coates and Honglak Lee and Andrew Y. Ng (2015). Stanford STL-10 Image Dataset [Dataset]. https://academictorrents.com/details/a799a2845ac29a66c07cf74e2a2838b6c5698a6a
    Explore at:
    bittorrent(2640397119)Available download formats
    Dataset updated
    Nov 26, 2015
    Dataset authored and provided by
    Adam Coates and Honglak Lee and Andrew Y. Ng
    License

    https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

    Description

    ![]() The STL-10 dataset is an image recognition dataset for developing unsupervised feature learning, deep learning, self-taught learning algorithms. It is inspired by the CIFAR-10 dataset but with some modifications. In particular, each class has fewer labeled training examples than in CIFAR-10, but a very large set of unlabeled examples is provided to learn image models prior to supervised training. The primary challenge is to make use of the unlabeled data (which comes from a similar but different distribution from the labeled data) to build a useful prior. We also expect that the higher resolution of this dataset (96x96) will make it a challenging benchmark for developing more scalable unsupervised learning methods. Overview 10 classes: airplane, bird, car, cat, deer, dog, horse, monkey, ship, truck. Images are 96x96 pixels, color. 500 training images (10 pre-defined folds), 800 test images per class. 100000 unlabeled images for uns

  4. t

    CIFAR-10 and STL-10

    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). CIFAR-10 and STL-10 [Dataset]. https://service.tib.eu/ldmservice/dataset/cifar-10-and-stl-10
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    The dataset used in the paper is CIFAR-10 and STL-10, which are commonly used datasets for image classification tasks.

  5. t

    CIFAR-10, STL-10, and ImageNet

    • service.tib.eu
    Updated Dec 2, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). CIFAR-10, STL-10, and ImageNet [Dataset]. https://service.tib.eu/ldmservice/dataset/cifar-10--stl-10--and-imagenet
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    The dataset used in the paper is CIFAR-10, STL-10, and ImageNet.

  6. h

    STL10-Labeled

    • huggingface.co
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semih (2025). STL10-Labeled [Dataset]. https://huggingface.co/datasets/semihyagli/STL10-Labeled
    Explore at:
    Dataset updated
    Jul 9, 2025
    Authors
    Semih
    Description

    STL10-Labeled

    This public repo contains labels for the unlabeled pictures in the stl10 dataset. Please refer to files Files and versions tab above. You can also refer to my original repo https://github.com/semihyagli/STL10-Labeled Please consider sponsoring this repo so that we can continue to develop high-quality datasets for the ML/AI research. To become a sponsor: GitHub Sponsors Buy me a coffee You can also sponsor us by downloading our free application, Etiqueta, to your… See the full description on the dataset page: https://huggingface.co/datasets/semihyagli/STL10-Labeled.

  7. t

    MNIST, CIFAR10 and STL10 - Dataset - LDM

    • service.tib.eu
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). MNIST, CIFAR10 and STL10 - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/mnist--cifar10-and-stl10
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The dataset used in the paper is MNIST, CIFAR10 and STL10. These are datasets for image classification tasks.

  8. t

    STL-10 dataset - Dataset - LDM

    • service.tib.eu
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). STL-10 dataset - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/stl-10-dataset
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The dataset used in this paper is a collection of images from the STL-10 dataset, preprocessed and used for training and evaluation of the proposed diffusion spectral entropy and diffusion spectral mutual information methods.

  9. h

    CleanSTL-10

    • huggingface.co
    Updated Jun 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Yann (2025). CleanSTL-10 [Dataset]. https://huggingface.co/datasets/Shu1L0n9/CleanSTL-10
    Explore at:
    Dataset updated
    Jun 5, 2025
    Authors
    Rui Yann
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for STL-10 Cleaned (Deduplicated Training Set)

    Paper | Code

      Dataset Description
    

    This dataset is a modified version of the STL-10 dataset. The primary modification involves deduplicating the training set by removing any images that are exact byte-for-byte matches (based on SHA256 hash) with images present in the original STL-10 test set. The dataset comprises this cleaned training set and the original, unmodified STL-10 test set. The goal is to provide a… See the full description on the dataset page: https://huggingface.co/datasets/Shu1L0n9/CleanSTL-10.

  10. h

    STL10-Segmented

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semih, STL10-Segmented [Dataset]. https://huggingface.co/datasets/semihyagli/STL10-Segmented
    Explore at:
    Authors
    Semih
    Description

    STL10 - Segmentation

    Please consider sponsoring this repo so that we can continue to develop high-quality datasets for the AI and ML research. To become a sponsor: GitHub Sponsors Buy me a coffee You can also sponsor us by downloading our free application, Etiqueta, to your devices: Etiqueta on iOS or Apple Chip Macs Etiqueta on Android This repo contains segmented images for the labeled part of the STL-10 Dataset.
    If you are looking for STL10-Labeled variant of the dataset… See the full description on the dataset page: https://huggingface.co/datasets/semihyagli/STL10-Segmented.

  11. t

    MNIST, KMNIST, FashionMNIST, STL-10 and CIFAR-10

    • service.tib.eu
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). MNIST, KMNIST, FashionMNIST, STL-10 and CIFAR-10 [Dataset]. https://service.tib.eu/ldmservice/dataset/mnist--kmnist--fashionmnist--stl-10-and-cifar-10
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The MNIST, KMNIST, FashionMNIST, STL-10 and CIFAR-10 datasets are used for few-shot learning experiments.

  12. Model Zoo: A Dataset of Diverse Populations of Neural Network Models - STL10...

    • zenodo.org
    • data.niaid.nih.gov
    json, zip
    Updated Jun 13, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth; Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth (2022). Model Zoo: A Dataset of Diverse Populations of Neural Network Models - STL10 - Raw Datasets [Dataset]. http://doi.org/10.5281/zenodo.6631784
    Explore at:
    zip, jsonAvailable download formats
    Dataset updated
    Jun 13, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth; Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    In the last years, neural networks have evolved from laboratory environments to the state-of-the-art for many real-world problems. Our hypothesis is that neural network models (i.e., their weights and biases) evolve on unique, smooth trajectories in weight space during training. Following, a population of such neural network models (refereed to as “model zoo”) would form topological structures in weight space. We think that the geometry, curvature and smoothness of these structures contain information about the state of training and can be reveal latent properties of individual models. With such zoos, one could investigate novel approaches for (i) model analysis, (ii) discover unknown learning dynamics, (iii) learn rich representations of such populations, or (iv) exploit the model zoos for generative modelling of neural network weights and biases. Unfortunately, the lack of standardized model zoos and available benchmarks significantly increases the friction for further research about populations of neural networks. With this work, we publish a novel dataset of model zoos containing systematically generated and diverse populations of neural network models for further research. In total the proposed model zoo dataset is based on six image datasets, consist of 24 model zoos with varying hyperparameter combinations are generated and includes 47’360 unique neural network models resulting in over 2’415’360 collected model states. Additionally, to the model zoo data we provide an in-depth analysis of the zoos and provide benchmarks for multiple downstream tasks as mentioned before.

    Dataset

    This dataset is part of a larger collection of model zoos and contains the zoos trained on the labelled samples from STL10. All zoos with extensive information and code can be found at www.modelzoos.cc.

    This repository contains the raw model zoos as collections of models (file names beginning with "cifar_"). Zoos are trained with small and large CNN models, in three configurations varying the seed only (seed), varying hyperparameters with fixed seeds (hyp_fix) or varying hyperparameters with random seeds (hyp_rand). Due to the large filesize, the preprocessed datasets are hosted in a separate repository. The index_dict.json files contain information on how to read the vectorized models.

    For more information on the zoos and code to access and use the zoos, please see www.modelzoos.cc.

  13. f

    Results on STL-10 at 500 epoch.

    • plos.figshare.com
    xls
    Updated Aug 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yiming Kuang; Jianwu Guan; Hongyun Liu; Fei Chen; Zihua Wang; Weidong Wang (2025). Results on STL-10 at 500 epoch. [Dataset]. http://doi.org/10.1371/journal.pone.0329273.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Aug 25, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Yiming Kuang; Jianwu Guan; Hongyun Liu; Fei Chen; Zihua Wang; Weidong Wang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    St. Louis
    Description

    We introduce PiCCL (Primary Component Contrastive Learning), a self-supervised contrastive learning framework that utilizes a multiplex Siamese network structure consisting of many identical branches rather than 2 to maximize learning efficiency. PiCCL is simple and light weight, it does not use asymmetric networks, intricate pretext tasks, hard to compute loss functions or multimodal data, which are common for multiview contrastive learning frameworks and could hinder performance, simplicity, generalizability and explainability. PiCCL obtains multiple positive samples by applying the same image augmentation paradigm to the same image numerous times, the network loss is calculated using a custom designed Loss function named PiCLoss (Primary Component Loss) to take advantage of PiCCL’s unique structure while keeping it computationally lightweight. To demonstrate its strength, we benchmarked PiCCL against various state-of-the-art self-supervised algorithms on multiple datasets including CIFAR-10, CIFAR-100, and STL-10. PiCCL achieved top performance in most of our tests, with top-1 accuracy of 94%, 72%, and 97% for the 3 datasets respectively. But where PiCCL excels is in the small batch learning scenarios. When testing on STL-10 using a batch size of 8, PiCCL still achieved 93% accuracy, outperforming the competition by about 3 percentage points.

  14. m

    Stl10

    • rgd.mcw.edu
    Updated Dec 15, 2003
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rat Genome Database (2003). Stl10 [Dataset]. https://rgd.mcw.edu/rgdweb/report/qtl/main.html?id=1354665
    Explore at:
    Dataset updated
    Dec 15, 2003
    Dataset authored and provided by
    Rat Genome Database
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    QTL Report

  15. t

    Young-Jin Park, Hao Wang, Shervin Ardeshir, Navid Azizan (2024). Dataset:...

    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Young-Jin Park, Hao Wang, Shervin Ardeshir, Navid Azizan (2024). Dataset: CIFAR-10, CIFAR-100, and STL-10 datasets. https://doi.org/10.57702/m16c6gw8 [Dataset]. https://service.tib.eu/ldmservice/dataset/cifar-10--cifar-100--and-stl-10-datasets
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    The dataset used in the paper is not explicitly described, but it is mentioned that the authors used CIFAR-10, CIFAR-100, and STL-10 datasets for training and testing the embedding functions.

  16. f

    Results on CIFAR-10 & CIFAR-100.

    • plos.figshare.com
    • figshare.com
    xls
    Updated Aug 25, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yiming Kuang; Jianwu Guan; Hongyun Liu; Fei Chen; Zihua Wang; Weidong Wang (2025). Results on CIFAR-10 & CIFAR-100. [Dataset]. http://doi.org/10.1371/journal.pone.0329273.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Aug 25, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Yiming Kuang; Jianwu Guan; Hongyun Liu; Fei Chen; Zihua Wang; Weidong Wang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We introduce PiCCL (Primary Component Contrastive Learning), a self-supervised contrastive learning framework that utilizes a multiplex Siamese network structure consisting of many identical branches rather than 2 to maximize learning efficiency. PiCCL is simple and light weight, it does not use asymmetric networks, intricate pretext tasks, hard to compute loss functions or multimodal data, which are common for multiview contrastive learning frameworks and could hinder performance, simplicity, generalizability and explainability. PiCCL obtains multiple positive samples by applying the same image augmentation paradigm to the same image numerous times, the network loss is calculated using a custom designed Loss function named PiCLoss (Primary Component Loss) to take advantage of PiCCL’s unique structure while keeping it computationally lightweight. To demonstrate its strength, we benchmarked PiCCL against various state-of-the-art self-supervised algorithms on multiple datasets including CIFAR-10, CIFAR-100, and STL-10. PiCCL achieved top performance in most of our tests, with top-1 accuracy of 94%, 72%, and 97% for the 3 datasets respectively. But where PiCCL excels is in the small batch learning scenarios. When testing on STL-10 using a batch size of 8, PiCCL still achieved 93% accuracy, outperforming the competition by about 3 percentage points.

  17. t

    Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth...

    • service.tib.eu
    Updated Dec 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth (2024). Dataset: CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet. https://doi.org/10.57702/sw7lwn2n [Dataset]. https://service.tib.eu/ldmservice/dataset/cifar-10--cifar-100--stl-10--and-tiny-imagenet
    Explore at:
    Dataset updated
    Dec 3, 2024
    Description

    The dataset used in the paper is a de-noising diffusion probabilistic model (DDPM) trained on CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet.

  18. Z

    Baseline models and optimized CNN models for 8 datasets

    • data.niaid.nih.gov
    • scidb.cn
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thoma, Martin (2020). Baseline models and optimized CNN models for 8 datasets [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_582892
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset authored and provided by
    Thoma, Martin
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Datasets:

    Asirra

    CIFAR-10

    CIFAR-100

    GTSRB

    HASYv2

    MNIST

    STL-10

    SVHN

  19. t

    Visual Domain Adaptation - Dataset - LDM

    • service.tib.eu
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Visual Domain Adaptation - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/visual-domain-adaptation
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The MNIST, MNIST-M, Street View House Numbers (SVHN), Synthetic Digits (SYN DIGITS), CIFAR-10 and STL-10 datasets are used for visual domain adaptation experiments.

  20. w

    Dataset of book subjects that contain Data structures with STL

    • workwithdata.com
    Updated Nov 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2024). Dataset of book subjects that contain Data structures with STL [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=j0-book&fop0=%3D&fval0=Data+structures+with+STL&j=1&j0=books
    Explore at:
    Dataset updated
    Nov 7, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book subjects. It has 2 rows and is filtered where the books is Data structures with STL. It features 10 columns including number of authors, number of books, earliest publication date, and latest publication date.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2023). stl10 [Dataset]. https://www.tensorflow.org/datasets/catalog/stl10

stl10

Explore at:
Dataset updated
Jan 13, 2023
Description

The STL-10 dataset is an image recognition dataset for developing unsupervised feature learning, deep learning, self-taught learning algorithms. It is inspired by the CIFAR-10 dataset but with some modifications. In particular, each class has fewer labeled training examples than in CIFAR-10, but a very large set of unlabeled examples is provided to learn image models prior to supervised training. The primary challenge is to make use of the unlabeled data (which comes from a similar but different distribution from the labeled data) to build a useful prior. All images were acquired from labeled examples on ImageNet.

To use this dataset:

import tensorflow_datasets as tfds

ds = tfds.load('stl10', split='train')
for ex in ds.take(4):
 print(ex)

See the guide for more informations on tensorflow_datasets.

https://storage.googleapis.com/tfds-data/visualization/fig/stl10-1.0.0.png" alt="Visualization" width="500px">

Search
Clear search
Close search
Google apps
Main menu