11 datasets found
  1. P

    STL-10 Dataset

    • paperswithcode.com
    Updated Feb 2, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adam Coates; Andrew Y. Ng; Honglak Lee (2022). STL-10 Dataset [Dataset]. https://paperswithcode.com/dataset/stl-10
    Explore at:
    Dataset updated
    Feb 2, 2021
    Authors
    Adam Coates; Andrew Y. Ng; Honglak Lee
    Description

    The STL-10 is an image dataset derived from ImageNet and popularly used to evaluate algorithms of unsupervised feature learning or self-taught learning. Besides 100,000 unlabeled images, it contains 13,000 labeled images from 10 object classes (such as birds, cats, trucks), among which 5,000 images are partitioned for training while the remaining 8,000 images for testing. All the images are color images with 96×96 pixels in size.

  2. t

    MNIST, CIFAR10 and STL10 - Dataset - LDM

    • service.tib.eu
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). MNIST, CIFAR10 and STL10 - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/mnist--cifar10-and-stl10
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The dataset used in the paper is MNIST, CIFAR10 and STL10. These are datasets for image classification tasks.

  3. a

    Stanford STL-10 Image Dataset

    • academictorrents.com
    bittorrent
    Updated Nov 26, 2015
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adam Coates and Honglak Lee and Andrew Y. Ng (2015). Stanford STL-10 Image Dataset [Dataset]. https://academictorrents.com/details/a799a2845ac29a66c07cf74e2a2838b6c5698a6a
    Explore at:
    bittorrent(2640397119)Available download formats
    Dataset updated
    Nov 26, 2015
    Dataset authored and provided by
    Adam Coates and Honglak Lee and Andrew Y. Ng
    License

    https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

    Description

    ![]() The STL-10 dataset is an image recognition dataset for developing unsupervised feature learning, deep learning, self-taught learning algorithms. It is inspired by the CIFAR-10 dataset but with some modifications. In particular, each class has fewer labeled training examples than in CIFAR-10, but a very large set of unlabeled examples is provided to learn image models prior to supervised training. The primary challenge is to make use of the unlabeled data (which comes from a similar but different distribution from the labeled data) to build a useful prior. We also expect that the higher resolution of this dataset (96x96) will make it a challenging benchmark for developing more scalable unsupervised learning methods. Overview 10 classes: airplane, bird, car, cat, deer, dog, horse, monkey, ship, truck. Images are 96x96 pixels, color. 500 training images (10 pre-defined folds), 800 test images per class. 100000 unlabeled images for uns

  4. Z

    Model Zoo: A Dataset of Diverse Populations of Neural Network Models - STL10...

    • data.niaid.nih.gov
    Updated Jun 13, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Schürholt, Konstantin (2022). Model Zoo: A Dataset of Diverse Populations of Neural Network Models - STL10 - Raw Datasets [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6631783
    Explore at:
    Dataset updated
    Jun 13, 2022
    Dataset provided by
    Borth, Damian
    Taskiran, Diyar
    Giró-i-Nieto, Xavier
    Schürholt, Konstantin
    Knyazev, Boris
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    In the last years, neural networks have evolved from laboratory environments to the state-of-the-art for many real-world problems. Our hypothesis is that neural network models (i.e., their weights and biases) evolve on unique, smooth trajectories in weight space during training. Following, a population of such neural network models (refereed to as “model zoo”) would form topological structures in weight space. We think that the geometry, curvature and smoothness of these structures contain information about the state of training and can be reveal latent properties of individual models. With such zoos, one could investigate novel approaches for (i) model analysis, (ii) discover unknown learning dynamics, (iii) learn rich representations of such populations, or (iv) exploit the model zoos for generative modelling of neural network weights and biases. Unfortunately, the lack of standardized model zoos and available benchmarks significantly increases the friction for further research about populations of neural networks. With this work, we publish a novel dataset of model zoos containing systematically generated and diverse populations of neural network models for further research. In total the proposed model zoo dataset is based on six image datasets, consist of 24 model zoos with varying hyperparameter combinations are generated and includes 47’360 unique neural network models resulting in over 2’415’360 collected model states. Additionally, to the model zoo data we provide an in-depth analysis of the zoos and provide benchmarks for multiple downstream tasks as mentioned before.

    Dataset

    This dataset is part of a larger collection of model zoos and contains the zoos trained on the labelled samples from STL10. All zoos with extensive information and code can be found at www.modelzoos.cc.

    This repository contains the raw model zoos as collections of models (file names beginning with "cifar_"). Zoos are trained with small and large CNN models, in three configurations varying the seed only (seed), varying hyperparameters with fixed seeds (hyp_fix) or varying hyperparameters with random seeds (hyp_rand). Due to the large filesize, the preprocessed datasets are hosted in a separate repository. The index_dict.json files contain information on how to read the vectorized models.

    For more information on the zoos and code to access and use the zoos, please see www.modelzoos.cc.

  5. h

    wds_stl10_test

    • huggingface.co
    Updated Apr 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dhruba Ghosh (2023). wds_stl10_test [Dataset]. https://huggingface.co/datasets/djghosh/wds_stl10_test
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 14, 2023
    Authors
    Dhruba Ghosh
    Description

    STL-10 (Test set only)

    Original paper: An Analysis of Single Layer Networks in Unsupervised Feature Learning Homepage: https://cs.stanford.edu/~acoates/stl10/ Bibtex: @InProceedings{pmlr-v15-coates11a, title = {An Analysis of Single-Layer Networks in Unsupervised Feature Learning}, author = {Coates, Adam and Ng, Andrew and Lee, Honglak}, booktitle = {Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics}, pages = {215--223}… See the full description on the dataset page: https://huggingface.co/datasets/djghosh/wds_stl10_test.

  6. t

    Mixture of Gaussians, CIFAR-10, STL-10, CelebA, and ImageNet - Dataset - LDM...

    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Mixture of Gaussians, CIFAR-10, STL-10, CelebA, and ImageNet - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/mixture-of-gaussians--cifar-10--stl-10--celeba--and-imagenet
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    The dataset used in the paper is a mixture of Gaussians, CIFAR-10, STL-10, CelebA, and ImageNet.

  7. Baseline models and optimized CNN models for 8 datasets

    • zenodo.org
    • scidb.cn
    • +1more
    application/gzip
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Thoma; Martin Thoma (2020). Baseline models and optimized CNN models for 8 datasets [Dataset]. http://doi.org/10.5281/zenodo.582892
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Martin Thoma; Martin Thoma
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Datasets:

    • Asirra
    • CIFAR-10
    • CIFAR-100
    • GTSRB
    • HASYv2
    • MNIST
    • STL-10
    • SVHN
  8. t

    Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth...

    • service.tib.eu
    Updated Dec 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth (2024). Dataset: CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet. https://doi.org/10.57702/sw7lwn2n [Dataset]. https://service.tib.eu/ldmservice/dataset/cifar-10--cifar-100--stl-10--and-tiny-imagenet
    Explore at:
    Dataset updated
    Dec 3, 2024
    Description

    The dataset used in the paper is a de-noising diffusion probabilistic model (DDPM) trained on CIFAR-10, CIFAR-100, STL-10, and Tiny-ImageNet.

  9. f

    Classification performance of the WideResNet-16 architecture on the STL-10...

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Junhyeok An; Soojin Jang; Junehyoung Kwon; Kyohoon Jin; YoungBin Kim (2023). Classification performance of the WideResNet-16 architecture on the STL-10 dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0274767.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Junhyeok An; Soojin Jang; Junehyoung Kwon; Kyohoon Jin; YoungBin Kim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Classification performance of the WideResNet-16 architecture on the STL-10 dataset.

  10. h

    CleanSTL-10

    • huggingface.co
    Updated Jun 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Yann (2025). CleanSTL-10 [Dataset]. https://huggingface.co/datasets/Shu1L0n9/CleanSTL-10
    Explore at:
    Dataset updated
    Jun 5, 2025
    Authors
    Rui Yann
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for STL-10 Cleaned (Deduplicated Training Set)

      Dataset Description
    

    This dataset is a modified version of the STL-10 dataset. The primary modification involves deduplicating the training set by removing any images that are exact byte-for-byte matches (based on SHA256 hash) with images present in the original STL-10 test set. The dataset comprises this cleaned training set and the original, unmodified STL-10 test set. The goal is to provide a cleaner separation… See the full description on the dataset page: https://huggingface.co/datasets/Shu1L0n9/CleanSTL-10.

  11. t

    Progressive Feedforward Collapse of ResNet Training - Dataset - LDM

    • service.tib.eu
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Progressive Feedforward Collapse of ResNet Training - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/progressive-feedforward-collapse-of-resnet-training
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The dataset used in the paper is a ResNet trained on various datasets, including MNIST, Fashion MNIST, CIFAR10, STL10, and CIFAR100.

  12. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Adam Coates; Andrew Y. Ng; Honglak Lee (2022). STL-10 Dataset [Dataset]. https://paperswithcode.com/dataset/stl-10

STL-10 Dataset

Self-Taught Learning 10

Explore at:
Dataset updated
Feb 2, 2021
Authors
Adam Coates; Andrew Y. Ng; Honglak Lee
Description

The STL-10 is an image dataset derived from ImageNet and popularly used to evaluate algorithms of unsupervised feature learning or self-taught learning. Besides 100,000 unlabeled images, it contains 13,000 labeled images from 10 object classes (such as birds, cats, trucks), among which 5,000 images are partitioned for training while the remaining 8,000 images for testing. All the images are color images with 96×96 pixels in size.

Search
Clear search
Close search
Google apps
Main menu