8 datasets found
  1. MNIST-Pytorch

    • kaggle.com
    Updated Aug 18, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mlagunas (2017). MNIST-Pytorch [Dataset]. https://www.kaggle.com/mlagunas/mnist-pytorch/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 18, 2017
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    mlagunas
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    MNIST dataset as downloaded by Pytorch libraries.

  2. Deep-Learning-using-MNIST-Dataset

    • kaggle.com
    Updated Feb 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adeolu Joseph (2023). Deep-Learning-using-MNIST-Dataset [Dataset]. https://www.kaggle.com/datasets/adeolujoseph/deep-learning-using-mnist-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 26, 2023
    Dataset provided by
    Kaggle
    Authors
    Adeolu Joseph
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Pytorch The MNIST database of handwritten digits has a training set of 60,000 examples, and a test set of 10,000 examples. It is a subset of a larger set available from NIST. The digits have been size-normalized and centered in a fixed-size image THE ORIGINAL DATA SET CAN BE FOUND IN http://yann.lecun.com/exdb/mnist/ This projects uses 2 hidden Layers with 128 and 64 units. SGD optimizer was used to improve the Weights and bias

  3. MPI-MNIST Dataset

    • zenodo.org
    application/gzip, pdf
    Updated Jan 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meira Iske; Meira Iske; Hannes Albers; Hannes Albers; Tobias Kluth; Tobias Kluth; Tobias Knopp; Tobias Knopp (2025). MPI-MNIST Dataset [Dataset]. http://doi.org/10.5281/zenodo.12799417
    Explore at:
    application/gzip, pdfAvailable download formats
    Dataset updated
    Jan 14, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Meira Iske; Meira Iske; Hannes Albers; Hannes Albers; Tobias Kluth; Tobias Kluth; Tobias Knopp; Tobias Knopp
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A dataset for magnetic particle imaging based on the MNIST dataset.

    This dataset contains simulated MPI measurements along with ground truth phantoms selected from the https://yann.lecun.com/exdb/mnist/" target="_blank" rel="noopener">MNIST database of handwritten digits. A state-of-the-art model-based system matrix is used to simulate the MPI measurements of the MNIST phantoms. These measurements are equipped with noise perturbations captured by the preclinical MPI system (Bruker, Ettlingen, Germany). The dataset can be utilized in its provided form, while additional data is included to offer flexibility for creating customized versions.

    MPI-MNIST features four different system matrices, each available in three spatial resolutions. The provided data is generated using a specified system matrix at highest spatial resolution. Reconstruction operations can be performed by using any of the provided system matrices at a lower resolution. This setup allows for simulating reconstructions from either an exact or an inexact forward operator. To cover further operator deviation setups, we provide additional noise data for the application of pixelwise noise to the reconstruction system matrix.

    For supporting the development of learning-based methods, a large amount of further noise samples, captured by the Bruker scanner, is provided.

    For a detailed description of the dataset, see arxiv.org/abs/2501.05583.

    The Python-based GitHub repository available at https://github.com/meiraiske/MPI-MNIST" href="https://github.com/meiraiske/MPI-MNIST" target="_blank" rel="noopener">https://github.com/meiraiske/MPI-MNIST can be used for downloading the data from this website and preparing it for project use which includes an integration to PyTorch or PyTorch Lightning modules.

    File Structure

    All data, except for the phantoms, is provided in the MDF file format. This format is specifically tailored to store MPI data and contains metadata corresponding to the experimental setup. The ground truth phantoms are provided as HDF5 files since they do not require any metadata.

    • SM: Contains twelve system matrices named SM_{physical model}_{resolution}.mdf. It covers four physical models given in three resolutions ('coarse', 'int' and 'fine'). The highest resolution ('fine') is used for data generation.
    • large_noise: Contains large_NoiseMeas.mdf with 390060 noise measurements. Each noise measurement has been averaged over ten empty scanner measurements. This can be used e.g. for learning-based methods.

    For dataset in ['train', 'test']:

    • {dataset}_noise: Contains four noise matrices, where each noise measurement has been averaged over ten empty scanner measurements:
      1. NoiseMeas_phantom_{dataset}.mdf : Additive measurement noise for simulated measurements.
      2. NoiseMeas_phantom_bg_{dataset}.mdf : Unused noise reserved for background correction of 1.
      3. NoiseMeas_SM_{dataset}.mdf : System Matrix noise, that can be applied to each pixel of the reconstruction system matrix.
      4. NoiseMeas_SM_bg_{dataset}.mdf : Unused noise reserved for background correction of 3.
    • {dataset}_gt: Contains {dataset}_gt.hdf5 with flattened and preprocessed ground truth MNIST phantoms given in coarse resolution (15x17=255 pixels) with pixel values in [0, 10].
    • {dataset}_obs: Contains {dataset}_obs.mdf with noise free simulated measurements (observations) of {dataset}_gt.hdf5 using the system matrix stored in SM_fluid_opt_fine.mdf.
    • {dataset}_obsnoisy: Contains {dataset}_obsnoisy.mdf with noise contained simulated measurements, resulting from {dataset}_obs.mdf and {dataset}_phantom_noise.mdf.


    In line with MNIST, each MDF/HDF5 file in {dataset}_gt, {dataset}_obs, {dataset}_obsnoisy for dataset in ['train', 'test'] contains 60000 samples for 'train' and 10000 samples for 'test'. The data can be manually reproduced in the intermediate resolution (45x51=2295 pixels) from the files in this dataset using the system matrices in intermediate ('int') resolution for reconstruction and upsampling the ground truth phantoms by 3 pixels per dimension. This case is also implemented in the Github repository .

    The PDF file MPI-MNIST_Metadata.pdf contains a list of meta information for each of the MDF files of this dataset.

  4. h

    MNIST

    • huggingface.co
    Updated Aug 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    P2PFL (2024). MNIST [Dataset]. https://huggingface.co/datasets/p2pfl/MNIST
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 24, 2024
    Authors
    P2PFL
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    🖼️ MNIST (Extracted from PyTorch Vision)

    MNIST is a classic dataset of handwritten digits, widely used for image classification tasks in machine learning.

      ℹ️ Dataset Details
    
    
    
    
    
      📖 Dataset Description
    

    The MNIST database of handwritten digits is a commonly used benchmark dataset in machine learning. It consists of 70,000 grayscale images of handwritten digits (0-9), each with a size of 28x28 pixels. The dataset is split into 60,000 training images and 10,000… See the full description on the dataset page: https://huggingface.co/datasets/p2pfl/MNIST.

  5. o

    Data from: Federated Learning Demonstrator MNIST Example (Version 1.0.1)

    • explore.openaire.eu
    Updated Oct 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Florian Heinrich; Benedikt Franke (2024). Federated Learning Demonstrator MNIST Example (Version 1.0.1) [Dataset]. https://explore.openaire.eu/search/other?orpId=od_1640::02069c46417b50d8cd5088c9b8fbf7d6
    Explore at:
    Dataset updated
    Oct 18, 2024
    Authors
    Florian Heinrich; Benedikt Franke
    Description

    Federated Learning Demonstrator MNIST Example (Version 1.0.1)

  6. Model Zoo: A Dataset of Diverse Populations of Neural Network Models -...

    • zenodo.org
    • data.niaid.nih.gov
    bin, json, zip
    Updated Jun 13, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth; Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth (2022). Model Zoo: A Dataset of Diverse Populations of Neural Network Models - Fashion-MNIST [Dataset]. http://doi.org/10.5281/zenodo.6632105
    Explore at:
    bin, zip, jsonAvailable download formats
    Dataset updated
    Jun 13, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth; Konstantin Schürholt; Diyar Taskiran; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract

    In the last years, neural networks have evolved from laboratory environments to the state-of-the-art for many real-world problems. Our hypothesis is that neural network models (i.e., their weights and biases) evolve on unique, smooth trajectories in weight space during training. Following, a population of such neural network models (refereed to as “model zoo”) would form topological structures in weight space. We think that the geometry, curvature and smoothness of these structures contain information about the state of training and can be reveal latent properties of individual models. With such zoos, one could investigate novel approaches for (i) model analysis, (ii) discover unknown learning dynamics, (iii) learn rich representations of such populations, or (iv) exploit the model zoos for generative modelling of neural network weights and biases. Unfortunately, the lack of standardized model zoos and available benchmarks significantly increases the friction for further research about populations of neural networks. With this work, we publish a novel dataset of model zoos containing systematically generated and diverse populations of neural network models for further research. In total the proposed model zoo dataset is based on six image datasets, consist of 24 model zoos with varying hyperparameter combinations are generated and includes 47’360 unique neural network models resulting in over 2’415’360 collected model states. Additionally, to the model zoo data we provide an in-depth analysis of the zoos and provide benchmarks for multiple downstream tasks as mentioned before.

    Dataset

    This dataset is part of a larger collection of model zoos and contains the zoos trained on the labelled samples from Fashion-MNIST. All zoos with extensive information and code can be found at www.modelzoos.cc.

    This repository contains two types of files: the raw model zoos as collections of models (file names beginning with "fmnist_"), as well as preprocessed model zoos wrapped in a custom pytorch dataset class (filenames beginning with "dataset"). Zoos are trained in three configurations varying the seed only (seed), varying hyperparameters with fixed seeds (hyp_fix) or varying hyperparameters with random seeds (hyp_rand). The index_dict.json files contain information on how to read the vectorized models.

    For more information on the zoos and code to access and use the zoos, please see www.modelzoos.cc.

  7. h

    mnist_augmented

    • huggingface.co
    Updated Aug 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Anis Ur Rahman (2025). mnist_augmented [Dataset]. https://huggingface.co/datasets/ianisdev/mnist_augmented
    Explore at:
    Dataset updated
    Aug 4, 2025
    Authors
    Muhammad Anis Ur Rahman
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset Card for mnist_augmented

    This dataset contains augmented versions of the MNIST dataset, created to benchmark how various augmentation strategies impact digit classification accuracy using deep learning models. The dataset is provided as a .zip file and must be unzipped before use. It follows the ImageFolder structure compatible with PyTorch and other DL frameworks.

      📥 Download & Extract
    

    wget… See the full description on the dataset page: https://huggingface.co/datasets/ianisdev/mnist_augmented.

  8. Pigpen Cipher 9-Grid Classification

    • kaggle.com
    Updated Feb 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joe Norton (2024). Pigpen Cipher 9-Grid Classification [Dataset]. https://www.kaggle.com/datasets/joenorton/pigpen-cipher-9-grid-classification/
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 2, 2024
    Dataset provided by
    Kaggle
    Authors
    Joe Norton
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    75,600 Images - spread across TRAINING, VALIDATION & TEST directories

    54,000 Training, 10,800 each in Validation and Test

    The purpose of this project was to create a MNIST-like image dataset of the PIGPEN CIPHER. There are many variations of this cipher, this project - atm - is focused exclusively on the 9-Grid variant.

    The genesis of this project was thinking about how I once bought a codes & ciphers book and had a lot of fun with the 9-grid variety of the Pigpen Cipher, even teaching it to a few friends at one point. That got me wondering, could PyTorch make quick work of Pigpen? (Answer: obviously)

    The 9-Grid Pigpen Cipher uses a 'tic-tac-toe' board layout, along with a simple dot system, that allows for 27 unique symbols. These 27 symbols can then be mapped to the english alphabet, creating a cipher. So, every symbol corresponds with a letter.

    Read more about the Pigpen Cipher on wikipedia

    This dataset was personally created by me for this project. It includes both multiple 'fonts' I drew in PAINT, as well as computer drawn versions of the symbols. I also utilized scripts to add randomness (in orientation, noise, etc) to the images, allowing me to create a more robust and generalized dataset.

    That said, the dataset currently does not have true hand-written images incorporated into the dataset. However, I think models trained on it will perform well on real-world tests despite that.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
mlagunas (2017). MNIST-Pytorch [Dataset]. https://www.kaggle.com/mlagunas/mnist-pytorch/code
Organization logo

MNIST-Pytorch

MNIST dataset as downloaded by Pytorch libraries.

Explore at:
27 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 18, 2017
Dataset provided by
Kagglehttp://kaggle.com/
Authors
mlagunas
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

MNIST dataset as downloaded by Pytorch libraries.

Search
Clear search
Close search
Google apps
Main menu