100+ datasets found

Data from: batches
kaggle.com
zip
Updated May 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NESSMA HAMIDOU (2024). batches [Dataset]. https://www.kaggle.com/datasets/nessmahamidou/batches
Explore at:
zip(3543533328 bytes)Available download formats
Dataset updated
May 9, 2024
Authors
NESSMA HAMIDOU
Description
Dataset

This dataset was created by NESSMA HAMIDOU

Contents
resume_text_batch
kaggle.com
zip
Updated May 28, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rohit Kartik (2020). resume_text_batch [Dataset]. https://www.kaggle.com/datasets/oo7kartik/resume-text-batch
Explore at:
zip(31215830 bytes)Available download formats
Dataset updated
May 28, 2020
Authors
Rohit Kartik
Description
Dataset

This dataset was created by Rohit Kartik

Contents
Data from: batches
kaggle.com
zip
Updated Sep 10, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pawan Kumar Safi (2020). batches [Dataset]. https://www.kaggle.com/datasets/pawankumarsafi/batches
Explore at:
zip(13420 bytes)Available download formats
Dataset updated
Sep 10, 2020
Authors
Pawan Kumar Safi
Description
Dataset

This dataset was created by Pawan Kumar Safi

Contents
Pure, 16 batches, 60 epochs + add. layers
kaggle.com
zip
Updated Jan 15, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sebastian Schnichels (2021). Pure, 16 batches, 60 epochs + add. layers [Dataset]. https://www.kaggle.com/sebastianschnichels/pure-16-batches-60-epochs-add-layers
Explore at:
zip(118787774 bytes)Available download formats
Dataset updated
Jan 15, 2021
Authors
Sebastian Schnichels
Description
Dataset

This dataset was created by Sebastian Schnichels

Contents
original : CIFAR 100
kaggle.com
zip
Updated Dec 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shashwat Pandey (2024). original : CIFAR 100 [Dataset]. https://www.kaggle.com/datasets/shashwat90/original-cifar-100
Explore at:
zip(168517945 bytes)Available download formats
Dataset updated
Dec 28, 2024
Authors
Shashwat Pandey
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
The CIFAR-10 and CIFAR-100 datasets are labeled subsets of the 80 million tiny images dataset. CIFAR-10 and CIFAR-100 were created by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton. (Sadly, the 80 million tiny images dataset has been thrown into the memory hole by its authors. Spotting the doublethink which was used to justify its erasure is left as an exercise for the reader.)

The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images.

The dataset is divided into five training batches and one test batch, each with 10000 images. The test batch contains exactly 1000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5000 images from each class.

The classes are completely mutually exclusive. There is no overlap between automobiles and trucks. "Automobile" includes sedans, SUVs, things of that sort. "Truck" includes only big trucks. Neither includes pickup trucks.

Baseline results You can find some baseline replicable results on this dataset on the project page for cuda-convnet. These results were obtained with a convolutional neural network. Briefly, they are 18% test error without data augmentation and 11% with. Additionally, Jasper Snoek has a new paper in which he used Bayesian hyperparameter optimization to find nice settings of the weight decay and other hyperparameters, which allowed him to obtain a test error rate of 15% (without data augmentation) using the architecture of the net that got 18%.

Other results Rodrigo Benenson has collected results on CIFAR-10/100 and other datasets on his website; click here to view.

Dataset layout Python / Matlab versions I will describe the layout of the Python version of the dataset. The layout of the Matlab version is identical.

The archive contains the files data_batch_1, data_batch_2, ..., data_batch_5, as well as test_batch. Each of these files is a Python "pickled" object produced with cPickle. Here is a python2 routine which will open such a file and return a dictionary: python def unpickle(file): import cPickle with open(file, 'rb') as fo: dict = cPickle.load(fo) return dict And a python3 version: def unpickle(file): import pickle with open(file, 'rb') as fo: dict = pickle.load(fo, encoding='bytes') return dict Loaded in this way, each of the batch files contains a dictionary with the following elements: data -- a 10000x3072 numpy array of uint8s. Each row of the array stores a 32x32 colour image. The first 1024 entries contain the red channel values, the next 1024 the green, and the final 1024 the blue. The image is stored in row-major order, so that the first 32 entries of the array are the red channel values of the first row of the image. labels -- a list of 10000 numbers in the range 0-9. The number at index i indicates the label of the ith image in the array data.

The dataset contains another file, called batches.meta. It too contains a Python dictionary object. It has the following entries: label_names -- a 10-element list which gives meaningful names to the numeric labels in the labels array described above. For example, label_names[0] == "airplane", label_names[1] == "automobile", etc. Binary version The binary version contains the files data_batch_1.bin, data_batch_2.bin, ..., data_batch_5.bin, as well as test_batch.bin. Each of these files is formatted as follows: <1 x label><3072 x pixel> ... <1 x label><3072 x pixel> In other words, the first byte is the label of the first image, which is a number in the range 0-9. The next 3072 bytes are the values of the pixels of the image. The first 1024 bytes are the red channel values, the next 1024 the green, and the final 1024 the blue. The values are stored in row-major order, so the first 32 bytes are the red channel values of the first row of the image.

Each file contains 10000 such 3073-byte "rows" of images, although there is nothing delimiting the rows. Therefore each file should be exactly 30730000 bytes long.

There is another file, called batches.meta.txt. This is an ASCII file that maps numeric labels in the range 0-9 to meaningful class names. It is merely a list of the 10 class names, one per row. The class name on row i corresponds to numeric label i.

The CIFAR-100 dataset This dataset is just like the CIFAR-10, except it has 100 classes containing 600 images each. There are 500 training images and 100 testing images per class. The 100 classes in the CIFAR-100 are grouped into 20 superclasses. Each image comes with a "fine" label (the class to which it belongs) and a "coarse" label (the superclass to which it belongs). Her...
batch-weight
kaggle.com
zip
Updated Feb 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
YYYYake (2023). batch-weight [Dataset]. https://www.kaggle.com/datasets/yyyyake/batchweight
Explore at:
zip(1107841197 bytes)Available download formats
Dataset updated
Feb 2, 2023
Authors
YYYYake
Description
Dataset

This dataset was created by YYYYake

Contents
Second Section batches 1-5
kaggle.com
zip
Updated Jul 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). Second Section batches 1-5 [Dataset]. https://www.kaggle.com/datasets/abidinawres/second-section-batches-1-5/code
Explore at:
zip(1134790600 bytes)Available download formats
Dataset updated
Jul 14, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
First section batches 11-15
kaggle.com
zip
Updated Jul 15, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). First section batches 11-15 [Dataset]. https://www.kaggle.com/datasets/abidinawres/first-section-batches-11-15/code
Explore at:
zip(1133886921 bytes)Available download formats
Dataset updated
Jul 15, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
third section batches 6-10
kaggle.com
zip
Updated Jul 13, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). third section batches 6-10 [Dataset]. https://www.kaggle.com/datasets/abidinawres/third-section-batches-6-10/code
Explore at:
zip(1134509261 bytes)Available download formats
Dataset updated
Jul 13, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
second section batches 11-15
kaggle.com
zip
Updated Jul 15, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). second section batches 11-15 [Dataset]. https://www.kaggle.com/datasets/abidinawres/second-section-batches-11-15/code
Explore at:
zip(1135023987 bytes)Available download formats
Dataset updated
Jul 15, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
First section batches 1-5
kaggle.com
zip
Updated Jul 15, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). First section batches 1-5 [Dataset]. https://www.kaggle.com/datasets/abidinawres/first-section-batches-1-5
Explore at:
zip(1133954470 bytes)Available download formats
Dataset updated
Jul 15, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
att-gnn-40-batches
kaggle.com
zip
Updated Apr 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SemenB (2023). att-gnn-40-batches [Dataset]. https://www.kaggle.com/datasets/semenb/att-gnn-40-batches
Explore at:
zip(1304420296 bytes)Available download formats
Dataset updated
Apr 13, 2023
Authors
SemenB
Description
Dataset

This dataset was created by SemenB

Contents
fourth section batches 17-20
kaggle.com
zip
Updated Jul 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). fourth section batches 17-20 [Dataset]. https://www.kaggle.com/datasets/abidinawres/fourth-section-batches-17-20
Explore at:
zip(908307008 bytes)Available download formats
Dataset updated
Jul 13, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
Fourth section batches 9 - 12
kaggle.com
zip
Updated Jul 13, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Naoures Abidi (2024). Fourth section batches 9 - 12 [Dataset]. https://www.kaggle.com/datasets/abidinawres/fourth-section-batches-9-12
Explore at:
zip(908607912 bytes)Available download formats
Dataset updated
Jul 13, 2024
Authors
Naoures Abidi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Naoures Abidi

Released under Apache 2.0

Contents
cifar-10-batches-py
kaggle.com
zip
Updated Dec 4, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiayi Gui (2022). cifar-10-batches-py [Dataset]. https://www.kaggle.com/datasets/houzitest/cifar-10-batches-py
Explore at:
zip(170063312 bytes)Available download formats
Dataset updated
Dec 4, 2022
Authors
Jiayi Gui
Description
Dataset

This dataset was created by Jiayi Gui

Contents
batch-info
kaggle.com
zip
Updated Oct 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
intissar ziani (2025). batch-info [Dataset]. https://www.kaggle.com/datasets/intissarziani/batch-info
Explore at:
zip(827 bytes)Available download formats
Dataset updated
Oct 29, 2025
Authors
intissar ziani
Description
Dataset

This dataset was created by intissar ziani

Contents
The CIFAR-10 Dataset
kaggle.com
zip
Updated Apr 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eka Antonius Kurniawan (2021). The CIFAR-10 Dataset [Dataset]. https://www.kaggle.com/datasets/ekaakurniawan/the-cifar10-dataset
Explore at:
zip(170063312 bytes)Available download formats
Dataset updated
Apr 6, 2021
Authors
Eka Antonius Kurniawan
Description
Context

60,000 labeled color images divided in 10 classes. Each image is in 32x32 pixels. 50,000 images are for training and 10,000 images are for testing.

Content

50,000 images for training are divided into 5 batches (10,000 images per batch). The filenames are data_batch_1 to data_batch_5. All 5 of them are Python pickle files.

10,000 images for testing in one batch. The filename is test_batch. It is a Python pickle file.

batches.meta file contains metadata information like the number of images per batch (which is 10,000), 10 label names, and parameter size (which is 32x32x3 or 3072)

readme.html file contains the URL to CIFAR page in Alex Krizhevsky's home page.

Acknowledgements

When using this dataset, please cite the following. - Learning Multiple Layers of Features from Tiny Images, Alex Krizhevsky, 2009.

License

The MIT License (MIT) Copyright © 2021 Eka A. Kurniawan Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
BATCH 8 MLOPS PROJECT
kaggle.com
zip
Updated Oct 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saibhargav Ch (2024). BATCH 8 MLOPS PROJECT [Dataset]. https://www.kaggle.com/datasets/saibhargavch/batch-8-mlops-project/code
Explore at:
zip(263060 bytes)Available download formats
Dataset updated
Oct 7, 2024
Authors
Saibhargav Ch
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Saibhargav Ch

Released under Apache 2.0

Contents
All_batches
kaggle.com
zip
Updated Jun 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
grich said (2022). All_batches [Dataset]. https://www.kaggle.com/datasets/saidgrich/all-batches
Explore at:
zip(915415123 bytes)Available download formats
Dataset updated
Jun 7, 2022
Authors
grich said
Description
Dataset

This dataset was created by grich said

Contents
cifar-10-batches-py
kaggle.com
zip
Updated May 6, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zeyuan (2020). cifar-10-batches-py [Dataset]. https://www.kaggle.com/fengzeyuan/cifar10batchespy
Explore at:
zip(169633852 bytes)Available download formats
Dataset updated
May 6, 2020
Authors
Zeyuan
Description
Context

This version of cifar is not preprocessed.

Pickle

Use pickle and protocol is 4.

Facebook

Twitter

Click to copy link

Link copied

Cite

NESSMA HAMIDOU (2024). batches [Dataset]. https://www.kaggle.com/datasets/nessmahamidou/batches

Data from: batches

Explore at:

zip(3543533328 bytes)Available download formats

Dataset updated

May 9, 2024

Authors

NESSMA HAMIDOU

Description

Dataset

This dataset was created by NESSMA HAMIDOU

Clear search

Close search

Google apps

Main menu

Data from: batches

Dataset

Contents

resume_text_batch

Dataset

Contents

Data from: batches

Dataset

Contents

Pure, 16 batches, 60 epochs + add. layers

Dataset

Contents

original : CIFAR 100

batch-weight

Dataset

Contents

Second Section batches 1-5

Dataset

Contents

First section batches 11-15

Dataset

Contents

third section batches 6-10

Dataset

Contents

second section batches 11-15

Dataset

Contents

First section batches 1-5

Dataset

Contents

att-gnn-40-batches

Dataset

Contents

fourth section batches 17-20

Dataset

Contents

Fourth section batches 9 - 12

Dataset

Contents

cifar-10-batches-py

Dataset

Contents

batch-info

Dataset

Contents

The CIFAR-10 Dataset

Context

Content

Acknowledgements

License

BATCH 8 MLOPS PROJECT

Dataset

Contents

All_batches

Dataset

Contents

cifar-10-batches-py

Context

Pickle

Data from: batches

Dataset

Contents