16 datasets found
  1. P

    Data from: ImageNet Dataset

    • paperswithcode.com
    Updated Apr 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Fei-Fei Li (2024). ImageNet Dataset [Dataset]. https://paperswithcode.com/dataset/imagenet
    Explore at:
    Dataset updated
    Apr 15, 2024
    Authors
    Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Fei-Fei Li
    Description

    The ImageNet dataset contains 14,197,122 annotated images according to the WordNet hierarchy. Since 2010 the dataset is used in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), a benchmark in image classification and object detection. The publicly released dataset contains a set of manually annotated training images. A set of test images is also released, with the manual annotations withheld. ILSVRC annotations fall into one of two categories: (1) image-level annotation of a binary label for the presence or absence of an object class in the image, e.g., “there are cars in this image” but “there are no tigers,” and (2) object-level annotation of a tight bounding box and class label around an object instance in the image, e.g., “there is a screwdriver centered at position (20,25) with width of 50 pixels and height of 30 pixels”. The ImageNet project does not own the copyright of the images, therefore only thumbnails and URLs of images are provided.

    Total number of non-empty WordNet synsets: 21841 Total number of images: 14197122 Number of images with bounding box annotations: 1,034,908 Number of synsets with SIFT features: 1000 Number of images with SIFT features: 1.2 million

  2. T

    imagenet2012_subset

    • tensorflow.org
    Updated Oct 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). imagenet2012_subset [Dataset]. https://www.tensorflow.org/datasets/catalog/imagenet2012_subset
    Explore at:
    Dataset updated
    Oct 21, 2024
    Description

    ILSVRC 2012, commonly known as 'ImageNet' is an image dataset organized according to the WordNet hierarchy. Each meaningful concept in WordNet, possibly described by multiple words or word phrases, is called a "synonym set" or "synset". There are more than 100,000 synsets in WordNet, majority of them are nouns (80,000+). In ImageNet, we aim to provide on average 1000 images to illustrate each synset. Images of each concept are quality-controlled and human-annotated. In its completion, we hope ImageNet will offer tens of millions of cleanly sorted images for most of the concepts in the WordNet hierarchy.

    The test split contains 100K images but no labels because no labels have been publicly released. We provide support for the test split from 2012 with the minor patch released on October 10, 2019. In order to manually download this data, a user must perform the following operations:

    1. Download the 2012 test split available here.
    2. Download the October 10, 2019 patch. There is a Google Drive link to the patch provided on the same page.
    3. Combine the two tar-balls, manually overwriting any images in the original archive with images from the patch. According to the instructions on image-net.org, this procedure overwrites just a few images.

    The resulting tar-ball may then be processed by TFDS.

    To assess the accuracy of a model on the ImageNet test split, one must run inference on all images in the split, export those results to a text file that must be uploaded to the ImageNet evaluation server. The maintainers of the ImageNet evaluation server permits a single user to submit up to 2 submissions per week in order to prevent overfitting.

    To evaluate the accuracy on the test split, one must first create an account at image-net.org. This account must be approved by the site administrator. After the account is created, one can submit the results to the test server at https://image-net.org/challenges/LSVRC/eval_server.php The submission consists of several ASCII text files corresponding to multiple tasks. The task of interest is "Classification submission (top-5 cls error)". A sample of an exported text file looks like the following:

    771 778 794 387 650
    363 691 764 923 427
    737 369 430 531 124
    755 930 755 59 168
    

    The export format is described in full in "readme.txt" within the 2013 development kit available here: https://image-net.org/data/ILSVRC/2013/ILSVRC2013_devkit.tgz Please see the section entitled "3.3 CLS-LOC submission format". Briefly, the format of the text file is 100,000 lines corresponding to each image in the test split. Each line of integers correspond to the rank-ordered, top 5 predictions for each test image. The integers are 1-indexed corresponding to the line number in the corresponding labels file. See labels.txt.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('imagenet2012_subset', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/imagenet2012_subset-1pct-5.0.0.png" alt="Visualization" width="500px">

  3. t

    O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A....

    • service.tib.eu
    Updated Dec 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. S. Bernstein, A. C. Berg, L. Fei-Fei (2024). Dataset: ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset. https://doi.org/10.57702/7kbsn0gl [Dataset]. https://service.tib.eu/ldmservice/dataset/imagenet-large-scale-visual-recognition-challenge--ilsvrc--dataset
    Explore at:
    Dataset updated
    Dec 16, 2024
    Description

    The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification dataset. It contains over 14 million images from 21,841 categories.

  4. Error rates of large-scale visual recognition challenge 2010-2017

    • statista.com
    Updated Feb 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Error rates of large-scale visual recognition challenge 2010-2017 [Dataset]. https://www.statista.com/statistics/808190/worldwide-large-scale-visual-recognition-challenge-error-rates/
    Explore at:
    Dataset updated
    Feb 8, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    The statistic shows the best classification error rate achieved by computer vision algorithms tested on a large-scale visual recognition challenge, from 2010 to 2017. In 2015, the winning algorithm became the first to surpass the average human classification error rate of five percent, and by 2017 machine learning algorithms were able to achieve a classification error rate of 2.3 percent, making fewer than half the number of classification errors as a human.

  5. a

    ImageNet Large Scale Visual Recognition Challenge (V2017)

    • academictorrents.com
    bittorrent
    Updated Mar 6, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei (2019). ImageNet Large Scale Visual Recognition Challenge (V2017) [Dataset]. https://academictorrents.com/details/943977d8c96892d24237638335e481f3ccd54cfb
    Explore at:
    bittorrent(166022728827)Available download formats
    Dataset updated
    Mar 6, 2019
    Dataset authored and provided by
    Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei
    License

    https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

    Description

    A BitTorrent file to download data with the title 'ImageNet Large Scale Visual Recognition Challenge (V2017)'

  6. t

    ImageCLEF-DA

    • service.tib.eu
    Updated Dec 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). ImageCLEF-DA [Dataset]. https://service.tib.eu/ldmservice/dataset/imageclef-da
    Explore at:
    Dataset updated
    Dec 3, 2024
    Description

    The ImageCLEF-DA dataset is a benchmark dataset for ImageCLEF 2014 domain adaptation challenges, which contains 12 categories shared by three domains: Caltech-256 (C), ImageNet ILSVRC 2012 (I), and Pascal VOC 2012 (P).

  7. O

    tieredImageNet

    • opendatalab.com
    • paperswithcode.com
    zip
    Updated Dec 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Canadian Institute for Advanced Research (2022). tieredImageNet [Dataset]. https://opendatalab.com/OpenDataLab/tieredImageNet
    Explore at:
    zip(3920817140 bytes)Available download formats
    Dataset updated
    Dec 25, 2022
    Dataset provided by
    Vector Institute for Artificial Intelligence
    Canadian Institute for Advanced Research
    Princeton University
    Google AI Research
    Massachusetts Institute of Technology
    University of Toronto
    License

    https://mtl.yyliu.net/download/https://mtl.yyliu.net/download/

    Description

    The tieredImageNet dataset is a larger subset of ILSVRC-12 with 608 classes (779,165 images) grouped into 34 higher-level nodes in the ImageNet human-curated hierarchy. This set of nodes is partitioned into 20, 6, and 8 disjoint sets of training, validation, and testing nodes, and the corresponding classes form the respective meta-sets. As argued in Ren et al. (2018), this split near the root of the ImageNet hierarchy results in a more challenging, yet realistic regime with test classes that are less similar to training classes.

  8. f

    Deep neural network feature maps

    • plus.figshare.com
    bin
    Updated May 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alessandro Gifford (2024). Deep neural network feature maps [Dataset]. http://doi.org/10.25452/figshare.plus.21514590.v2
    Explore at:
    binAvailable download formats
    Dataset updated
    May 31, 2024
    Dataset provided by
    Figshare+
    Authors
    Alessandro Gifford
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Feature maps summaryHere we release the PCA-downsampled deep neural network (DNN) feature maps used in the data resource paper: "A large and rich EEG dataset for modeling human visual object recognition". We used four DNN architectures (AlexNet, ResNet-50, CORnet-S, MoCo), and extracted their feature map responses to images coming from the THINGS database and from the ILSVRC-2012 challenge.Useful materialAdditional informationFor additional information on the DNNs used, the stimuli images and feature maps extraction procedure please refer to our paper and code.Additional dataset resourcesPlease visit the dataset page for the paper, dataset tutorial, code and more.OSFFor additional data and resources visit our OSF project, where you can find:The stimuli imagesA detailed descriptions of the DNN feature maps data filesCitationsIf you use any of our data, please cite our paper.

  9. T

    imagenet2012_real

    • tensorflow.org
    Updated Jun 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). imagenet2012_real [Dataset]. https://www.tensorflow.org/datasets/catalog/imagenet2012_real
    Explore at:
    Dataset updated
    Jun 1, 2024
    Description

    This dataset contains ILSVRC-2012 (ImageNet) validation images augmented with a new set of "Re-Assessed" (ReaL) labels from the "Are we done with ImageNet" paper, see https://arxiv.org/abs/2006.07159. These labels are collected using the enhanced protocol, resulting in multi-label and more accurate annotations.

    Important note: about 3500 examples contain no label, these should be excluded from the averaging when computing the accuracy. One possible way of doing this is with the following NumPy code:

    is_correct = [pred in real_labels[i] for i, pred in enumerate(predictions) if real_labels[i]]
    real_accuracy = np.mean(is_correct)
    

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('imagenet2012_real', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/imagenet2012_real-1.0.0.png" alt="Visualization" width="500px">

  10. Configuration of VGG-19 on CIFAR-100.

    • plos.figshare.com
    xls
    Updated Jun 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil (2023). Configuration of VGG-19 on CIFAR-100. [Dataset]. http://doi.org/10.1371/journal.pone.0262386.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 15, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Configuration of VGG-19 on CIFAR-100.

  11. P

    COMPASS-XP Dataset

    • paperswithcode.com
    • live.european-language-grid.eu
    • +1more
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lewis D. Griffin; Matthew Caldwell; Jerone T. A. Andrews; Helene Bohler, COMPASS-XP Dataset [Dataset]. https://paperswithcode.com/dataset/compass-xp
    Explore at:
    Authors
    Lewis D. Griffin; Matthew Caldwell; Jerone T. A. Andrews; Helene Bohler
    Description

    COMPASS-XP is a dataset of matched photographic and X-ray images of single objects, made available for use in Machine Learning & Computer Vision research, in particular in the context of transport security. Objects are imaged in multiple poses, and accompanied by metadata including labels for whether we consider the object to be dangerous in the context of aviation. Object classes overlap with those in the popular ImageNet Large Scale Visual Recognition Challenge class set and theWordNet lexical database, and identifiers for shared classes in both schemes are also provided.

    Hardware Configuration Photographs were captured with a Sony DSC-W800 compact digital camera. X-ray scans were obtained using a Gilardoni FEP ME 536 mailroom X-ray machine, distributed in the UK by Todd Research under the name TR50. The scanner is dual energy and generates several image outputs: • Low: Raw 8-bit greyscale data from the scanner’s low energy X-ray channel.
    • High: Raw 8-bit greyscale data from the scanner’s high energy X-ray channel.
    • Density: 8-bit greyscale data representing inferred material density computed from the two channels.
    • Grey: RGB PNG image representing a combination of both low and high energy channels with some appearance improvements. Although nominally greyscale, the image does include subtle duotone-style colouration.
    • Colour RGB PNG image with false colour palette representing material density.

    In practice the grey and colour versions are probably most useful, but for completeness the dataset includes all variants for each scan.

    Data Files Image files are supplied in six subdirectories, corresponding to the five X-ray image variants above plus photos. X-rays are provided in PNG format, while photos are JPEG. Each scan is identified by a numeric index, which is also used to name the file, padded with leading zeros to always be 4 digits long.

    Scan metadata is provided in the accompanying tab-delimited text file, meta.txt. This includes the following columns: • basename: The zero-padded identifier for the scan. All six image type variants for the same class-instance-pose have the same basename. X-ray files are named basename.png while photos are basename.jpg.
    • class: The object class in the scan.
    • instance: An integer identifying the object instance. Instances start at 1 for each class.
    • pose: An integer identifying the object pose. Poses start at 1 for each instance.
    • scan tray: Either A, indicating that the pose was imaged in a weighted tray, or N indicating it was not.
    • dangerous: Whether the object was considered dangerous (True/False).
    • IN id: Numeric index of the object class in the ILSVRC list of 1000 classes, or empty if the class isn’t present there.
    • WN id: WordNet identifier for the object class, or empty if the class isn’t present inWordNet.

    License The COMPASS-XP dataset was acquired as part of a research project funded by the UK Government Future Aviation Security Solutions programme. Both the images and their metadata are licensed under the Creative Commons Attribution 4.0 International License and may be freely used for research and commercial purpose, including derivative works, providing the source is acknowledged.

    COMPASS-XP Dataset Authors Lewis D. Griffin*, Matthew Caldwell, Jerone T. A. Andrews Computational Security Science Group, UCL * l.griffin@cs.ucl.ac.uk

  12. f

    Comparison of VGG-19 on CIFAR-100.

    • plos.figshare.com
    xls
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil (2023). Comparison of VGG-19 on CIFAR-100. [Dataset]. http://doi.org/10.1371/journal.pone.0262386.t008
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of VGG-19 on CIFAR-100.

  13. Comparison of AlexNet on CIFAR-10.

    • plos.figshare.com
    xls
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil (2023). Comparison of AlexNet on CIFAR-10. [Dataset]. http://doi.org/10.1371/journal.pone.0262386.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of AlexNet on CIFAR-10.

  14. n

    首頁 組織 NCHC-資料服務專區 ImageNet ILSVRC2012

    • scidm.nchc.org.tw
    Updated Oct 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). 首頁 組織 NCHC-資料服務專區 ImageNet ILSVRC2012 [Dataset]. https://scidm.nchc.org.tw/dataset/imagenet-ilsvrc2012
    Explore at:
    Dataset updated
    Oct 15, 2024
    Description

    ImageNet is an image database organized according to the WordNet hierarchy (currently only the nouns), in which each node of the hierarchy is depicted by hundreds and thousands of images. Currently we have an average of over five hundred images per node. We hope ImageNet will become a useful resource for researchers, educators, students and all of you who share our passion for pictures.

  15. f

    Parameters overview of different CNN architectures.

    • plos.figshare.com
    xls
    Updated Jun 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil (2023). Parameters overview of different CNN architectures. [Dataset]. http://doi.org/10.1371/journal.pone.0262386.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 15, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Parameters overview of different CNN architectures.

  16. Configuration of AlexNet on CIFAR-10.

    • plos.figshare.com
    xls
    Updated Jun 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil (2023). Configuration of AlexNet on CIFAR-10. [Dataset]. http://doi.org/10.1371/journal.pone.0262386.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 15, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Waqas Ahmed; Shahab Ansari; Muhammad Hanif; Akhtar Khalil
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Configuration of AlexNet on CIFAR-10.

  17. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Fei-Fei Li (2024). ImageNet Dataset [Dataset]. https://paperswithcode.com/dataset/imagenet

Data from: ImageNet Dataset

Related Article
Explore at:
Dataset updated
Apr 15, 2024
Authors
Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Fei-Fei Li
Description

The ImageNet dataset contains 14,197,122 annotated images according to the WordNet hierarchy. Since 2010 the dataset is used in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), a benchmark in image classification and object detection. The publicly released dataset contains a set of manually annotated training images. A set of test images is also released, with the manual annotations withheld. ILSVRC annotations fall into one of two categories: (1) image-level annotation of a binary label for the presence or absence of an object class in the image, e.g., “there are cars in this image” but “there are no tigers,” and (2) object-level annotation of a tight bounding box and class label around an object instance in the image, e.g., “there is a screwdriver centered at position (20,25) with width of 50 pixels and height of 30 pixels”. The ImageNet project does not own the copyright of the images, therefore only thumbnails and URLs of images are provided.

Total number of non-empty WordNet synsets: 21841 Total number of images: 14197122 Number of images with bounding box annotations: 1,034,908 Number of synsets with SIFT features: 1000 Number of images with SIFT features: 1.2 million

Search
Clear search
Close search
Google apps
Main menu