100+ datasets found
  1. m

    Dataset of Pairs of an Image and Tags for Cataloging Image-based Records

    • data.mendeley.com
    • narcis.nl
    Updated Feb 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tokinori Suzuki (2022). Dataset of Pairs of an Image and Tags for Cataloging Image-based Records [Dataset]. http://doi.org/10.17632/msyc6mzvhg.1
    Explore at:
    Dataset updated
    Feb 24, 2022
    Authors
    Tokinori Suzuki
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Brief Explanation

    This dataset is created to develop and evaluate a cataloging system which assigns appropriate metadata to an image record for database management in digital libraries. That is assumed for evaluating a task, in which given an image and assigned tags, an appropriate Wikipedia page is selected for each of the given tags.

    A main characteristic of the dataset is including ambiguous tags. Thus, visual contents of images are not unique to their tags. For example, it includes a tag 'mouse' which has double meaning of not a mammal but a computer controller device. The annotations are corresponding Wikipedia articles for tags as correct entities by human judgement.

    The dataset offers both data and programs that reproduce experiments of the above-mentioned task. Its data consist of sources of images and annotations. The image sources are URLs of 420 images uploaded to Flickr. The annotations are a total 2,464 relevant Wikipedia pages manually judged for tags of the images. The dataset also provides programs in Jupiter notebook (scripts.ipynb) to conduct a series of experiments running some baseline methods for the designated task and evaluating the results.

    Structure of the Dataset

    1. data directory 1.1. image_URL.txt This file lists URLs of image files.

      1.2. rels.txt This file lists collect Wikipedia pages for each topic in topics.txt

      1.3. topics.txt This file lists a target pair, which is called a topic in this dataset, of an image and a tag to be disambiguated.

      1.4. enwiki_20171001.xml This file is extracted texts from the title and body parts of English Wikipedia articles as of 1st October 2017. This is a modified data of Wikipedia dump data (https://archive.org/download/enwiki-20171001).

    2. img directory This directory is a placeholder directory to fetch image files for downloading.

    3. results directory This directory is a placeholder directory to store results files for evaluation. It maintains three results of baseline methods in sub-directories. They contain json files each of which is a result of one topic, and are ready to be evaluated using an evaluation scripts in scripts.ipynb for reference of both usage and performance.

    4. scripts.ipynb The scripts for running baseline methods and evaluation are ready in this Jupyter notebook file.

  2. Dresden Image Database

    • kaggle.com
    zip
    Updated Jun 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MicsCodes (2024). Dresden Image Database [Dataset]. https://www.kaggle.com/datasets/micscodes/dresden-image-database/code
    Explore at:
    zip(53247469834 bytes)Available download formats
    Dataset updated
    Jun 27, 2024
    Authors
    MicsCodes
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Dresden
    Description

    The 'Dresden Image Database' comprises original JPEG images from 73 camera devices across 25 camera models. This dataset is primarily used for Source Camera Device and Model Identification, offering over 14,000 images captured under controlled conditions.

    Copyright: "Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee."

    Original Source (Not Working as on 28 June 2024): http://forensics.inf.tu-dresden.de/dresden_image_database/

    Please Cite the corresponding paper "Gloe, T., & Böhme, R. (2010, March). The'Dresden Image Database'for benchmarking digital image forensics. In Proceedings of the 2010 ACM symposium on applied computing (pp. 1584-1590)."

  3. d

    New Image Database

    • dataone.org
    • knb.ecoinformatics.org
    Updated Jan 6, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rick Reeves (2015). New Image Database [Dataset]. http://doi.org/10.5063/AA/reeves.18.1
    Explore at:
    Dataset updated
    Jan 6, 2015
    Dataset provided by
    Knowledge Network for Biocomplexity
    Authors
    Rick Reeves
    Time period covered
    Jan 1, 2003 - Jul 31, 2006
    Area covered
    Variables measured
    Ncols, Nrows, LambdaMax, LambdaMin, ImageAcqDate, Image Location, SpatialResSqMeters
    Description

    Testing the use of Morpho to store image data. Visit https://dataone.org/datasets/doi%3A10.5063%2FAA%2Freeves.18.1 for complete metadata about this dataset.

  4. R

    Digital Image Dataset

    • universe.roboflow.com
    zip
    Updated Nov 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    digital image processing (2025). Digital Image Dataset [Dataset]. https://universe.roboflow.com/digital-image-processing-tfwdf/digital-image-fcebn/model/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 22, 2025
    Dataset authored and provided by
    digital image processing
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Objects Bounding Boxes
    Description

    Digital Image

    ## Overview
    
    Digital Image is a dataset for object detection tasks - it contains Objects annotations for 264 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  5. Real & Fake Images Dataset-For image forensics

    • kaggle.com
    zip
    Updated Sep 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shivam Ardeshna (2024). Real & Fake Images Dataset-For image forensics [Dataset]. https://www.kaggle.com/datasets/shivamardeshna/real-and-fake-images-dataset-for-image-forensics
    Explore at:
    zip(2962926438 bytes)Available download formats
    Dataset updated
    Sep 4, 2024
    Authors
    Shivam Ardeshna
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The "Real & Fake Images Dataset-For Image Forensics" is meticulously curated to support the development and evaluation of algorithms for detecting fake images. This dataset is organized into three primary directories: train, test, and validation. Each of these directories contains two subdirectories, one with real images and the other with fake images.

    The dataset is an invaluable resource for researchers, data scientists, and developers working in the field of image forensics, deep learning, and computer vision. It provides a solid foundation for training models that can distinguish between authentic and manipulated images. This task has become increasingly crucial with the rise of deepfakes and other forms of image manipulation.

    By utilizing this dataset, you can:

    • Develop robust algorithms for detecting fake images.
    • Enhance existing models by training them on a balanced dataset of real and fake images.
    • Conduct comprehensive evaluations of model performance in real-world scenarios.
    • Explore the nuances of image manipulation techniques to understand better and combat digital forgeries.

    This dataset is particularly useful for those aiming to advance the state of the art in image forensics, ensuring that digital content remains trustworthy in an era of increasingly sophisticated visual deception.

  6. Data from: CLEMENTINE UVVIS DIGITAL IMAGE MODEL

    • catalog.data.gov
    • datasets.ai
    • +3more
    Updated Aug 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Aeronautics and Space Administration (2025). CLEMENTINE UVVIS DIGITAL IMAGE MODEL [Dataset]. https://catalog.data.gov/dataset/clementine-uvvis-digital-image-model-1beff
    Explore at:
    Dataset updated
    Aug 22, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    The Clementine UVVIS DIM Mosaic is a full-resolution (100 meters per pixel) global mosaic produced by the U.S. Geological Survey from Clementine EDR Data. Imaging data acquired by the Ultraviolet/Visible Camera were used to create the multi-band mosaic.

  7. t

    RAISE: A Raw Images Dataset for Digital Image Forensics - Dataset - LDM

    • service.tib.eu
    • resodate.org
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). RAISE: A Raw Images Dataset for Digital Image Forensics - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/raise--a-raw-images-dataset-for-digital-image-forensics
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    A dataset for digital image forensics, containing 224 images.

  8. Digital Record Store for images (DRCi) - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Mar 28, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2014). Digital Record Store for images (DRCi) - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/digital-record-store-for-images-drci
    Explore at:
    Dataset updated
    Mar 28, 2014
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    Hosted by Iron Mountain. The DRC is an IBM Image on Demand System. It stores and manages digital images and fixed content. It currently holds British Coal Corporation (BCC) scanned Medical Records (formerly paper based), EDM Legacy (Inbound, Medical and Outbound folders), Nurse Records and VWF Medical records from a former Atos contract. It also holds a number of other scanned images from legacy databases relating to BCC staff records, earning and spirometry records transferred for other databases. In addition NCS Claim Archive data is transferred on a regular basis.

  9. t

    Born-Digital Images Dataset (2025). Dataset: Born-Digital Images Dataset....

    • service.tib.eu
    Updated Jan 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Born-Digital Images Dataset (2025). Dataset: Born-Digital Images Dataset. https://doi.org/10.57702/5f1fi29r [Dataset]. https://service.tib.eu/ldmservice/dataset/born-digital-images-dataset
    Explore at:
    Dataset updated
    Jan 3, 2025
    Description

    This dataset contains images made digitally employing a desktop scanner, a camera, and screen capture software.

  10. d

    Data from: Sediment grain size and digital image calibration parameters from...

    • catalog.data.gov
    • data.usgs.gov
    • +2more
    Updated Nov 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Sediment grain size and digital image calibration parameters from the mouth of the Columbia River, Oregon and Washington, 2014 [Dataset]. https://catalog.data.gov/dataset/sediment-grain-size-and-digital-image-calibration-parameters-from-the-mouth-of-the-columbi
    Explore at:
    Dataset updated
    Nov 13, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Columbia River, Oregon
    Description

    This dataset includes 63 still images extracted from digital video imagery of sediment grab samples, along with laboratory grain size analysis of the sediment grab samples, taken from the mouth of the Columbia River, OR and WA, USA. Digital video was collected in September 2014 in the mouth of the Columbia River, USA, as part of the U.S. Geological Survey Coastal and Marine Geology Program contribution to the Office of Naval Research funded River and Inlets Dynamics experiment (RIVET II). Still images were extracted from the underwater video footage whenever the camera was resting on the sediment bed and individual sediment grains were visible and in focus. The images were used to calculate the calibration curve through auto-correlation regressed against the results of laboratory-determined median grain size (D50) of the grab samples (Barnard, 2007), provided in an accompanying .csv file.

  11. MGN V RDRS DERIVED DIGITAL IMAGE MAP DATA RECORD V1.0 - Dataset - NASA Open...

    • data.nasa.gov
    Updated Mar 31, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). MGN V RDRS DERIVED DIGITAL IMAGE MAP DATA RECORD V1.0 - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/mgn-v-rdrs-derived-digital-image-map-data-record-v1-0
    Explore at:
    Dataset updated
    Mar 31, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    This data set contains the Magellan the FMAP, a full-resolution (75 meters/pixel) global mosaic, produced by the U.S. Geological Survey from Magellan F-BIDR data. The complete dataset consists of 340 quadrangles in Sinusoidal equal-area projection. Quadrangles extend approximately 12 degrees in latitude, except for those between 84 and 90 degrees North and South. Quadrangles near the equator extend 12 degrees in longitude longitudinal extent is increased to maintain a roughly constant number of samples.

  12. b

    Image Library — Historical

    • data.brisbane.qld.gov.au
    Updated Sep 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Image Library — Historical [Dataset]. https://data.brisbane.qld.gov.au/explore/dataset/image-library-historical/
    Explore at:
    Dataset updated
    Sep 26, 2025
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Brisbane City Council’s digital collection of images, maps, plans and documents documenting Brisbane's History from the 1850s through to now. Search by title, subject or keyword. To search and view use the link in the Data and resources section below.

  13. 4

    4n6 Images Database

    • 4n6img.com
    Updated Nov 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Husam Shbib (2025). 4n6 Images Database [Dataset]. https://4n6img.com/
    Explore at:
    Dataset updated
    Nov 10, 2025
    Dataset provided by
    4n6 Images
    Authors
    Husam Shbib
    Description

    A curated collection of verified forensic images contributed by the digital forensics community.

  14. USDA ARS Image Gallery

    • agdatacommons.nal.usda.gov
    • s.cnmilf.com
    • +2more
    bin
    Updated Nov 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    USDA Agricultural Research Service (2023). USDA ARS Image Gallery [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/USDA_ARS_Image_Gallery/24659814
    Explore at:
    binAvailable download formats
    Dataset updated
    Nov 30, 2023
    Dataset provided by
    Agricultural Research Servicehttps://www.ars.usda.gov/
    Authors
    USDA Agricultural Research Service
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    This Image Gallery is provided as a complimentary source of high-quality digital photographs available from the Agricultural Research Service information staff. Photos, (over 2,000 .jpegs) in the Image Gallery are copyright-free, public domain images unless otherwise indicated. Resources in this dataset:Resource Title: USDA ARS Image Gallery (Web page) . File Name: Web Page, url: https://www.ars.usda.gov/oc/images/image-gallery/ Over 2000 copyright-free images from ARS staff.

  15. Paraguay Synoptic Digital Image Records

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Sep 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NOAA National Centers for Environmental Information (Point of Contact) (2023). Paraguay Synoptic Digital Image Records [Dataset]. https://catalog.data.gov/dataset/paraguay-synoptic-digital-image-records
    Explore at:
    Dataset updated
    Sep 19, 2023
    Dataset provided by
    National Centers for Environmental Informationhttps://www.ncei.noaa.gov/
    National Oceanic and Atmospheric Administrationhttp://www.noaa.gov/
    Area covered
    Paraguay
    Description

    This dataset consists of images of records containing meteorological surface observations in synoptic format for various stations in the country of Paraguay. It has a period of record ranging from 1940 to 1998. The observations include (but are not limited to) 6-hourly values of temperature, humidity, cloud coverage, cloud heights, rainfall, and wind speed/direction.

  16. Pictures Paintings And Digital Images Dataset

    • universe.roboflow.com
    zip
    Updated Feb 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Artera (2024). Pictures Paintings And Digital Images Dataset [Dataset]. https://universe.roboflow.com/artera-dwzwa/pictures-paintings-and-digital-images
    Explore at:
    zipAvailable download formats
    Dataset updated
    Feb 12, 2024
    Dataset provided by
    WELL Health Inc.
    Authors
    Artera
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Pictures Polygons
    Description

    Pictures Paintings And Digital Images

    ## Overview
    
    Pictures Paintings And Digital Images is a dataset for instance segmentation tasks - it contains Pictures annotations for 1,408 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  17. f

    Supplementary Material for: The Visible Ear: A Digital Image Library of the...

    • karger.figshare.com
    pdf
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sørensen M.S.; Dobrzeniecki A.B.; Larsen P.; Frisch T.; Sporring J.; Darvann T.A. (2023). Supplementary Material for: The Visible Ear: A Digital Image Library of the Temporal Bone [Dataset]. http://doi.org/10.6084/m9.figshare.5100184.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Karger Publishers
    Authors
    Sørensen M.S.; Dobrzeniecki A.B.; Larsen P.; Frisch T.; Sporring J.; Darvann T.A.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    High-fidelity computer-based modeling, simulation and visualization systems for the study of temporal bone anatomy and training for middle ear surgery are based on a sequence of digital anatomical images, which must cover a large tissue volume and yet display details in high resolution and with high fidelity. However, the use of existing image libraries by independent developers of virtual models of the ear is limited by copyright protection and low image resolution. A fresh frozen human temporal bone was CT-scanned and serially sectioned at 25 µm and digital images of the block surface were recorded at 50- to 100-µm increments with a Light PhaseTM single-shot camera back attachment. A total of 605 images were recorded in 24-bit RGB resolution. After color correction and elimination of image size variation by differential cropping to 15.4 cm × 9.7 cm, all images were resampled to 3,078 × 1,942 pixels at a final resolution of 50 µm/pixel and stored as 605 one-Mb JPEG files together with a three-dimensional viewer. The resulting complete set of image data provides: (1) a source material suitable for generating computer models of the human ear; (2) a resource of high-quality digital images of anatomical cross sections from the human ear, and (3) a PC-based viewer of the temporal bone in three perpendicular planes of section.

  18. PMcardio ECG Image Database (PM-ECG-ID): A Diverse ECG Database for...

    • zenodo.org
    zip
    Updated Aug 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrej Iring; Viera Krešňáková; Viera Krešňáková; Michal Hojcka; Vladimir Boza; Adam Rafajdus; Boris Vavrik; Andrej Iring; Michal Hojcka; Vladimir Boza; Adam Rafajdus; Boris Vavrik (2024). PMcardio ECG Image Database (PM-ECG-ID): A Diverse ECG Database for Evaluating Digitization Solutions [Dataset]. http://doi.org/10.5281/zenodo.13617673
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 31, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Andrej Iring; Viera Krešňáková; Viera Krešňáková; Michal Hojcka; Vladimir Boza; Adam Rafajdus; Boris Vavrik; Andrej Iring; Michal Hojcka; Vladimir Boza; Adam Rafajdus; Boris Vavrik
    License

    https://www.gnu.org/licenses/gpl-3.0-standalone.htmlhttps://www.gnu.org/licenses/gpl-3.0-standalone.html

    Description

    The dataset presents the collection of a diverse electrocardiogram (ECG) database for testing and evaluating ECG digitization solutions. The Powerful Medical ECG image database was curated using 100 ECG waveforms selected from the PTB-XL Digital Waveform Database and various images generated from the base waveforms with varying lead visibility and real-world paper deformations, including the use of different mobile phones, bends, crumbles, scans, and photos of computer screens with ECGs. The ECG waveforms were augmented using various techniques, including changes in contrast, brightness, perspective transformation, rotation, image blur, JPEG compression, and resolution change. This extensive approach yielded 6,000 unique entries, which provides a wide range of data variance and extreme cases to evaluate the limitations of ECG digitization solutions and improve their performance, and serves as a benchmark to evaluate ECG digitization solutions.

    PM-ECG-ID database contains electrocardiogram (ECG) images and their corresponding ECG information. The data records are organized in a hierarchical folder structure, which includes metadata, waveform data, and visual data folders. The contents of each folder are described below:

    • metadata.csv:
      This file serves as a key-to-key bridge between the image data and the corresponding ECG information. It contains the following columns:
      • Image name: image name with extension,
      • ECG ID: this ID corresponds to the specific ECG identifier from the original PTB-XL dataset. Under this ID you can find a cutout array in the leads.npz and rhythms.npz,
      • Image relative path: relative path to the image in question,
      • Image page: page number of the particular image (starting from 0),
      • ECG number of pages: number of pages in the whole ECG,
      • ECG number of columns per page: number of columns per page in the ECG,
      • ECG number of rows per page: number of rows in the ECG,
      • ECG number of rhythm leads: number of rhythms in the ECG,
      • ECG format: short version of the ECG format.
    • data folder:
      • leads.npz: NPZ file containing all underlying cutout lead signals; each signal is there under its ECG ID.
      • rhythms.npz: NPZ file containing all underlying rhythm signals; each signal is there under its ECG ID. If no rhythm lead is in the ECG, you will find an empty array in the NPZ.
    • visual_data folder:
      This folder contains subfolders for various image data, including augmented photos and visualization and different types of photos of ECG printouts. The subfolders are organized based on the specific augmentation or type of photograph. These folders contain images with various augmentation settings, such as different levels of blur, brightness, contrast, padding, perspective transformation, resolution scaling, and rotation. The database is organized in a way that allows for easy navigation and understanding of the different augmentations applied to the image data. Each of these subfolders contains images relevant to the specific augmentation or type of photograph. The metadata.csv file provides a direct link to each image and its associated ECG information.
  19. Computational Photography Project for Pill Identification (C3PI)

    • healthdata.gov
    • data.virginia.gov
    • +2more
    csv, xlsx, xml
    Updated Feb 26, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    datadiscovery.nlm.nih.gov (2021). Computational Photography Project for Pill Identification (C3PI) [Dataset]. https://healthdata.gov/NIH/Computational-Photography-Project-for-Pill-Identif/5vhs-kfa6
    Explore at:
    csv, xlsx, xmlAvailable download formats
    Dataset updated
    Feb 26, 2021
    Dataset provided by
    datadiscovery.nlm.nih.gov
    Description

    The Computational Photography Project for Pill Identification (C3PI) was sunset in 2018. No new images will be added to the collection. Identifiers for pills will not be updated. Images and metadata are for research and development purposes only.

    The Computational Photography Project for Pill Identification (C3PI) created the RxIMAGE database of freely available high-quality digital images of prescription pills and associated data for use in conducting computer vision research in text- and image-based search and retrieval. Photographs of pills for the RxIMAGE database were taken under laboratory lighting conditions, from a camera directly above the front and the back faces of the pill, at high resolution, and using specialized digital macro-photography techniques. Image segmentation algorithms were then applied to create the JPEG images in the database.

    Historical information about the project is available in the NLM archive at https://wayback.archive-it.org/7867/20190423182937/https:/lhncbc.nlm.nih.gov/project/c3pi-computational-photography-project-pill-identification.

  20. AAHQ Dataset

    • kaggle.com
    zip
    Updated Apr 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jade Sag3 (2022). AAHQ Dataset [Dataset]. https://www.kaggle.com/datasets/jadesag3/aahq-comped
    Explore at:
    zip(19430403567 bytes)Available download formats
    Dataset updated
    Apr 7, 2022
    Authors
    Jade Sag3
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    A highly usable and diverse artistic image dataset from Artstation comprised of high resolution facial portriats. A wide range of technique and styles make up the artworks which makes for a great generative art dataset. Since most of the images had either no clear usage license and/or are derivative works of copyrighted material. This dataset is for non commerical research use only.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Tokinori Suzuki (2022). Dataset of Pairs of an Image and Tags for Cataloging Image-based Records [Dataset]. http://doi.org/10.17632/msyc6mzvhg.1

Dataset of Pairs of an Image and Tags for Cataloging Image-based Records

Explore at:
Dataset updated
Feb 24, 2022
Authors
Tokinori Suzuki
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Brief Explanation

This dataset is created to develop and evaluate a cataloging system which assigns appropriate metadata to an image record for database management in digital libraries. That is assumed for evaluating a task, in which given an image and assigned tags, an appropriate Wikipedia page is selected for each of the given tags.

A main characteristic of the dataset is including ambiguous tags. Thus, visual contents of images are not unique to their tags. For example, it includes a tag 'mouse' which has double meaning of not a mammal but a computer controller device. The annotations are corresponding Wikipedia articles for tags as correct entities by human judgement.

The dataset offers both data and programs that reproduce experiments of the above-mentioned task. Its data consist of sources of images and annotations. The image sources are URLs of 420 images uploaded to Flickr. The annotations are a total 2,464 relevant Wikipedia pages manually judged for tags of the images. The dataset also provides programs in Jupiter notebook (scripts.ipynb) to conduct a series of experiments running some baseline methods for the designated task and evaluating the results.

Structure of the Dataset

  1. data directory 1.1. image_URL.txt This file lists URLs of image files.

    1.2. rels.txt This file lists collect Wikipedia pages for each topic in topics.txt

    1.3. topics.txt This file lists a target pair, which is called a topic in this dataset, of an image and a tag to be disambiguated.

    1.4. enwiki_20171001.xml This file is extracted texts from the title and body parts of English Wikipedia articles as of 1st October 2017. This is a modified data of Wikipedia dump data (https://archive.org/download/enwiki-20171001).

  2. img directory This directory is a placeholder directory to fetch image files for downloading.

  3. results directory This directory is a placeholder directory to store results files for evaluation. It maintains three results of baseline methods in sub-directories. They contain json files each of which is a result of one topic, and are ready to be evaluated using an evaluation scripts in scripts.ipynb for reference of both usage and performance.

  4. scripts.ipynb The scripts for running baseline methods and evaluation are ready in this Jupyter notebook file.

Search
Clear search
Close search
Google apps
Main menu