100+ datasets found
  1. d

    Selfie photo Dataset | 10M+ images | Global Coverage | Face Detection |...

    • datarade.ai
    .jpg, .jpeg, .png
    Updated Jul 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FileMarket (2025). Selfie photo Dataset | 10M+ images | Global Coverage | Face Detection | Computer vision Data [Dataset]. https://datarade.ai/data-products/selfie-photo-dataset-10m-images-global-coverage-face-d-filemarket
    Explore at:
    .jpg, .jpeg, .pngAvailable download formats
    Dataset updated
    Jul 25, 2025
    Dataset authored and provided by
    FileMarket
    Area covered
    Senegal, Saint Kitts and Nevis, Uganda, Djibouti, Honduras, Myanmar, South Georgia and the South Sandwich Islands, Chile, New Zealand, Martinique
    Description

    Total Users 10,229,822 Total Pictures 10M+ (mostly 1 per ID)

    Gender: - Male 60% - Female 40%

    Ethnicity: - Asian 9% - African Decent 13% - East Indian 3% - Latino Hispanic 28% - Caucasian 47%

    Age Group: - 0-17 3% - 18-24 62% - 25-34 21% - 35-44 10% - 45-54 3% - 55+ 1%

    Top Phone Models: - iPhone 6s 9% - iPhone XR 6% - iPhone 6 6% - iPhone 7 (US/CDMA) 6% - iPhone 11 5% - iPhone 8 (US/CDMA) 4% (Total 141 device)

    Top Countries: - US 48.84% - GB 10.57% - CA 4.26% - AU 3.48% - FR 2.80% - SA 2.17% (Total 131 countries)

    Average resolution 5761024 px.

    All photos are collected with the consent of users.

  2. Large dataset of geotagged images

    • kaggle.com
    Updated Jul 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hassan Abedi (2022). Large dataset of geotagged images [Dataset]. https://www.kaggle.com/datasets/habedi/large-dataset-of-geotagged-images
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 27, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Hassan Abedi
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset consists of 4.2 million (4,233,900 more precisely) geotagged images from the YFCC100M dataset. The images are from a subset of images used in MediaEval Placing Task 2016. For each image, its id, latitude and longitude where it was taken, plus the image itself, are stored as a record in MessagePack format.

    Each shard file (a *.msg file) contains 30 thousand images.

    An illustration of how each record looks like is shown below.

    {'image': b'\xff\xd8\xff\xe0...
    \x05\x87\xef\x1e\x94o\xf6\xa6QG\xb4\x90Xv\xfa7\xd3h\', 
    'id': '13/20/8010869266.jpg', 'latitude': 29.426458, 'longitude': -98.490723}
    
  3. h

    female-selfie-image-dataset

    • huggingface.co
    Updated Apr 26, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unique Data (2024). female-selfie-image-dataset [Dataset]. https://huggingface.co/datasets/UniqueData/female-selfie-image-dataset
    Explore at:
    Dataset updated
    Apr 26, 2024
    Authors
    Unique Data
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Face Recognition, Face Detection, Female Photo Dataset šŸ‘©

      The dataset is created on the basis of Selfies and ID Dataset
    

    90,000+ photos of 46,000+ women from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are women. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Our dataset will diversify your data by adding more photos of women of different ages and ethnic groups… See the full description on the dataset page: https://huggingface.co/datasets/UniqueData/female-selfie-image-dataset.

  4. D

    NTUT 4K Drone Photo Dataset

    • datasetninja.com
    Updated Dec 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kuan-Ting (K. T.) Lai (2023). NTUT 4K Drone Photo Dataset [Dataset]. https://datasetninja.com/ntut-4k-drone-photo
    Explore at:
    Dataset updated
    Dec 19, 2023
    Dataset provided by
    Dataset Ninja
    Authors
    Kuan-Ting (K. T.) Lai
    License

    https://spdx.org/licenses/https://spdx.org/licenses/

    Description

    In the NTUT 4K Drone Photo Dataset for Human Detection authors furnish 4K photos extracted from drone videos captured in Taiwan. Authors claim, that contemporary drones are outfitted with 4K video cameras, and the heightened resolution of the images facilitates modern object detectors in discerning smaller objects. Despite this capability, many drone image datasets typically offer only downscaled images. The dataset is curated by the AIoT Lab at the National Taiwan University of Technology (NTUT).

  5. Cloud Images Dataset

    • kaggle.com
    Updated Sep 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nuttida Lapthanachai (2021). Cloud Images Dataset [Dataset]. https://www.kaggle.com/datasets/nuttidalapthanachai/cloud-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 6, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Nuttida Lapthanachai
    Description

    Dataset

    This dataset was created by Nuttida Lapthanachai

    Contents

  6. h

    face-recognition-image-dataset

    • huggingface.co
    Updated Apr 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). face-recognition-image-dataset [Dataset]. https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset
    Explore at:
    Dataset updated
    Apr 15, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Image Dataset of face images for compuer vision tasks

    Dataset comprises 500,600+ images of individuals representing various races, genders, and ages, with each person having a single face image. It is designed for facial recognition and face detection research, supporting the development of advanced recognition systems. By leveraging this dataset, researchers and developers can enhance deep learning models, improve face verification and face identification techniques, and refine… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset.

  7. R

    Photo Dataset

    • universe.roboflow.com
    zip
    Updated Sep 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    fakeee (2025). Photo Dataset [Dataset]. https://universe.roboflow.com/fakeee/photo-dyvyo/model/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 18, 2025
    Dataset authored and provided by
    fakeee
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Photo
    Description

    Photo

    ## Overview
    
    Photo is a dataset for classification tasks - it contains Photo annotations for 5,166 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  8. o

    The Massively Multilingual Image Dataset (MMID)

    • registry.opendata.aws
    Updated Jan 23, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Penn NLP (2019). The Massively Multilingual Image Dataset (MMID) [Dataset]. https://registry.opendata.aws/mmid/
    Explore at:
    Dataset updated
    Jan 23, 2019
    Dataset provided by
    <a href="https://github.com/penn-nlp">Penn NLP</a>
    Description

    MMID is a large-scale, massively multilingual dataset of images paired with the words they represent collected at the University of Pennsylvania. The dataset is doubly parallel: for each language, words are stored parallel to images that represent the word, and parallel to the word's translation into English (and corresponding images.)

  9. h

    AI-Generated-vs-Real-Images-Datasets

    • huggingface.co
    Updated Aug 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hem Bahadur Gurung (2025). AI-Generated-vs-Real-Images-Datasets [Dataset]. https://huggingface.co/datasets/Hemg/AI-Generated-vs-Real-Images-Datasets
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2025
    Authors
    Hem Bahadur Gurung
    Description

    Dataset Card for "AI-Generated-vs-Real-Images-Datasets"

    More Information needed

  10. i

    Mouse Labeled Image Dataset

    • images.cv
    zip
    Updated Dec 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Mouse Labeled Image Dataset [Dataset]. https://images.cv/dataset/mouse-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 20, 2021
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Labeled Mouse images suitable for AI and computer vision.

  11. Image Data (Object Detection and Captioning)

    • kaggle.com
    Updated Apr 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arunesh (2024). Image Data (Object Detection and Captioning) [Dataset]. https://www.kaggle.com/datasets/aruneshhh/object-detection-images
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 15, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Arunesh
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    🌟 Unlock the potential of advanced computer vision tasks with our comprehensive dataset comprising 15,000 high-quality images. Whether you're delving into segmentation, object detection, or image captioning, our dataset offers a diverse array of visual data to fuel your machine learning models.

    šŸ” Our dataset is meticulously curated to encompass a wide range of streams, ensuring versatility and applicability across various domains. From natural landscapes to urban environments, from wildlife to everyday objects, our collection captures the richness and diversity of visual content.

    šŸ“Š Dataset Overview:

    Total ImagesTraining Set (70%)Testing Set (30%)
    15,00010,5004,500

    šŸ”¢ Image Details:

    • Format: JPG
    • Size Range: Approximately 150 to 300 KB per image

    Embark on your computer vision journey and leverage our dataset to develop cutting-edge algorithms, advance research, and push the boundaries of what's possible in visual recognition tasks. Join us in shaping the future of AI-powered image analysis.

  12. F

    Middle Eastern Facial Images Dataset | Selfie & ID Card Images

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Middle Eastern Facial Images Dataset | Selfie & ID Card Images [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-selfie-id-middle-eastern
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the Middle Eastern Human Facial Images Dataset, curated to advance facial recognition technology and support the development of secure biometric identity systems, KYC verification processes, and AI-driven computer vision applications. This dataset is designed to serve as a robust foundation for real-world face matching and recognition use cases.

    Facial Image Data

    The dataset contains over 1500 facial image sets of Middle Eastern individuals. Each set includes:

    •
    Selfie Images: 5 high-quality selfie images taken under different conditions
    •
    ID Card Images: 2 clear facial images extracted from different government-issued ID cards

    Diversity & Representation

    •
    Geographic Diversity: Participants represent Middle Eastern countries including Egypt, Jordan, Suadi Arabia, UAE, Tunisia, and more
    •
    Demographics: Individuals aged 18 to 70 years with a 60:40 male-to-female ratio
    •
    File Formats: Images are provided in JPEG and HEIC formats for compatibility and quality retention

    Image Quality & Capture Conditions

    All images were captured with real-world variability to enhance dataset robustness:

    •
    Lighting: Captured under diverse lighting setups to simulate real environments
    •
    Backgrounds: A wide variety of indoor and outdoor backgrounds
    •
    Device Quality: Captured using modern smartphones to ensure high resolution and clarity

    Metadata

    Each participant’s data is accompanied by rich metadata to support AI model training, including:

    •Unique participant ID
    •Image file names
    •Age at the time of capture
    •Gender
    •Country of origin
    •Demographic details
    •File format information

    This metadata enables targeted filtering and training across diverse scenarios.

    Use Cases & Applications

    This dataset is ideal for a wide range of AI and biometric applications:

    •
    Facial Recognition: Train accurate and generalizable face matching models
    •
    KYC & Identity Verification: Enhance onboarding and compliance systems in fintech and government services
    •
    Biometric Identification: Build secure facial recognition systems for access control and identity authentication
    •
    Age Prediction: Train models to estimate age from facial features
    •
    Generative AI: Provide reference data for synthetic face generation or augmentation tasks

    Secure & Ethical Collection

    •
    Data Security: All images were securely stored and processed on FutureBeeAI’s proprietary platform
    •
    Ethical Compliance: Data collection was conducted in full alignment with privacy laws and ethical standards
    •
    Informed Consent: Every participant provided written consent, with full awareness of the intended uses of the data

    Dataset Updates & Customization

    To meet evolving AI demands, this dataset is regularly updated and can be customized. Available options include:

    <div style="margin-top:10px; margin-bottom: 10px; padding-left: 30px; display: flex; gap: 16px; align-items:

  13. T

    open_images_v4

    • tensorflow.org
    • opendatalab.com
    Updated Jun 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). open_images_v4 [Dataset]. https://www.tensorflow.org/datasets/catalog/open_images_v4
    Explore at:
    Dataset updated
    Jun 1, 2024
    Description

    Open Images is a dataset of ~9M images that have been annotated with image-level labels and object bounding boxes.

    The training set of V4 contains 14.6M bounding boxes for 600 object classes on 1.74M images, making it the largest existing dataset with object location annotations. The boxes have been largely manually drawn by professional annotators to ensure accuracy and consistency. The images are very diverse and often contain complex scenes with several objects (8.4 per image on average). Moreover, the dataset is annotated with image-level labels spanning thousands of classes.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('open_images_v4', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/open_images_v4-original-2.0.0.png" alt="Visualization" width="500px">

  14. i

    Mountain Labeled Image Dataset

    • images.cv
    zip
    Updated Dec 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Mountain Labeled Image Dataset [Dataset]. https://images.cv/dataset/mountain-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 13, 2021
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Labeled Mountain images suitable for AI and computer vision.

  15. 30,696 Pairs – Portrait Retouching Before & After Image Dataset

    • nexdata.ai
    • m.nexdata.ai
    Updated Mar 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2025). 30,696 Pairs – Portrait Retouching Before & After Image Dataset [Dataset]. https://www.nexdata.ai/datasets/computervision/1581?source=huggingface
    Explore at:
    Dataset updated
    Mar 9, 2025
    Dataset authored and provided by
    Nexdata
    Variables measured
    Data size, Data type, Accuracy rata, Data diversity, Annotation content, Country distribution, Collecting environment, Population distribution
    Description

    This dataset collection scenarios include indoor and outdoor scenes, the country distribution is Algeria, Egypt, Hungary, Poland, and Japan. Data types include portrait photos and wedding photos. In terms of data annotation, detailed retouching and annotationing are performed on the collected studio portrait data. The data can be used for tasks such as raining models for portrait retouching, photo editing, and studio photography applications.

  16. BIQ2021: A Dataset for Image Quality Assessment

    • kaggle.com
    zip
    Updated Feb 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nisar Ahmed (2025). BIQ2021: A Dataset for Image Quality Assessment [Dataset]. https://www.kaggle.com/datasets/nisarahmedrana/biq2021
    Explore at:
    zip(426142727 bytes)Available download formats
    Dataset updated
    Feb 1, 2025
    Authors
    Nisar Ahmed
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The BIQ2021 dataset is a large-scale blind image quality assessment database, consisting of 12,000 authentically distorted images. Each image in the dataset has been quality rated by 30 observers, resulting in a total of 360,000 quality ratings. This dataset was created in a controlled laboratory environment, ensuring consistent and reliable subjective scoring. Moreover, the dataset provide a train/test split by which the researchers can report their results for benchmarking. The dataset is openly available and serves as a valuable resource for evaluating and benchmarking image quality assessment algorithms. The paper providing a detailed description of the dataset and its creation process is openly accessible at the following link: BIQ2021: A large-scale blind image quality assessment database.

    The paper can be sited as:

    Ahmed, N., & Asif, S. (2022). BIQ2021: a large-scale blind image quality assessment database. Journal of Electronic Imaging, 31(5), 053010.

    Dataset Description

    Images: The dataset contain a folder named images containing 12,000 images to be used for training and testing. Train (Images and MOS): It is a CSV file containing randomly partitioned train set of the dataset containing 10,000 images with their corresponding MOS. Test (Images and MOS): It is a CSV file containing randomly partitioned test set of the dataset containing 2,000 images with their corresponding MOS.

    Benchmarking: In order to compare the performance of a predictive model trained on the dataset, Pearson and Spearman's correlation can be computed and compared with the existing approaches and the CNN models listed at the following gitHub repository: https://github.com/nisarahmedrana/BIQ2021

  17. i

    Bull riding Labeled Image Dataset

    • images.cv
    zip
    Updated Mar 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Bull riding Labeled Image Dataset [Dataset]. https://images.cv/dataset/bull-riding-image-classification-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 1, 2022
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Labeled Bull riding images suitable for AI and computer vision.

  18. d

    1935 15' Quad #217 Aerial Photo Mosaic Index - NM

    • catalog.data.gov
    • datasets.ai
    • +3more
    Updated Dec 2, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Earth Data Analysis Center (Point of Contact) (2020). 1935 15' Quad #217 Aerial Photo Mosaic Index - NM [Dataset]. https://catalog.data.gov/dataset/1935-15-quad-217-aerial-photo-mosaic-index-nm
    Explore at:
    Dataset updated
    Dec 2, 2020
    Dataset provided by
    Earth Data Analysis Center (Point of Contact)
    Description

    Aerial Photo Reference Mosaics contain aerial photographs that are retrievable on a frame by frame basis. The inventory contains imagery from various sources that are now archived at the Earth Data Analysis Center.

  19. d

    Data from: Comparison of photo-matching algorithms commonly used for...

    • datadryad.org
    • data.niaid.nih.gov
    • +1more
    zip
    Updated Jun 9, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maximilian MatthƩ; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow (2018). Comparison of photo-matching algorithms commonly used for photographic capture-recapture studies [Dataset]. http://doi.org/10.5061/dryad.4f0bh
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 9, 2018
    Dataset provided by
    Dryad
    Authors
    Maximilian MatthƩ; Marco Sannolo; Kristopher Winiarski; Annemarieke Spitzen - van der Sluijs; Daniel Goedbloed; Sebastian Steinfartz; Ulrich Stachow
    Time period covered
    May 16, 2017
    Description

    Images and source code for the publication.The archive contains all the images and the source code used for the analysis in the referenced publication. It also contains the obtained equality tables, such that the whole comparison does not need to be run again. Moreover, code for analyzing the results is provided. Refer to the contained readme.html file for more detailed information.comparisonOfPhotoMatchingAlgorithms.tar.gz

  20. RGB Image Dataset

    • figshare.com
    zip
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ling-Qi Zhang (2023). RGB Image Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.14542434.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Ling-Qi Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Curated RGB image dataset for our analysis, splited into training and evalutaion set. Based on ImageNet ILSVRC dataset (Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, 2015).

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
FileMarket (2025). Selfie photo Dataset | 10M+ images | Global Coverage | Face Detection | Computer vision Data [Dataset]. https://datarade.ai/data-products/selfie-photo-dataset-10m-images-global-coverage-face-d-filemarket

Selfie photo Dataset | 10M+ images | Global Coverage | Face Detection | Computer vision Data

Explore at:
.jpg, .jpeg, .pngAvailable download formats
Dataset updated
Jul 25, 2025
Dataset authored and provided by
FileMarket
Area covered
Senegal, Saint Kitts and Nevis, Uganda, Djibouti, Honduras, Myanmar, South Georgia and the South Sandwich Islands, Chile, New Zealand, Martinique
Description

Total Users 10,229,822 Total Pictures 10M+ (mostly 1 per ID)

Gender: - Male 60% - Female 40%

Ethnicity: - Asian 9% - African Decent 13% - East Indian 3% - Latino Hispanic 28% - Caucasian 47%

Age Group: - 0-17 3% - 18-24 62% - 25-34 21% - 35-44 10% - 45-54 3% - 55+ 1%

Top Phone Models: - iPhone 6s 9% - iPhone XR 6% - iPhone 6 6% - iPhone 7 (US/CDMA) 6% - iPhone 11 5% - iPhone 8 (US/CDMA) 4% (Total 141 device)

Top Countries: - US 48.84% - GB 10.57% - CA 4.26% - AU 3.48% - FR 2.80% - SA 2.17% (Total 131 countries)

Average resolution 5761024 px.

All photos are collected with the consent of users.

Search
Clear search
Close search
Google apps
Main menu