100+ datasets found
  1. Face-Detection-Dataset

    • kaggle.com
    • gts.ai
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fares Elmenshawii (2023). Face-Detection-Dataset [Dataset]. https://www.kaggle.com/datasets/fareselmenshawii/face-detection-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Fares Elmenshawii
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The dataset comprises 16.7k images and 2 annotation files, each in a distinct format. The first file, labeled "Label," contains annotations with the original scale, while the second file, named "yolo_format_labels," contains annotations in YOLO format. The dataset was obtained by employing the OIDv4 toolkit, specifically designed for scraping data from Google Open Images. Notably, this dataset exclusively focuses on face detection.

    This dataset offers a highly suitable resource for training deep learning models specifically designed for face detection tasks. The images within the dataset exhibit exceptional quality and have been meticulously annotated with bounding boxes encompassing the facial regions. The annotations are provided in two formats: the original scale, denoting the pixel coordinates of the bounding boxes, and the YOLO format, representing the bounding box coordinates in normalized form.

    The dataset was meticulously curated by scraping relevant images from Google Open Images through the use of the OIDv4 toolkit. Only images that are pertinent to face detection tasks have been included in this dataset. Consequently, it serves as an ideal choice for training deep learning models that specifically target face detection tasks.

  2. h

    face-recognition-image-dataset

    • huggingface.co
    Updated Apr 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). face-recognition-image-dataset [Dataset]. https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset
    Explore at:
    Dataset updated
    Apr 15, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Image Dataset of face images for compuer vision tasks

    Dataset comprises 500,600+ images of individuals representing various races, genders, and ages, with each person having a single face image. It is designed for facial recognition and face detection research, supporting the development of advanced recognition systems. By leveraging this dataset, researchers and developers can enhance deep learning models, improve face verification and face identification techniques, and refine… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset.

  3. h

    male-selfie-image-dataset

    • huggingface.co
    Updated May 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Training Data (2024). male-selfie-image-dataset [Dataset]. https://huggingface.co/datasets/TrainingDataPro/male-selfie-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 2, 2024
    Authors
    Training Data
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Face Recognition, Face Detection, Male Photo Dataset šŸ‘Ø

      If you are interested in biometric data - visit our website to learn more and buy the dataset :)
    

    110,000+ photos of 74,000+ men from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are men. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Our dataset will diversify your data by adding more photos of men of… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/male-selfie-image-dataset.

  4. i

    Expression and Occlusion

    • ieee-dataport.org
    Updated Nov 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhaskar Belavadi (2022). Expression and Occlusion [Dataset]. https://ieee-dataport.org/documents/sjb-face-dataset-indian-face-image-dataset-changes-pose-illuminationexpression-and
    Explore at:
    Dataset updated
    Nov 16, 2022
    Authors
    Bhaskar Belavadi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Expressions

  5. Gender Detection & Classification - Face Dataset

    • kaggle.com
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Training Data (2023). Gender Detection & Classification - Face Dataset [Dataset]. https://www.kaggle.com/datasets/trainingdatapro/gender-detection-and-classification-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 31, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Training Data
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Gender Detection & Classification - face recognition dataset

    The dataset is created on the basis of Face Mask Detection dataset

    Dataset Description:

    The dataset comprises a collection of photos of people, organized into folders labeled "women" and "men." Each folder contains a significant number of images to facilitate training and testing of gender detection algorithms or models.

    The dataset contains a variety of images capturing female and male individuals from diverse backgrounds, age groups, and ethnicities.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F1c4708f0b856f7889e3c0eea434fe8e2%2FFrame%2045%20(1).png?generation=1698764294000412&alt=media" alt="">

    This labeled dataset can be utilized as training data for machine learning models, computer vision applications, and gender detection algorithms.

    šŸ’“ For Commercial Usage: Full version of the dataset includes 376 000+ photos of people, leave a request on TrainingData to buy the dataset

    Metadata for the full dataset:

    • assignment_id - unique identifier of the media file
    • worker_id - unique identifier of the person
    • age - age of the person
    • true_gender - gender of the person
    • country - country of the person
    • ethnicity - ethnicity of the person
    • photo_1_extension, photo_2_extension, photo_3_extension, photo_4_extension - photo extensions in the dataset
    • photo_1_resolution, photo_2_resolution, photo_3_extension, photo_4_resolution - photo resolution in the dataset

    OTHER BIOMETRIC DATASETS:

    šŸ’“ Buy the Dataset: This is just an example of the data. Leave a request on https://trainingdata.pro/datasets to learn about the price and buy the dataset

    Content

    The dataset is split into train and test folders, each folder includes: - folders women and men - folders with images of people with the corresponding gender, - .csv file - contains information about the images and people in the dataset

    File with the extension .csv

    • file: link to access the file,
    • gender: gender of a person in the photo (woman/man),
    • split: classification on train and test

    TrainingData provides high-quality data annotation tailored to your needs

    keywords: biometric system, biometric system attacks, biometric dataset, face recognition database, face recognition dataset, face detection dataset, facial analysis, gender detection, supervised learning dataset, gender classification dataset, gender recognition dataset

  6. h

    face-re-identification-image-dataset

    • huggingface.co
    Updated Mar 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). face-re-identification-image-dataset [Dataset]. https://huggingface.co/datasets/UniDataPro/face-re-identification-image-dataset
    Explore at:
    Dataset updated
    Mar 30, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Dataset of face images with different angles and head positions

    Dataset contains 23,110 individuals, each contributing 28 images featuring various angles and head positions, diverse backgrounds, and attributes, along with 1 ID photo. In total, the dataset comprises over 670,000 images in formats such as JPG and PNG. It is designed to advance face recognition and facial recognition research, focusing on person re-identification and recognition systems. By utilizing this dataset… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/face-re-identification-image-dataset.

  7. u

    Instagram Faces Image Dataset

    • unidata.pro
    jpg
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata L.L.C-FZ, Instagram Faces Image Dataset [Dataset]. https://unidata.pro/datasets/instagram-faces-image/
    Explore at:
    jpgAvailable download formats
    Dataset authored and provided by
    Unidata L.L.C-FZ
    Description

    Instagram Faces Image dataset with diverse single-face images for facial recognition, anti-spoofing, and computer vision

  8. f

    Similar Face Dataset (SFD)

    • figshare.com
    zip
    Updated Jan 15, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AnPing Song (2020). Similar Face Dataset (SFD) [Dataset]. http://doi.org/10.6084/m9.figshare.11611071.v3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 15, 2020
    Dataset provided by
    figshare
    Authors
    AnPing Song
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Similar face recognition has always been one of the most challenging research directions in face recognition.This project shared similar face images (SFD.zip) that we have collected so far. All images are labeld and collected from publicly available datasets such as LFW, CASIA-WebFace.We will continue to collect larger-scale data and continue to update this project.Because the data set is too large, we uploaded a compressed zip file (SFD.zip). Meanwhile here we upload a few examples for everyone to view.email: ileven@shu.edu.cn

  9. F

    Middle Eastern Occluded Facial Image Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Middle Eastern Occluded Facial Image Dataset [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-occlusion-middle-east
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Area covered
    Middle East
    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the Middle Eastern Human Face with Occlusion Dataset, carefully curated to support the development of robust facial recognition systems, occlusion detection models, biometric identification technologies, and KYC verification tools. This dataset provides real-world variability by including facial images with common occlusions, helping AI models perform reliably under challenging conditions.

    Facial Image Data

    The dataset comprises over 3,000 high-quality facial images, organized into participant-wise sets. Each set includes:

    •
    Occluded Images: 5 images per individual featuring different types of facial occlusions, masks, caps, sunglasses, or combinations of these accessories
    •
    Normal Image: 1 reference image of the same individual without any occlusion

    Diversity & Representation

    •
    Geographic Coverage: Participants from across Egypt, Jordan, Suadi Arabia, UAE, Tunisia, and more Middle Eastern countries
    •
    Demographics: Individuals aged 18 to 70 years, with a 60:40 male-to-female ratio
    •
    File Formats: Images available in JPEG and HEIC formats

    Image Quality & Capture Conditions

    To ensure robustness and real-world utility, images were captured under diverse conditions:

    •
    Lighting Variations: Includes both natural and artificial lighting scenarios
    •
    Background Diversity: Indoor and outdoor backgrounds for model generalization
    •
    Device Quality: Captured using the latest smartphones to ensure high resolution and consistency

    Metadata

    Each image is paired with detailed metadata to enable advanced filtering, model tuning, and analysis:

    •Unique Participant ID
    •File Name
    •Age
    •Gender
    •Country
    •Demographic Profile
    •Type of Occlusion
    •File Format

    This rich metadata helps train models that can recognize faces even when partially obscured.

    Use Cases & Applications

    This dataset is ideal for a wide range of real-world and research-focused applications, including:

    •
    Facial Recognition under Occlusion: Improve model performance when faces are partially hidden
    •
    Occlusion Detection: Train systems to detect and classify facial accessories like masks or sunglasses
    •
    Biometric Identity Systems: Enhance verification accuracy across varying conditions
    •
    KYC & Compliance: Support face matching even when the selfie includes common occlusions.
    •
    Security & Surveillance: Strengthen access control and monitoring systems in environments with mask usage

    Secure & Ethical Collection

    •
    Data Security: Collected and processed securely on FutureBeeAI’s proprietary platform
    •
    Ethical Compliance: Follows strict guidelines for participant privacy and informed consent
    •
    Transparent Participation: All contributors provided written consent and were informed of the intended use

    Dataset

  10. f

    Data from: Facial Expression Image Dataset for Computer Vision Algorithms

    • salford.figshare.com
    Updated Apr 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ali Alameer; Odunmolorun Osonuga (2025). Facial Expression Image Dataset for Computer Vision Algorithms [Dataset]. http://doi.org/10.17866/rd.salford.21220835.v2
    Explore at:
    Dataset updated
    Apr 29, 2025
    Dataset provided by
    University of Salford
    Authors
    Ali Alameer; Odunmolorun Osonuga
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset for this project is characterised by photos of individual human emotion expression and these photos are taken with the help of both digital camera and a mobile phone camera from different angles, posture, background, light exposure, and distances. This task might look and sound very easy but there were some challenges encountered along the process which are reviewed below: 1) People constraint One of the major challenges faced during this project is getting people to participate in the image capturing process as school was on vacation, and other individuals gotten around the environment were not willing to let their images be captured for personal and security reasons even after explaining the notion behind the project which is mainly for academic research purposes. Due to this challenge, we resorted to capturing the images of the researcher and just a few other willing individuals. 2) Time constraint As with all deep learning projects, the more data available the more accuracy and less error the result will produce. At the initial stage of the project, it was agreed to have 10 emotional expression photos each of at least 50 persons and we can increase the number of photos for more accurate results but due to the constraint in time of this project an agreement was later made to just capture the researcher and a few other people that are willing and available. These photos were taken for just two types of human emotion expression that is, ā€œhappyā€ and ā€œsadā€ faces due to time constraint too. To expand our work further on this project (as future works and recommendations), photos of other facial expression such as anger, contempt, disgust, fright, and surprise can be included if time permits. 3) The approved facial emotions capture. It was agreed to capture as many angles and posture of just two facial emotions for this project with at least 10 images emotional expression per individual, but due to time and people constraints few persons were captured with as many postures as possible for this project which is stated below: Ƙ Happy faces: 65 images Ƙ Sad faces: 62 images There are many other types of facial emotions and again to expand our project in the future, we can include all the other types of the facial emotions if time permits, and people are readily available. 4) Expand Further. This project can be improved furthermore with so many abilities, again due to the limitation of time given to this project, these improvements can be implemented later as future works. In simple words, this project is to detect/predict real-time human emotion which involves creating a model that can detect the percentage confidence of any happy or sad facial image. The higher the percentage confidence the more accurate the facial fed into the model. 5) Other Questions Can the model be reproducible? the supposed response to this question should be YES. If and only if the model will be fed with the proper data (images) such as images of other types of emotional expression.

  11. Real vs fake faces

    • kaggle.com
    Updated May 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Udit Sharma (2022). Real vs fake faces [Dataset]. https://www.kaggle.com/datasets/uditsharma72/real-vs-fake-faces
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 4, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Udit Sharma
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    About Dataset This dataset contains real and fake images of human faces. Real and Fake Face Detection Fake Face Photos by Photoshop Experts Introduction When using social networks, have you ever encountered a 'fake identity'? Anyone can create a fake profile image using image editing tools, or even using deep learning based generators. If you are interested in making the world wide web a better place by recognizing such fake faces, you should check this dataset.

  12. b

    BioID Face Database

    • bioid.com
    Updated Nov 15, 2006
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BioID (2006). BioID Face Database [Dataset]. https://www.bioid.com/face-database/
    Explore at:
    text/csv+zip, text//x-portable-graymap+zipAvailable download formats
    Dataset updated
    Nov 15, 2006
    Dataset authored and provided by
    BioID
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Pixel
    Description

    The BioID Face Database has been recorded and is published to give all researchers working in the area of face detection the possibility to compare the quality of their face detection algorithms with others. During the recording special emphasis has been laid on real world conditions. Therefore the testset features a large variety of illumination, background and face size. The dataset consists of 1521 gray level images with a resolution of 384x286 pixel. Each one shows the frontal view of a face of one out of 23 different test persons. For comparison reasons the set also contains manually set eye postions. The images are labeled BioID_xxxx.pgm where the characters xxxx are replaced by the index of the current image (with leading zeros). Similar to this, the files BioID_xxxx.eye contain the eye positions for the corresponding images.

  13. F

    Hispanic Occluded Facial Image Dataset

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Hispanic Occluded Facial Image Dataset [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-occlusion-hispanic
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the Hispanic Human Face with Occlusion Dataset, carefully curated to support the development of robust facial recognition systems, occlusion detection models, biometric identification technologies, and KYC verification tools. This dataset provides real-world variability by including facial images with common occlusions, helping AI models perform reliably under challenging conditions.

    Facial Image Data

    The dataset comprises over 3,000 high-quality facial images, organized into participant-wise sets. Each set includes:

    •
    Occluded Images: 5 images per individual featuring different types of facial occlusions, masks, caps, sunglasses, or combinations of these accessories
    •
    Normal Image: 1 reference image of the same individual without any occlusion

    Diversity & Representation

    •
    Geographic Coverage: Participants from across Argentina, Brazil, Costa Rica, Ecuador, Colombia, Peru, and more Hispanic countries
    •
    Demographics: Individuals aged 18 to 70 years, with a 60:40 male-to-female ratio
    •
    File Formats: Images available in JPEG and HEIC formats

    Image Quality & Capture Conditions

    To ensure robustness and real-world utility, images were captured under diverse conditions:

    •
    Lighting Variations: Includes both natural and artificial lighting scenarios
    •
    Background Diversity: Indoor and outdoor backgrounds for model generalization
    •
    Device Quality: Captured using the latest smartphones to ensure high resolution and consistency

    Metadata

    Each image is paired with detailed metadata to enable advanced filtering, model tuning, and analysis:

    •Unique Participant ID
    •File Name
    •Age
    •Gender
    •Country
    •Demographic Profile
    •Type of Occlusion
    •File Format

    This rich metadata helps train models that can recognize faces even when partially obscured.

    Use Cases & Applications

    This dataset is ideal for a wide range of real-world and research-focused applications, including:

    •
    Facial Recognition under Occlusion: Improve model performance when faces are partially hidden
    •
    Occlusion Detection: Train systems to detect and classify facial accessories like masks or sunglasses
    •
    Biometric Identity Systems: Enhance verification accuracy across varying conditions
    •
    KYC & Compliance: Support face matching even when the selfie includes common occlusions.
    •
    Security & Surveillance: Strengthen access control and monitoring systems in environments with mask usage

    Secure & Ethical Collection

    •
    Data Security: Collected and processed securely on FutureBeeAI’s proprietary platform
    •
    Ethical Compliance: Follows strict guidelines for participant privacy and informed consent
    •
    Transparent Participation: All contributors provided written consent and were informed of the intended use

    Dataset

  14. Images of friends character for face recognition

    • kaggle.com
    Updated Mar 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AmirAli Kalbasi (2022). Images of friends character for face recognition [Dataset]. https://www.kaggle.com/datasets/amiralikalbasi/images-of-friends-character-for-face-recognition
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 27, 2022
    Dataset provided by
    Kaggle
    Authors
    AmirAli Kalbasi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset of friends series character sets for face detection and recognition consists of two parts: training and testing. The training section has fifty photos of each of the six characters, and the test section has fifty photos featuring two or more of the characters in each photo from the Friends series for facial detection and recognition.

  15. m

    Dataset for Smile Detection from Face Images

    • data.mendeley.com
    Updated Jan 24, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olasimbo Arigbabu (2017). Dataset for Smile Detection from Face Images [Dataset]. http://doi.org/10.17632/yz4v8tb3tp.5
    Explore at:
    Dataset updated
    Jan 24, 2017
    Authors
    Olasimbo Arigbabu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This data is used in the second experimental evaluation of face smile detection in the paper titled "Smile detection using Hybrid Face Representaion" - O.A.Arigbabu et al. 2015.

    Download the main images from LFWcrop website: http://conradsanderson.id.au/lfwcrop/ to select the samples we used for smile and non-smile, as in the list.

    Kindly cite:

    Arigbabu, Olasimbo Ayodeji, et al. "Smile detection using hybrid face representation." Journal of Ambient Intelligence and Humanized Computing (2016): 1-12.

    C. Sanderson, B.C. Lovell. Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference. ICB 2009, LNCS 5558, pp. 199-208, 2009

    Huang GB, Mattar M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. University of Massachusetts, Amherst, Technical Report

  16. F

    Caucasian Facial Images Dataset | Selfie & ID Card Images

    • futurebeeai.com
    wav
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FutureBee AI (2022). Caucasian Facial Images Dataset | Selfie & ID Card Images [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-selfie-id-caucasian
    Explore at:
    wavAvailable download formats
    Dataset updated
    Aug 1, 2022
    Dataset provided by
    FutureBeeAI
    Authors
    FutureBee AI
    License

    https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement

    Dataset funded by
    FutureBeeAI
    Description

    Introduction

    Welcome to the Caucasian Human Facial Images Dataset, curated to advance facial recognition technology and support the development of secure biometric identity systems, KYC verification processes, and AI-driven computer vision applications. This dataset is designed to serve as a robust foundation for real-world face matching and recognition use cases.

    Facial Image Data

    The dataset contains over 1,000 facial image sets of Caucasian individuals. Each set includes:

    •
    Selfie Images: 5 high-quality selfie images taken under different conditions
    •
    ID Card Images: 2 clear facial images extracted from different government-issued ID cards

    Diversity & Representation

    •
    Geographic Diversity: Participants represent Caucasian countries including Spain, Italy, Turkey, Germany, France, and more
    •
    Demographics: Individuals aged 18 to 70 years with a 60:40 male-to-female ratio
    •
    File Formats: Images are provided in JPEG and HEIC formats for compatibility and quality retention

    Image Quality & Capture Conditions

    All images were captured with real-world variability to enhance dataset robustness:

    •
    Lighting: Captured under diverse lighting setups to simulate real environments
    •
    Backgrounds: A wide variety of indoor and outdoor backgrounds
    •
    Device Quality: Captured using modern smartphones to ensure high resolution and clarity

    Metadata

    Each participant’s data is accompanied by rich metadata to support AI model training, including:

    •Unique participant ID
    •Image file names
    •Age at the time of capture
    •Gender
    •Country of origin
    •Demographic details
    •File format information

    This metadata enables targeted filtering and training across diverse scenarios.

    Use Cases & Applications

    This dataset is ideal for a wide range of AI and biometric applications:

    •
    Facial Recognition: Train accurate and generalizable face matching models
    •
    KYC & Identity Verification: Enhance onboarding and compliance systems in fintech and government services
    •
    Biometric Identification: Build secure facial recognition systems for access control and identity authentication
    •
    Age Prediction: Train models to estimate age from facial features
    •
    Generative AI: Provide reference data for synthetic face generation or augmentation tasks

    Secure & Ethical Collection

    •
    Data Security: All images were securely stored and processed on FutureBeeAI’s proprietary platform
    •
    Ethical Compliance: Data collection was conducted in full alignment with privacy laws and ethical standards
    •
    Informed Consent: Every participant provided written consent, with full awareness of the intended uses of the data

    Dataset Updates & Customization

    To meet evolving AI demands, this dataset is regularly updated and can be customized. Available options include:

  17. h

    female-selfie-image-dataset

    • huggingface.co
    Updated Apr 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Training Data (2024). female-selfie-image-dataset [Dataset]. https://huggingface.co/datasets/TrainingDataPro/female-selfie-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 26, 2024
    Authors
    Training Data
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Face Recognition, Face Detection, Female Photo Dataset šŸ‘©

      If you are interested in biometric data - visit our website to learn more and buy the dataset :)
    

    90,000+ photos of 46,000+ women from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are women. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Our dataset will diversify your data by adding more photos of women of… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/female-selfie-image-dataset.

  18. LFW - People (Face Recognition)

    • kaggle.com
    zip
    Updated Nov 15, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Atul Anand {Jha} (2019). LFW - People (Face Recognition) [Dataset]. https://www.kaggle.com/atulanandjha/lfwpeople
    Explore at:
    zip(243503888 bytes)Available download formats
    Dataset updated
    Nov 15, 2019
    Authors
    Atul Anand {Jha}
    License

    http://www.gnu.org/licenses/lgpl-3.0.htmlhttp://www.gnu.org/licenses/lgpl-3.0.html

    Description

    Welcome to Labeled Faces in the Wild, a database of face photographs designed for studying the problem of unconstrained face recognition. The data set contains more than 13,000 images of faces collected from the web. Each face has been labeled with the name of the person pictured. 1680 of the people pictured have two or more distinct photos in the data set. The only constraint on these faces is that they were detected by the Viola-Jones face detector.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1746215%2F92752ca2b0bbecdd3fd154b88495558d%2F1_RaupR7k7NrrTJZvop7sH-A.png?generation=1573849119616339&alt=media" alt="LFW-PEOPLE">

    This dataset is a collection of JPEG pictures of famous people collected on the internet. All details are available on the official website: http://vis-www.cs.umass.edu/lfw/

    Each picture is centered on a single face. Each pixel of each channel (color in RGB) is encoded by a float in range 0.0 - 1.0.

    The task is called Face Recognition (or Identification): given the picture of a face, find the name of the person given a training set (gallery).

    The original images are 250 x 250 pixels, but the default slice and resize arguments reduce them to 62 x 47 pixels.

    Acknowledgements

    We wouldn't be here without the help of others. I would like to thank Computer Vision Laboratory, university of Massachusetts for providing us with such an excellent database.

    Inspiration

    I had an activity in my college for facial recognition. I came up with this as the best kind of dataset for my task. I am posting it here on Kaggle to make it available for other data scientists conveniently and see what magic they can perform with this amazing dataset.

  19. a

    Labeled Faces in the Wild aligned (LFW-a)

    • academictorrents.com
    bittorrent
    Updated Nov 26, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yaniv Taigman and Lior Wolf and Tal Hassner (2015). Labeled Faces in the Wild aligned (LFW-a) [Dataset]. https://academictorrents.com/details/403e6d6945a64dd1b9e185a6cd8d029274efccdc
    Explore at:
    bittorrent(96770694)Available download formats
    Dataset updated
    Nov 26, 2015
    Dataset authored and provided by
    Yaniv Taigman and Lior Wolf and Tal Hassner
    License

    https://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified

    Description

    The "Labeled Faces in the Wild-a" image collection is a database of labeled, face images intended for studying Face Recognition in unconstrained images. It contains the same images available in the original Labeled Faces in the Wild data set, however, here we provide them after alignment using a commercial face alignment software. Some of our results, published in [1,2,3], were produced using these images. We show this alignment to improve the performance of face recognition algorithms. More information on how these images were aligned may be found in the two papers. We have maintained the same directory structure as in the original LFW data set, and so these images can be used as direct substitutes for those in the original image set. Note, however, that the images available here are grayscale versions of the originals. Citation: If you find these images useful and use them in your work, please follow these guidlines: Comply with any instructions specified for the original L

  20. h

    infrared-face-recognition-dataset

    • huggingface.co
    Updated Mar 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata (2025). infrared-face-recognition-dataset [Dataset]. https://huggingface.co/datasets/UniDataPro/infrared-face-recognition-dataset
    Explore at:
    Dataset updated
    Mar 18, 2025
    Authors
    Unidata
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Infrared Face Detection Dataset

    Dataset contains 125,500+ images, including infrared images, from 4,484 individuals with or without a mask of various races, genders, and ages. It is specifically designed for research in face recognition and facial recognition technology, focusing on the unique challenges posed by thermal infrared imaging. By utilizing this dataset, researchers and developers can enhance their understanding of recognition systems and improve the recognition accuracy… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/infrared-face-recognition-dataset.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Fares Elmenshawii (2023). Face-Detection-Dataset [Dataset]. https://www.kaggle.com/datasets/fareselmenshawii/face-detection-dataset
Organization logo

Face-Detection-Dataset

Face Detection Dataset

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 10, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Fares Elmenshawii
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

The dataset comprises 16.7k images and 2 annotation files, each in a distinct format. The first file, labeled "Label," contains annotations with the original scale, while the second file, named "yolo_format_labels," contains annotations in YOLO format. The dataset was obtained by employing the OIDv4 toolkit, specifically designed for scraping data from Google Open Images. Notably, this dataset exclusively focuses on face detection.

This dataset offers a highly suitable resource for training deep learning models specifically designed for face detection tasks. The images within the dataset exhibit exceptional quality and have been meticulously annotated with bounding boxes encompassing the facial regions. The annotations are provided in two formats: the original scale, denoting the pixel coordinates of the bounding boxes, and the YOLO format, representing the bounding box coordinates in normalized form.

The dataset was meticulously curated by scraping relevant images from Google Open Images through the use of the OIDv4 toolkit. Only images that are pertinent to face detection tasks have been included in this dataset. Consequently, it serves as an ideal choice for training deep learning models that specifically target face detection tasks.

Search
Clear search
Close search
Google apps
Main menu