100+ datasets found

Face Recognition Train
kaggle.com
zip
Updated Jun 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sanjana chaudhari☑️ (2023). Face Recognition Train [Dataset]. https://www.kaggle.com/datasets/sanjanchaudhari/face-recog-train
Explore at:
zip(95864 bytes)Available download formats
Dataset updated
Jun 20, 2023
Authors
Sanjana chaudhari☑️
License
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Description
Face Recognition Train

Face recognition is a technology that involves identifying or verifying individuals by analyzing their facial features. It has gained significant popularity and has various applications, including security systems, access control, surveillance, and personalized user experiences.

The process of face recognition typically involves the following steps:

Face detection: A face detection algorithm is used to locate and extract faces from an image or a video frame. This step helps in isolating the facial region for further analysis.

Face alignment and preprocessing: The extracted face images are usually aligned to a standardized size and orientation to account for variations in pose, scale, and rotation. Preprocessing techniques may be applied to normalize lighting conditions, remove noise, and enhance the quality of the images.

Feature extraction: Meaningful features are extracted from the aligned face images to represent the unique characteristics of each individual. These features are often represented as numerical vectors, capturing specific facial attributes or patterns. Traditional methods like Eigenfaces, Fisherfaces, or Local Binary Patterns (LBP) can be used, but deep learning-based approaches like Convolutional Neural Networks (CNNs) have shown superior performance in recent years.

Feature encoding and representation: The extracted features are encoded into a compact representation, making it easier to compare and match them against other faces. Techniques like Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), or more advanced methods like Siamese networks or Triplet Loss can be employed for encoding the face features.

Face matching and recognition: During this stage, the extracted and encoded features are compared to a database of known faces or a set of reference features. The goal is to find the closest match or determine the identity of the individual represented by the face image. Various similarity metrics such as Euclidean distance, cosine similarity, or more sophisticated techniques like metric learning can be utilized for face matching.

Decision and classification: Based on the comparison results, a decision is made to recognize or classify the input face image. If a match is found within the database, the system can provide the identity of the person associated with the recognized face.
Custom Face Recognition Image Dataset
kaggle.com
zip
Updated Jul 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Unidata (2025). Custom Face Recognition Image Dataset [Dataset]. https://www.kaggle.com/datasets/unidpro/face-recognition-image-dataset
Explore at:
zip(27609695 bytes)Available download formats
Dataset updated
Jul 3, 2025
Authors
Unidata
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Image Dataset of face images for compuer vision tasks

Dataset comprises 500,600+ images of individuals representing various races, genders, and ages, with each person having a single face image. It is designed for facial recognition and face detection research, supporting the development of advanced recognition systems.

By leveraging this dataset, researchers and developers can enhance deep learning models, improve face verification and face identification techniques, and refine detection algorithms for more accurate recognizing faces in real-world scenarios. - Get the data

Metadata for the dataset

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F22059654%2F87acb75b060abcd7838e8a9fad21fb79%2FFrame%201%20(8).png?generation=1743153407873743&alt=media" alt=""> All images come with rigorously verified metadata annotations (age, gender, ethnicity), achieving ≥95% labeling accuracy. Also images are captured under different lighting conditions and resolutions, enhancing the dataset's utility for computer vision tasks and image classifications.

💵 Buy the Dataset: This is a limited preview of the data. To access the full dataset, please contact us at https://unidata.pro to discuss your requirements and pricing options.

Researchers can leverage this dataset to improve recognition technology and develop learning models that enhance the accuracy of face detections. The dataset also supports projects focused on face anti-spoofing and deep learning applications, making it an essential tool for those studying biometric security and liveness detection technologies.

🌐 UniData provides high-quality datasets, content moderation, data collection and annotation for your AI/ML projects
Face Detection - Face Recognition Dataset
kaggle.com
zip
Updated Nov 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Unique Data (2023). Face Detection - Face Recognition Dataset [Dataset]. https://www.kaggle.com/datasets/trainingdatapro/face-detection-photos-and-labels
Explore at:
zip(1252666206 bytes)Available download formats
Dataset updated
Nov 8, 2023
Authors
Unique Data
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Face Detection - Object Detection & Face Recognition Dataset

The dataset is created on the basis of Selfies and ID Dataset

The dataset is a collection of images (selfies) of people and bounding box labeling for their faces. It has been specifically curated for face detection and face recognition tasks. The dataset encompasses diverse demographics, age, ethnicities, and genders.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F01348572e2ae2836f10bc2f2da381009%2FFrame%2050%20(1).png?generation=1699439342545305&alt=media" alt="">

The dataset is a valuable resource for researchers, developers, and organizations working on age prediction and face recognition to train, evaluate, and fine-tune AI models for real-world applications. It can be applied in various domains like psychology, market research, and personalized advertising.

👉 Legally sourced datasets and carefully structured for AI training and model development. Explore samples from our dataset of 95,000+ human images & videos - Full dataset

Metadata for the full dataset:

assignment_id - unique identifier of the media file

worker_id - unique identifier of the person

age - age of the person

true_gender - gender of the person

country - country of the person

ethnicity - ethnicity of the person

photo_1_extension, photo_2_extension, …, photo_15_extension - photo extensions in the dataset

photo_1_resolution, photo_2_resolution, …, photo_15_resolution - photo resolution in the dataset

OTHER BIOMETRIC DATASETS:

Anti Spoofing Real Dataset

Antispoofing Replay Dataset

Selfies, ID Images dataset (5591 sets of 15 files)

Selfies and video dataset (4 052 sets)

Dataset of bald people, 5000 images

🧩 This is just an example of the data. Leave a request here to learn more

Dataset structure

images - contains of original images of people

labels - includes visualized labeling for the original images

annotations.xml - contains coordinates of the bbox, created for the original photo

Data Format

Each image from images folder is accompanied by an XML-annotation in the annotations.xml file indicating the coordinates of the polygons and labels . For each point, the x and y coordinates are provided.

Example of XML file structure

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F19e61b2d0780e9db80afe4a0ce879c4b%2Fcarbon.png?generation=1699440100527867&alt=media" alt="">

🚀 You can learn more about our high-quality unique datasets here

keywords: biometric system, biometric system attacks, biometric dataset, face recognition database, face recognition dataset, face detection dataset, facial analysis, object detection dataset, deep learning datasets, computer vision datset, human images dataset, human faces dataset
g
Face Recognition Dataset – One-Shot Learning
gts.ai
zip
Updated Oct 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GTS (2025). Face Recognition Dataset – One-Shot Learning [Dataset]. https://gts.ai/dataset-download/face-recognition-dataset-one-shot-learning-ai-data-collection/
Explore at:
zipAvailable download formats
Dataset updated
Oct 27, 2025
Dataset provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
Authors
GTS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The Face Recognition Dataset for One-Shot Learning by Globose Technology Solutions enables AI models to perform face recognition using just a single example per class. It includes diverse facial images covering various demographics, lighting conditions, and expressions for high-quality model training.
Face Detection Dataset
kaggle.com
Updated Dec 30, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sudhanshu Rastogi (2024). Face Detection Dataset [Dataset]. https://www.kaggle.com/datasets/sudhanshu2198/face-detection-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 30, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sudhanshu Rastogi
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This Dataset is created by organizing the WIDER FACE dataset. WIDER FACE dataset is a face detection benchmark dataset, of which images are selected from the publicly available WIDER dataset. We chose 32,203 images and labeled 393,703 faces with a high degree of variability in scale, pose, and occlusion as depicted in the sample images. WIDER FACE dataset is organized based on 61 event classes. For each event class, we randomly select 40%/10%/50% of data as training, validation, and testing sets. We adopt the same evaluation metric employed in the PASCAL VOC dataset.

Original Dataset http://shuoyang1213.me/WIDERFACE/
Multi-race Human Face Data | 200,000 ID | Face Recognition Data| Image/Video...
datarade.ai
Updated Dec 22, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2023). Multi-race Human Face Data | 200,000 ID | Face Recognition Data| Image/Video AI Training Data | Machine Learning(ML) Data [Dataset]. https://datarade.ai/data-products/nexdata-multi-race-human-face-data-200-000-id-image-vi-nexdata
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Dec 22, 2023
Dataset authored and provided by
Nexdata
Area covered
Lao People's Democratic Republic, Bosnia and Herzegovina, Iran (Islamic Republic of), Bulgaria, Canada, Cambodia, Chile, Belarus, Mexico, Germany
Description
Specifications Product : Biometric Data

Data size : 200,000 ID

Race distribution : black people, Caucasian people, brown(Mexican) people, Indian people and Asian people

Gender distribution : gender balance

Age distribution : young, midlife and senior

Collecting environment : including indoor and outdoor scenes

Data diversity : different face poses, races, ages, light conditions and scenes Device : cellphone

Data format : .jpg/png

Accuracy : the accuracy of labels of face pose, race, gender and age are more than 97%

About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 3 million hours of Speech Data and 800TB of Imagery Data. These ready-to-go Machine Learning(ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/computervision?source=Datarade
d
FileMarket | Dataset for Face Anti-Spoofing (Videos) in Computer Vision...
datarade.ai
Updated Jul 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FileMarket (2024). FileMarket | Dataset for Face Anti-Spoofing (Videos) in Computer Vision Applications | Machine Learning (ML) Data | Deep Learning (DL) Data [Dataset]. https://datarade.ai/data-products/filemarket-dataset-for-face-anti-spoofing-videos-in-compu-filemarket
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Jul 10, 2024
Dataset authored and provided by
FileMarket
Area covered
Belarus, Malawi, Zimbabwe, Congo (Democratic Republic of the), Central African Republic, Mali, Sierra Leone, Ukraine, Chad, South Africa
Description
Live Face Anti-Spoof Dataset

A live face dataset is crucial for advancing computer vision tasks such as face detection, anti-spoofing detection, and face recognition. The Live Face Anti-Spoof Dataset offered by Ainnotate is specifically designed to train algorithms for anti-spoofing purposes, ensuring that AI systems can accurately differentiate between real and fake faces in various scenarios.

Key Features:

Comprehensive Video Collection: The dataset features thousands of videos showcasing a diverse range of individuals, including males and females, with and without glasses. It also includes men with beards, mustaches, and clean-shaven faces. Lighting Conditions: Videos are captured in both indoor and outdoor environments, ensuring that the data covers a wide range of lighting conditions, making it highly applicable for real-world use. Data Collection Method: Our datasets are gathered through a community-driven approach, leveraging our extensive network of over 700k users across various Telegram apps. This method ensures that the data is not only diverse but also ethically sourced with full consent from participants, providing reliable and real-world applicable data for training AI models. Versatility: This dataset is ideal for training models in face detection, anti-spoofing, and face recognition tasks, offering robust support for these essential computer vision applications. In addition to the Live Face Anti-Spoof Dataset, FileMarket provides specialized datasets across various categories to support a wide range of AI and machine learning projects:

Object Detection Data: Perfect for training AI in image and video analysis. Machine Learning (ML) Data: Offers a broad spectrum of applications, from predictive analytics to natural language processing (NLP). Large Language Model (LLM) Data: Designed to support text generation, chatbots, and machine translation models. Deep Learning (DL) Data: Essential for developing complex neural networks and deep learning models. Biometric Data: Includes diverse datasets for facial recognition, fingerprint analysis, and other biometric applications. This live face dataset, alongside our other specialized data categories, empowers your AI projects by providing high-quality, diverse, and comprehensive datasets. Whether your focus is on anti-spoofing detection, face recognition, or other biometric and machine learning tasks, our data offerings are tailored to meet your specific needs.
Gender Detection & Classification - Face Dataset
kaggle.com
Updated Oct 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Unique Data (2023). Gender Detection & Classification - Face Dataset [Dataset]. https://www.kaggle.com/datasets/trainingdatapro/gender-detection-and-classification-image-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 31, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Unique Data
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
Gender Detection & Classification - face recognition dataset

The dataset is created on the basis of Medical Masks Dataset dataset

Dataset Description:

The dataset comprises a collection of photos of people, organized into folders labeled "women" and "men." Each folder contains a significant number of images to facilitate training and testing of gender detection algorithms or models.

The dataset contains a variety of images capturing female and male individuals from diverse backgrounds, age groups, and ethnicities.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F1c4708f0b856f7889e3c0eea434fe8e2%2FFrame%2045%20(1).png?generation=1698764294000412&alt=media" alt="">

This labeled dataset can be utilized as training data for machine learning models, computer vision applications, and gender detection algorithms.

👉 Legally sourced datasets and carefully structured for AI training and model development. Explore samples from our dataset of 95,000+ human images & videos - Full dataset

Metadata for the full dataset:

assignment_id - unique identifier of the media file

worker_id - unique identifier of the person

age - age of the person

true_gender - gender of the person

country - country of the person

ethnicity - ethnicity of the person

photo_1_extension, photo_2_extension, photo_3_extension, photo_4_extension - photo extensions in the dataset

photo_1_resolution, photo_2_resolution, photo_3_extension, photo_4_resolution - photo resolution in the dataset

🧩 This is just an example of the data. Leave a request here to learn more

Content

The dataset is split into train and test folders, each folder includes: - folders women and men - folders with images of people with the corresponding gender, - .csv file - contains information about the images and people in the dataset

File with the extension .csv

file: link to access the file,

gender: gender of a person in the photo (woman/man),

split: classification on train and test

🚀 You can learn more about our high-quality unique datasets here

keywords: biometric system, biometric system attacks, biometric dataset, face recognition database, face recognition dataset, face detection dataset, facial analysis, gender detection, supervised learning dataset, gender classification dataset, gender recognition dataset
d
FileMarket | Diverse Human Face Data | 20,000 IDs | Face Recognition Data |...
datarade.ai
Updated Jul 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FileMarket (2024). FileMarket | Diverse Human Face Data | 20,000 IDs | Face Recognition Data | Image/Video AI Training Data | Biometric Data [Dataset]. https://datarade.ai/data-products/filemarket-diverse-human-face-data-20-000-ids-face-reco-filemarket
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Jul 5, 2024
Dataset authored and provided by
FileMarket
Area covered
Oman, Georgia, Hong Kong, Libya, Curaçao, United Kingdom, Martinique, Sri Lanka, Iceland, Kyrgyzstan
Description
Biometric Data

FileMarket provides a comprehensive Biometric Data set, ideal for enhancing AI applications in security, identity verification, and more. In addition to Biometric Data, we offer specialized datasets across Object Detection Data, Machine Learning (ML) Data, Large Language Model (LLM) Data, and Deep Learning (DL) Data. Each dataset is meticulously crafted to support the development of cutting-edge AI models.

Data Size: 20,000 IDs

Race Distribution: The dataset encompasses individuals from diverse racial backgrounds, including Black, Caucasian, Indian, and Asian groups.

Gender Distribution: The dataset equally represents all genders, ensuring a balanced and inclusive collection.

Age Distribution: The data spans a broad age range, including young, middle-aged, and senior individuals, providing comprehensive age coverage.

Collection Environment: Data has been gathered in both indoor and outdoor environments, ensuring variety and relevance for real-world applications.

Data Diversity: This dataset includes a rich variety of face poses, racial backgrounds, age groups, lighting conditions, and scenes, making it ideal for robust biometric model training.

Device: All data has been collected using mobile phones, reflecting common real-world usage scenarios.

Data Format: The data is provided in .jpg and .png formats, ensuring compatibility with various processing tools and systems.

Accuracy: The labels for face pose, race, gender, and age are highly accurate, exceeding 95%, making this dataset reliable for training high-performance biometric models.
g
Tufts Face Database
gts.ai
json
Updated Dec 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED (2023). Tufts Face Database [Dataset]. https://gts.ai/dataset-download/tufts-face-database-ai-data-collection-company/
Explore at:
jsonAvailable download formats
Dataset updated
Dec 3, 2023
Dataset authored and provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The Tufts Face Database is a comprehensive collection of human face images, ideal for facial recognition, biometric verification, and computer vision model training. It includes diverse data by ethnicity, age, gender, and region for robust AI development.
Facial Recognition Dataset
kaggle.com
zip
Updated Jun 30, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gotam Dahiya (2020). Facial Recognition Dataset [Dataset]. https://www.kaggle.com/apollo2506/facial-recognition-dataset
Explore at:
zip(62587032 bytes)Available download formats
Dataset updated
Jun 30, 2020
Authors
Gotam Dahiya
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Content

This dataset contains folders pertaining to different expressions of the human face, namely , Surprise, Anger, Happiness, Sad, Neutral, Disgust, Fear.

The folders are split into two super-folders, Training and Testing, so that it can become easier for the end user to configure any model using this data.

The training set consists of 28,079 samples in total with the testing set consisting of 7,178 samples in total. The data consists of 48x48 pixel grayscale images of faces. The faces have been automatically registered so that the face is more or less centered and occupies about the same amount of space in each image.

Acknowledgements

This dataset was obtained from the competition "Challenges in Representation Learning: Facial Expression Recognition Challenge"

This dataset was prepared by Pierre-Luc Carrier and Aaron Courville, as part of an ongoing research project. They have graciously provided the workshop organizers with a preliminary version of their dataset to use for this contest.

The code for splitting the data into different directories was provided by Jainam Mehta. Here is the link to the code: Create Training and Testing
Multi-race Human Face Data | 200,000 ID | Face Recognition Data| Image/Video...
data.nexdata.ai
Updated Aug 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2024). Multi-race Human Face Data | 200,000 ID | Face Recognition Data| Image/Video AI Training Data | Biometric AI Datasets [Dataset]. https://data.nexdata.ai/products/nexdata-multi-race-human-face-data-200-000-id-image-vi-nexdata
Explore at:
Dataset updated
Aug 3, 2024
Dataset authored and provided by
Nexdata
Area covered
Hong Kong, Saudi Arabia, Turkmenistan, Romania, Afghanistan, Uzbekistan, Austria, Brazil, Montenegro, India
Description
Off-the-shelf biometric data (human face) covers 3D depth, segmentation: face organs and accessory, key points, facial expression, alpha Matte, age in variety and etc. All the Biometric Data are collected with signed authorization agreement.
Face Recognition Dataset – 10,109 People with Multi-angle Face Images and...
nexdata.ai
Updated Jun 14, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2024). Face Recognition Dataset – 10,109 People with Multi-angle Face Images and Demographic Labels [Dataset]. https://www.nexdata.ai/datasets/1402?source=Github
Explore at:
Dataset updated
Jun 14, 2024
Dataset authored and provided by
Nexdata
Variables measured
Data size, Data format, Data diversity, Age distribution, Race distribution, Gender distribution, Collecting environment
Description
This large-scale face image dataset features 10,109 individuals from various countries and ethnic backgrounds. Each subject has been captured in multiple real-world scenarios, resulting in diverse facial images under varying angles, lighting conditions, and expressions. Detailed annotations include gender, race, and age, making the dataset suitable for tasks such as facial recognition, face clustering, demographic analysis, and machine learning model training.The dataset has been validated by multiple AI companies and proven to deliver strong performance in real-world applications. All data collection, storage, and processing strictly adhere to global data protection regulations, including GDPR, CCPA, and PIPL, ensuring legal compliance and privacy preservation.
m
Human Faces and Objects Mix Image Dataset
data.mendeley.com
Updated Mar 13, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bindu Garg (2025). Human Faces and Objects Mix Image Dataset [Dataset]. http://doi.org/10.17632/nzwvnrmwp3.1
Explore at:
Unique identifier
https://doi.org/10.17632/nzwvnrmwp3.1
Dataset updated
Mar 13, 2025
Authors
Bindu Garg
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Dataset Description: Human Faces and Objects Dataset (HFO-5000) The Human Faces and Objects Dataset (HFO-5000) is a curated collection of 5,000 images, categorized into three distinct classes: male faces (1,500), female faces (1,500), and objects (2,000). This dataset is designed for machine learning and computer vision applications, including image classification, face detection, and object recognition. The dataset provides high-quality, labeled images with a structured CSV file for seamless integration into deep learning pipelines.

Column Description: The dataset is accompanied by a CSV file that contains essential metadata for each image. The CSV file includes the following columns: file_name: The name of the image file (e.g., image_001.jpg). label: The category of the image, with three possible values: "male" (for male face images) "female" (for female face images) "object" (for images of various objects) file_path: The full or relative path to the image file within the dataset directory.

Uniqueness and Key Features: 1) Balanced Distribution: The dataset maintains an even distribution of human faces (male and female) to minimize bias in classification tasks. 2) Diverse Object Selection: The object category consists of a wide variety of items, ensuring robustness in distinguishing between human and non-human entities. 3) High-Quality Images: The dataset consists of clear and well-defined images, suitable for both training and testing AI models. 4) Structured Annotations: The CSV file simplifies dataset management and integration into machine learning workflows. 5) Potential Use Cases: This dataset can be used for tasks such as gender classification, facial recognition benchmarking, human-object differentiation, and transfer learning applications.

Conclusion: The HFO-5000 dataset provides a well-structured, diverse, and high-quality set of labeled images that can be used for various computer vision tasks. Its balanced distribution of human faces and objects ensures fairness in training AI models, making it a valuable resource for researchers and developers. By offering structured metadata and a wide range of images, this dataset facilitates advancements in deep learning applications related to facial recognition and object classification.
F
Middle Eastern Children Facial Image Dataset for Facial Recognition
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Middle Eastern Children Facial Image Dataset for Facial Recognition [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-minor-middle-eastern
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
The Middle Eastern Children Facial Image Dataset is a thoughtfully curated collection designed to support the development of advanced facial recognition systems, biometric identity verification, age estimation tools, and child-specific AI models. This dataset enables researchers and developers to build highly accurate, inclusive, and ethically sourced AI solutions for real-world applications.
Facial Image Data
The dataset includes over 1000 high-resolution image sets of children under the age of 18. Each participant contributes approximately 15 unique facial images, captured to reflect natural variations in appearance and context.
Diversity and Representation
•
Geographic Coverage: Children from Egypt, Jordan, Suadi Arabia, UAE, Tunisia, and more

•
Age Group: All participants are minors, with a wide age spread across childhood and adolescence.

•
Gender Balance: Includes both boys and girls, representing a balanced gender distribution.

•
File Formats: Images are available in JPEG and HEIC formats.

Quality and Image Conditions
To ensure robust model training and generalizability, images are captured under varied natural conditions:
•
Lighting: A mix of lighting setups, including indoor, outdoor, bright, and low-light scenarios.

•
Backgrounds: Diverse backgrounds—plain, natural, and everyday environments—are included to promote realism.

•
Capture Devices: All photos are taken using modern mobile devices, ensuring high resolution and sharp detail.

Metadata
Each child’s image set is paired with detailed, structured metadata, enabling granular control and filtering during model training:
•Unique Participant ID
•File Name
•Age
•Gender
•Country
•Demographic Attributes
•File Format
This metadata is essential for applications that require demographic awareness, such as region-specific facial recognition or bias mitigation in AI models.
Applications
This dataset is ideal for a wide range of computer vision use cases, including:
•
Facial Recognition: Improving identification accuracy across diverse child demographics.

•
KYC and Identity Verification: Enabling more inclusive onboarding processes for child-specific platforms.

•
Biometric Systems: Supporting child-focused identity verification in education, healthcare, or travel.

•
Age Estimation: Training AI models to estimate age ranges of children from facial features.

•
Child Safety Models: Assisting in missing child identification or online content moderation.

•
Generative AI Training: Creating more representative synthetic data using real-world diverse inputs.

Ethical Collection and Data Security
We maintain the highest ethical and security standards throughout the data lifecycle:
•
Guardian Consent: Every participant’s guardian provided informed, written consent, clearly outlining the dataset’s use cases.

•
Privacy-First Approach: Personally identifiable information is not shared. Only anonymized metadata is included.

•
Secure Storage: <span style="font-weight:
g
CelebA Face Recognition Triplets
gts.ai
json
Updated Nov 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GTS (2023). CelebA Face Recognition Triplets [Dataset]. https://gts.ai/dataset-download/celeba-face-recognition-triplets/
Explore at:
jsonAvailable download formats
Dataset updated
Nov 20, 2023
Dataset provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
Authors
GTS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
CelebA Face Recognition Triplets is a high-quality dataset designed for facial recognition research, particularly optimized for training models using triplet loss architectures. It features curated face triplets supporting robust identity verification, matching, and embedding learning.
g
Dog Face Recognition Model
gts.ai
json
Updated Jun 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GTS (2024). Dog Face Recognition Model [Dataset]. https://gts.ai/dataset-download/dog-face-recognition-model/
Explore at:
jsonAvailable download formats
Dataset updated
Jun 16, 2024
Dataset provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
Authors
GTS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
A curated dataset of 160x160 resolution dog face images optimized for training and evaluating dog face recognition and identification models.
F
Middle Eastern Occluded Facial Image Dataset
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). Middle Eastern Occluded Facial Image Dataset [Dataset]. https://www.futurebeeai.com/dataset/image-dataset/facial-images-occlusion-middle-east
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Dataset funded by
FutureBeeAI
Description
Introduction
Welcome to the Middle Eastern Human Face with Occlusion Dataset, carefully curated to support the development of robust facial recognition systems, occlusion detection models, biometric identification technologies, and KYC verification tools. This dataset provides real-world variability by including facial images with common occlusions, helping AI models perform reliably under challenging conditions.
Facial Image Data
The dataset comprises over 3,000 high-quality facial images, organized into participant-wise sets. Each set includes:
•
Occluded Images: 5 images per individual featuring different types of facial occlusions, masks, caps, sunglasses, or combinations of these accessories

•
Normal Image: 1 reference image of the same individual without any occlusion

Diversity & Representation
•
Geographic Coverage: Participants from across Egypt, Jordan, Suadi Arabia, UAE, Tunisia, and more Middle Eastern countries

•
Demographics: Individuals aged 18 to 70 years, with a 60:40 male-to-female ratio

•
File Formats: Images available in JPEG and HEIC formats

Image Quality & Capture Conditions
To ensure robustness and real-world utility, images were captured under diverse conditions:
•
Lighting Variations: Includes both natural and artificial lighting scenarios

•
Background Diversity: Indoor and outdoor backgrounds for model generalization

•
Device Quality: Captured using the latest smartphones to ensure high resolution and consistency

Metadata
Each image is paired with detailed metadata to enable advanced filtering, model tuning, and analysis:
•Unique Participant ID
•File Name
•Age
•Gender
•Country
•Demographic Profile
•Type of Occlusion
•File Format
This rich metadata helps train models that can recognize faces even when partially obscured.
Use Cases & Applications
This dataset is ideal for a wide range of real-world and research-focused applications, including:
•
Facial Recognition under Occlusion: Improve model performance when faces are partially hidden

•
Occlusion Detection: Train systems to detect and classify facial accessories like masks or sunglasses

•
Biometric Identity Systems: Enhance verification accuracy across varying conditions

•
KYC & Compliance: Support face matching even when the selfie includes common occlusions.

•
Security & Surveillance: Strengthen access control and monitoring systems in environments with mask usage

Secure & Ethical Collection
•
Data Security: Collected and processed securely on FutureBeeAI’s proprietary platform

•
Ethical Compliance: Follows strict guidelines for participant privacy and informed consent

•
Transparent Participation: All contributors provided written consent and were informed of the intended use

Dataset
29,523 People Face Recognition Dataset with ID Photos (Multi-race, Real-Life...
nexdata.ai
Updated Sep 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nexdata (2024). 29,523 People Face Recognition Dataset with ID Photos (Multi-race, Real-Life Image) [Dataset]. https://www.nexdata.ai/datasets/computervision/1020
Explore at:
Dataset updated
Sep 10, 2024
Dataset authored and provided by
Nexdata
Variables measured
Device, accuracy, Data size, Data format, Data diversity, Age distribution:, Race distribution, Gender distribution, Collecting environment
Description
This dataset contains 29,523 individuals. For each subject, one ID photo and 5-10 life photos were collected, the race distribution covering Asian, Caucasian, black and brown races. This data can be used for training and evaluating face recognition models, identity verification systems, and AI-based authentication solutions.
u
Face recognition scoping article data
repository.uj.ac.za
figshare.com
message/news
Updated May 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ANDISANI NEMAVHOLA; Andisani Nemavhola (2024). Face recognition scoping article data [Dataset]. http://doi.org/10.25415/ujhb.25792587.v1
Explore at:
message/newsAvailable download formats
Unique identifier
https://doi.org/10.25415/ujhb.25792587.v1
Dataset updated
May 11, 2024
Dataset provided by
University of Johannesburg
Authors
ANDISANI NEMAVHOLA; Andisani Nemavhola
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data is used in a scoping review of face recognition using CNN architectures. The research seeks to investigate various CNN architectures and their capabilities. The data contains a list of articles that were consulted.

Facebook

Twitter

Click to copy link

Link copied

Cite

Sanjana chaudhari☑️ (2023). Face Recognition Train [Dataset]. https://www.kaggle.com/datasets/sanjanchaudhari/face-recog-train

Face Recognition Train

Training a Face Recognition Model 👨‍💻

Explore at:

zip(95864 bytes)Available download formats

Dataset updated

Jun 20, 2023

Authors

Sanjana chaudhari☑️

License

ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically

Description

Face Recognition Train

Face recognition is a technology that involves identifying or verifying individuals by analyzing their facial features. It has gained significant popularity and has various applications, including security systems, access control, surveillance, and personalized user experiences.

The process of face recognition typically involves the following steps:

Face detection: A face detection algorithm is used to locate and extract faces from an image or a video frame. This step helps in isolating the facial region for further analysis.

Face alignment and preprocessing: The extracted face images are usually aligned to a standardized size and orientation to account for variations in pose, scale, and rotation. Preprocessing techniques may be applied to normalize lighting conditions, remove noise, and enhance the quality of the images.

Feature extraction: Meaningful features are extracted from the aligned face images to represent the unique characteristics of each individual. These features are often represented as numerical vectors, capturing specific facial attributes or patterns. Traditional methods like Eigenfaces, Fisherfaces, or Local Binary Patterns (LBP) can be used, but deep learning-based approaches like Convolutional Neural Networks (CNNs) have shown superior performance in recent years.

Feature encoding and representation: The extracted features are encoded into a compact representation, making it easier to compare and match them against other faces. Techniques like Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), or more advanced methods like Siamese networks or Triplet Loss can be employed for encoding the face features.

Face matching and recognition: During this stage, the extracted and encoded features are compared to a database of known faces or a set of reference features. The goal is to find the closest match or determine the identity of the individual represented by the face image. Various similarity metrics such as Euclidean distance, cosine similarity, or more sophisticated techniques like metric learning can be utilized for face matching.

Decision and classification: Based on the comparison results, a decision is made to recognize or classify the input face image. If a match is found within the database, the system can provide the identity of the person associated with the recognized face.

Clear search

Close search

Google apps

Main menu

Face Recognition Train

Face Recognition Train

Custom Face Recognition Image Dataset

Image Dataset of face images for compuer vision tasks

Metadata for the dataset

💵 Buy the Dataset: This is a limited preview of the data. To access the full dataset, please contact us at https://unidata.pro to discuss your requirements and pricing options.

🌐 UniData provides high-quality datasets, content moderation, data collection and annotation for your AI/ML projects

Face Detection - Face Recognition Dataset

Face Detection - Object Detection & Face Recognition Dataset

The dataset is created on the basis of Selfies and ID Dataset

👉 Legally sourced datasets and carefully structured for AI training and model development. Explore samples from our dataset of 95,000+ human images & videos - Full dataset

Metadata for the full dataset:

OTHER BIOMETRIC DATASETS:

🧩 This is just an example of the data. Leave a request here to learn more

Dataset structure

Data Format

Example of XML file structure

Face Recognition Dataset – One-Shot Learning

Face Detection Dataset

Multi-race Human Face Data | 200,000 ID | Face Recognition Data| Image/Video...

FileMarket | Dataset for Face Anti-Spoofing (Videos) in Computer Vision...

Gender Detection & Classification - Face Dataset

Gender Detection & Classification - face recognition dataset

The dataset is created on the basis of Medical Masks Dataset dataset

👉 Legally sourced datasets and carefully structured for AI training and model development. Explore samples from our dataset of 95,000+ human images & videos - Full dataset

Metadata for the full dataset:

🧩 This is just an example of the data. Leave a request here to learn more

Content

File with the extension .csv

FileMarket | Diverse Human Face Data | 20,000 IDs | Face Recognition Data |...

Tufts Face Database

Facial Recognition Dataset

Content

Acknowledgements

Multi-race Human Face Data | 200,000 ID | Face Recognition Data| Image/Video...

Face Recognition Dataset – 10,109 People with Multi-angle Face Images and...

Human Faces and Objects Mix Image Dataset

Middle Eastern Children Facial Image Dataset for Facial Recognition

Introduction

Facial Image Data

Diversity and Representation

Quality and Image Conditions

Metadata

Applications

Ethical Collection and Data Security

CelebA Face Recognition Triplets

Dog Face Recognition Model

Middle Eastern Occluded Facial Image Dataset

Introduction

Facial Image Data

Diversity & Representation

Image Quality & Capture Conditions

Metadata

Use Cases & Applications

Secure & Ethical Collection

Dataset

29,523 People Face Recognition Dataset with ID Photos (Multi-race, Real-Life...

Face recognition scoping article data

Face Recognition Train

Training a Face Recognition Model 👨‍💻

Face Recognition Train