Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The dataset contains around 9.6k images of human faces which are both real images and those generated by AI.
The zip contains two folders: - Real Images: 5000 images of real human faces - AI-Generated Images: 4630 images of ai-generated human faces.
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Image Dataset of face images for compuer vision tasks
Dataset comprises 500,600+ images of individuals representing various races, genders, and ages, with each person having a single face image. It is designed for facial recognition and face detection research, supporting the development of advanced recognition systems. By leveraging this dataset, researchers and developers can enhance deep learning models, improve face verification and face identification techniques, and refine… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/face-recognition-image-dataset.
Facebook
TwitterThis dataset is a curated subset of the CelebFaces Attributes (CelebA) Dataset, handpicked for deep learning tasks such as image synthesis and facial recognition. It includes 50,000 celebrity face images from diverse identities, covering a wide range of poses, backgrounds, and facial attributes. These images are suitable for experimenting with GANs, facial recognition models, and other machine learning tasks related to face analysis.
This dataset is perfect for hobbyists, researchers, and machine learning practitioners looking to experiment with a manageable yet diverse collection of celebrity face images.
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Dataset of face images with different angles and head positions
Dataset contains 23,110 individuals, each contributing 28 images featuring various angles and head positions, diverse backgrounds, and attributes, along with 1 ID photo. In total, the dataset comprises over 670,000 images in formats such as JPG and PNG. It is designed to advance face recognition and facial recognition research, focusing on person re-identification and recognition systems. By utilizing this dataset… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/face-re-identification-image-dataset.
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Face Recognition, Face Detection, Male Photo Dataset 👨
The dataset is created on the basis of Selfies and ID Dataset
110,000+ photos of 74,000+ men from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are men. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Our dataset will diversify your data by adding more photos of men of different ages and ethnic groups… See the full description on the dataset page: https://huggingface.co/datasets/UniqueData/male-selfie-image-dataset.
Facebook
Twitterhttps://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the South Asian Human Face with Occlusion Dataset, carefully curated to support the development of robust facial recognition systems, occlusion detection models, biometric identification technologies, and KYC verification tools. This dataset provides real-world variability by including facial images with common occlusions, helping AI models perform reliably under challenging conditions.
The dataset comprises over 5,000 high-quality facial images, organized into participant-wise sets. Each set includes:
To ensure robustness and real-world utility, images were captured under diverse conditions:
Each image is paired with detailed metadata to enable advanced filtering, model tuning, and analysis:
This rich metadata helps train models that can recognize faces even when partially obscured.
This dataset is ideal for a wide range of real-world and research-focused applications, including:
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Fake Ai generated Human faces using Stable Diffusion 1.5, 2.1, and SDXL 1.0 checkpoint. The main objective was to generate photos that were as realistic as possible, without any specific style, focusing mainly on the face.
Fake Ai generated Human faces
More details on the images and the process of creating the images in the readme file.
The data is not mine, the data is taken from a GitHub repository to a user named: tobecwb Repo link: https://github.com/tobecwb/stable-diffusion-face-dataset
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Accurately estimated foreground object in images. Dataset for editing applications for creating visual effects.
Includes 2 folders: - images - original images of faces - masks - matting masks for images
keywords: head segmentation dataset, face-generation, segmentation, human faces, portrait segmentation, human face extraction, image segmentation, annotation, biometric dataset, biometric data dataset, face recognition database, facial recognition, face forgery detection, face shape, ar, augmented reality, face detection dataset, facial analysis, human images dataset, hair segmentation, matting, image matting, computer vision, deep learning, potrait matting, natural image matting
Facebook
TwitterInstagram Faces Image dataset with diverse single-face images for facial recognition, anti-spoofing, and computer vision
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Our LSLF dataset consists of 1,195,976 labeled face images for 11,459 individuals. These images are stored in JPEG format with a total size of 5.36 GB. Individuals have a minimum of 1 face image and a maximum of 1,157 face images. The average number of face images per individual is 104. Each image is automatically named as (PersonName VideoNumber FrameNumber ImageNuumber) and stored in the related individual folder.
Facebook
TwitterAI-generated, high-quality face datasets. Based on model-released photos. Diverse expressions, ethnicities, and age groups. Excellent for face recognition and analysis projects.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Similar face recognition has always been one of the most challenging research directions in face recognition.This project shared similar face images (SFD.zip) that we have collected so far. All images are labeld and collected from publicly available datasets such as LFW, CASIA-WebFace.We will continue to collect larger-scale data and continue to update this project.Because the data set is too large, we uploaded a compressed zip file (SFD.zip). Meanwhile here we upload a few examples for everyone to view.email: ileven@shu.edu.cn
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The BioID Face Database has been recorded and is published to give all researchers working in the area of face detection the possibility to compare the quality of their face detection algorithms with others. During the recording special emphasis has been laid on real world conditions. Therefore the testset features a large variety of illumination, background and face size. The dataset consists of 1521 gray level images with a resolution of 384x286 pixel. Each one shows the frontal view of a face of one out of 23 different test persons. For comparison reasons the set also contains manually set eye postions. The images are labeled BioID_xxxx.pgm where the characters xxxx are replaced by the index of the current image (with leading zeros). Similar to this, the files BioID_xxxx.eye contain the eye positions for the corresponding images.
Facebook
Twitterhttps://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the Caucasian Human Face with Occlusion Dataset, carefully curated to support the development of robust facial recognition systems, occlusion detection models, biometric identification technologies, and KYC verification tools. This dataset provides real-world variability by including facial images with common occlusions, helping AI models perform reliably under challenging conditions.
The dataset comprises over 3,000 high-quality facial images, organized into participant-wise sets. Each set includes:
To ensure robustness and real-world utility, images were captured under diverse conditions:
Each image is paired with detailed metadata to enable advanced filtering, model tuning, and analysis:
This rich metadata helps train models that can recognize faces even when partially obscured.
This dataset is ideal for a wide range of real-world and research-focused applications, including:
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
A large-scale dataset for age estimation from facial images, including Indian Movie Face Database (IMFDB) with 19,906 labeled images and UTKFace with over 20,000 images labeled with age, gender, and ethnicity. Useful for AI, biometrics, and facial recognition research.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Pgu-Face dataset contains 896 images from 224 different subjects. All of the subjects was Iranian men and most of them live in tropical regions of the southwest of Iran. The range of age of the subject's was 16 to 82 years with average 27.89 years. In addition, we make the following information available for the subjects: age and quality of the camera in mega pixels.
Facebook
Twitterhttps://academictorrents.com/nolicensespecifiedhttps://academictorrents.com/nolicensespecified
The "Labeled Faces in the Wild-a" image collection is a database of labeled, face images intended for studying Face Recognition in unconstrained images. It contains the same images available in the original Labeled Faces in the Wild data set, however, here we provide them after alignment using a commercial face alignment software. Some of our results, published in [1,2,3], were produced using these images. We show this alignment to improve the performance of face recognition algorithms. More information on how these images were aligned may be found in the two papers. We have maintained the same directory structure as in the original LFW data set, and so these images can be used as direct substitutes for those in the original image set. Note, however, that the images available here are grayscale versions of the originals. Citation: If you find these images useful and use them in your work, please follow these guidlines: Comply with any instructions specified for the original L
Facebook
TwitterAttribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Infrared Face Detection Dataset
Dataset contains 125,500+ images, including infrared images, from 4,484 individuals with or without a mask of various races, genders, and ages. It is specifically designed for research in face recognition and facial recognition technology, focusing on the unique challenges posed by thermal infrared imaging. By utilizing this dataset, researchers and developers can enhance their understanding of recognition systems and improve the recognition accuracy… See the full description on the dataset page: https://huggingface.co/datasets/UniDataPro/infrared-face-recognition-dataset.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
SoloFace: A Single-Face Dataset for Resource-Constrained Face Detection and Tracking
Description
SoloFace is a custom dataset derived from the COCO-Faces and Visual Wake Word datasets, specifically designed for single-face detection tasks in resource-constrained environments. This dataset is ideal for developing machine learning models for embedded AI applications, such as TinyML, which operate on low-power devices. Each image either contains a single human face or no face, with corresponding labels providing class information and bounding box coordinates for face detection. The dataset includes data augmentation to ensure robustness across diverse conditions, such as variations in lighting, scale, and orientation.
Dataset Structure
The dataset is organized into three subsets: train, test, and val. Each subset contains:
images/: .jpg image files.labels/: .json label files with matching filenames to the images.Label Format
Each .json label file includes:
image: Name of the corresponding image file.class: 1 if a face is present, 0 otherwise.bbox: Normalized bounding box coordinates [top_left_x, top_left_y, bottom_right_x, bottom_right_y]. If no face is present, the bounding box is set to [0.0, 0.0, 0.01, 0.01].Statistics
Original Dataset:
After Data Augmentation:
Class Distribution:
Data Augmentation Details
To improve model robustness, the following augmentation techniques were applied to the training set:
Each augmentation preserved bounding box consistency with the transformed images.
Usage This dataset supports the following use cases:
Loading the Dataset
unzip soloface-detection-dataset.zip
soloface-detection-dataset/
├── train/
│ ├── images/
│ ├── labels/
├── test/
│ ├── images/
│ ├── labels/
├── val/
│ ├── images/
│ ├── labels/
License
This dataset is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
For more details, visit the CC BY 4.0 License.
Contact
For inquiries or collaborations, please contact:
sahabidyut999@gmail.comstudy.riya1792@gmail.comThis format fits Zenodo's description field requirements while providing clarity and structure. Let me know if further refinements are needed!
Facebook
TwitterThe 110 People – Human Face Image Data is gathered through camera shot involving 110 participants, with a proper balance of gender ratio and age group distribution covering major skin tones. Each person contributes 2100 pictures with glasses/ no glasses, expressions, camera shooting angle, and lighting conditions. All Attributes are annotated such as gender, age, expression, etc. The overall accuracy rate is ≥ 97%.This dataset is suitable for face recognition, facial expression analysis, and AI training.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The dataset contains around 9.6k images of human faces which are both real images and those generated by AI.
The zip contains two folders: - Real Images: 5000 images of real human faces - AI-Generated Images: 4630 images of ai-generated human faces.