Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
90,000+ photos of 46,000+ women from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are women. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups.
Our dataset will diversify your data by adding more photos of women of different ages and ethnic groups, enhancing the quality of your model.
People in the dataset
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2Fd1b31bcda4a90b808473dbe5970bebfb%2FFrame%20108.png?generation=1714148221118707&alt=media" alt="">
The dataset can be utilized for a wide range of tasks, including face recognition, age estimation, image feature extraction, or any problem related to human image analysis.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F2796accc4d7b47e8e1ac02701f4eac7b%2FFemale%20Images.png?generation=1714147921067232&alt=media" alt="">
The dataset consists of: - files - includes 20 images corresponding to each person in the sample, - .csv file - contains information about the images and people in the dataset
keywords: biometric system, biometric dataset, face recognition database, face recognition dataset, face detection dataset, facial analysis, object detection dataset, deep learning datasets, computer vision datset, human images dataset, human faces dataset, machine learning, image-to-image, verification models, digital photo-identification, women images, females dataset, female selfie, female face recognition
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
About this dataset
The Face Recognition Dataset is a collection of 2482 annotated images of human faces collected and labeled by Noor F. Abdul Hassan, Basrah University. This Dataset was created for the purpose of training on YOLO Models.
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the Middle Eastern Human Face with Occlusion Dataset, meticulously curated to enhance face recognition models and support the development of advanced occlusion detection systems, biometric identification systems, KYC models, and other facial recognition technologies.
This dataset comprises over 3,000 human facial images, divided into participant-wise sets with each set including:
The dataset includes contributions from a diverse network of individuals across Middle Eastern countries:
To ensure high utility and robustness, all images are captured under varying conditions:
Each facial image set is accompanied by detailed metadata for each participant, including:
This metadata is essential for training models that can accurately recognize and identify human faces with occlusions across different demographics and conditions.
This facial image dataset is ideal for various applications in the field of computer vision, including but not limited to:
We understand the evolving nature of AI and machine
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Face Recognition, Face Detection, Male Photo Dataset 👨
If you are interested in biometric data - visit our website to learn more and buy the dataset :)
110,000+ photos of 74,000+ men from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are men. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Our dataset will diversify your data by adding more photos of men of… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/male-selfie-image-dataset.
Data size : 200,000 ID
Race distribution : black people, Caucasian people, brown(Mexican) people, Indian people and Asian people
Gender distribution : gender balance
Age distribution : young, midlife and senior
Collecting environment : including indoor and outdoor scenes
Data diversity : different face poses, races, ages, light conditions and scenes Device : cellphone
Data format : .jpg/png
Accuracy : the accuracy of labels of face pose, race, gender and age are more than 97%
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the Hispanic Human Facial Images Dataset, meticulously curated to enhance face recognition models and support the development of advanced biometric identification systems, KYC models, and other facial recognition technologies.
This dataset comprises over 1500 Hispanic individual facial image sets, with each set including:
[object Object][object Object]The dataset includes contributions from a diverse network of individuals across Hispanic countries.
[object Object][object Object][object Object]To ensure high utility and robustness, all images are captured under varying conditions:
[object Object][object Object][object Object]Each facial image set is accompanied by detailed metadata for each participant, including:
[object Object][object Object][object Object][object Object][object Object][object Object][object Object]This metadata is essential for training models that can accurately recognize and identify faces across different demographics and conditions.
This facial image dataset is ideal for various applications in the field of computer vision, including but not limited to:
[object Object][object Object][object Object][object Object][object Object]We understand the evolving nature of AI and machine learning requirements. Therefore, we continuously add more assets with diverse conditions to this off-the-shelf facial image dataset.
[object Object][object Object][object Object][object Object][object Object][object Object][object Object]This facial image training dataset is created by FutureBeeAI and is available for commercial use.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
The dataset consists of photos featuring the same individuals captured before and after botox injections procedure. The dataset contains a diverse range of individuals with various ages, ethnicities and genders.
The dataset is useful for evaluation of the effectiveness of botox injections for different skin and face types, face recognition and reidentification tasks. It can be utilised for biometric tasks , in beauty sphere, for medical purposes and e-commerce.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2Ff4e4aa816bc874020b913bbc3cabfe16%2F96.png?generation=1693400980674716&alt=media" alt="">
includes the following information for each set of media files: - person: id of the person, - before: link to the photo before the injection, - after: link to the photo after the injection
keywords: beauty, cosmetics, botox treated faces, botulinum toxin, aesthetic surgery, cosmetic surgery data, plastic surgery, facial plastic surgery, biometric dataset, biometric data dataset, face recognition database, face recognition dataset, face detection dataset, facial analysis, human images dataset, image segmentation, images dataset, computer vision, deep learning dataset, augmented reality, ar, human identification, re-identification, health
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Welcome to the African Human Facial Images Dataset, meticulously curated to enhance face recognition models and support the development of advanced biometric identification systems, KYC models, and other facial recognition technologies.
This dataset comprises over 2,000 African individual facial image sets, with each set including:
The dataset includes contributions from a diverse network of individuals across African countries.
To ensure high utility and robustness, all images are captured under varying conditions:
Each facial image set is accompanied by detailed metadata for each participant, including:
This metadata is essential for training models that can accurately recognize and identify faces across different demographics and conditions.
This facial image dataset is ideal for various applications in the field of computer vision, including but not limited to:
We understand the evolving nature of AI and machine learning requirements. Therefore, we continuously add more assets with diverse conditions to this off-the-shelf facial image dataset.
WIDER FACE dataset is a face detection benchmark dataset, of which images are selected from the publicly available WIDER dataset. We choose 32,203 images and label 393,703 faces with a high degree of variability in scale, pose and occlusion as depicted in the sample images. WIDER FACE dataset is organized based on 61 event classes. For each event class, we randomly select 40%/10%/50% data as training, validation and testing sets. We adopt the same evaluation metric employed in the PASCAL VOC dataset. Similar to MALF and Caltech datasets, we do not release bounding box ground truth for the test images. Users are required to submit final prediction files, which we shall proceed to evaluate.
To use this dataset:
import tensorflow_datasets as tfds
ds = tfds.load('wider_face', split='train')
for ex in ds.take(4):
print(ex)
See the guide for more informations on tensorflow_datasets.
https://storage.googleapis.com/tfds-data/visualization/fig/wider_face-0.1.0.png" alt="Visualization" width="500px">
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Face Recognition, Face Detection, Female Photo Dataset 👩
If you are interested in biometric data - visit our website to learn more and buy the dataset :)
90,000+ photos of 46,000+ women from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are women. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Our dataset will diversify your data by adding more photos of women of… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/female-selfie-image-dataset.
FileMarket provides premium Large Language Model (LLM) Data designed to support and enhance a wide range of AI applications. Our globally sourced LLM Data sets are meticulously curated to ensure high quality, diversity, and accuracy, making them ideal for training robust and reliable language models. In addition to LLM Data, we also offer comprehensive datasets across Object Detection Data, Machine Learning (ML) Data, Deep Learning (DL) Data, and Biometric Data. Each dataset is carefully crafted to meet the specific needs of cutting-edge AI and machine learning projects.
Key use cases of our Large Language Model (LLM) Data:
Text generation Chatbots and virtual assistants Machine translation Sentiment analysis Speech recognition Content summarization Why choose FileMarket's data:
Object Detection Data: Essential for training AI in image and video analysis. Machine Learning (ML) Data: Ideal for a broad spectrum of applications, from predictive analysis to NLP. Deep Learning (DL) Data: Designed to support complex neural networks and deep learning models. Biometric Data: Specialized for facial recognition, fingerprint analysis, and other biometric applications. FileMarket's premier sources for top-tier Large Language Model (LLM) Data and other specialized datasets ensure your AI projects drive innovation and achieve success across various applications.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
there are more than 10 images for that person. Face features can be extracted from this database. Also
Off-the-shelf biometric data (human face) covers 3D depth, segmentation: face organs and accessory, key points, facial expression, alpha Matte, age in variety and etc. All the Biometric Data are collected with signed authorization agreement.
Details about IMFDB: Indian Movie Face database (IMFDB) is a large unconstrained face database consisting of 34512 images of 100 Indian actors collected from more than 100 videos. All the images are manually selected and cropped from the video frames resulting in a high degree of variability interms of scale, pose, expression, illumination, age, resolution, occlusion, and makeup. IMFDB is the first face database that provides a detailed annotation of every image in terms of age, pose, gender, expression and type of occlusion that may help other face related applications.
This dataset is modified in such a way that it is ready for training a Face Recognition model. For dataset with annotations as mentioned above, you can download from here(official): https://cvit.iiit.ac.in/projects/IMFDB/
Acknowledgements: https://cvit.iiit.ac.in/projects/IMFDB/ Shankar Setty, Moula Husain, Parisa Beham, Jyothi Gudavalli, Menaka Kandasamy, Radhesyam Vaddi, Vidyagouri Hemadri, J C Karure, Raja Raju, Rajan, Vijay Kumar and C V Jawahar. "Indian Movie Face Database: A Benchmark for Face Recognition Under Wide Variations" National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), 2013.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Mobile Face Liveness Detection
The dataset consists of videos featuring individuals wearing various types of masks. Videos are recorded under different lighting conditions and with different attributes (glasses, masks, hats, hoods, wigs, and mustaches for men). In the dataset, there are 4 types of videos filmed on mobile devices:
2D mask with holes for eyes - demonstration of an attack with a paper/cardboard mask (mask) 2D mask with holes for eyes, nose, and mouth - demonstration… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/on-device-face-liveness-detection.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset comprises of images featuring diverse human faces annotated with various emotions like happiness, sadness, anger, neutral, and surprised. With a standard resolution of pixels, it's suitable for training and evaluating facial expression recognition models, and is publicly accessible.
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Know Your Customer Dataset, Face Detection and Re-identification
The similar dataset that includes all ethnicities - Selfies and ID Dataset
80,000+ photos including 10,600+ document photos from 5,300 people from 28 countries. The dataset includes 2 photos of a person from his documents and 13 selfies. All people presented in the dataset are caucasian. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups. Photo documents… See the full description on the dataset page: https://huggingface.co/datasets/TrainingDataPro/caucasian-people-kyc-photo-dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data is used in the second experimental evaluation of face smile detection in the paper titled "Smile detection using Hybrid Face Representaion" - O.A.Arigbabu et al. 2015.
Download the main images from LFWcrop website: http://conradsanderson.id.au/lfwcrop/ to select the samples we used for smile and non-smile, as in the list.
Kindly cite:
Arigbabu, Olasimbo Ayodeji, et al. "Smile detection using hybrid face representation." Journal of Ambient Intelligence and Humanized Computing (2016): 1-12.
C. Sanderson, B.C. Lovell. Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference. ICB 2009, LNCS 5558, pp. 199-208, 2009
Huang GB, Mattar M, Berg T, Learned-Miller E (2007) Labeled faces in the wild: a database for studying face recognition in unconstrained environments. University of Massachusetts, Amherst, Technical Report
Wirestock's AI/ML Image Training Data, 4.5M Files with Metadata: This data product is a unique offering in the realm of AI/ML training data. What sets it apart is the sheer volume and diversity of the dataset, which includes 4.5 million files spanning across 20 different categories. These categories range from Animals/Wildlife and The Arts to Technology and Transportation, providing a rich and varied dataset for AI/ML applications.
The data is sourced from Wirestock's platform, where creators upload and sell their photos, videos, and AI art online. This means that the data is not only vast but also constantly updated, ensuring a fresh and relevant dataset for your AI/ML needs. The data is collected in a GDPR-compliant manner, ensuring the privacy and rights of the creators are respected.
The primary use-cases for this data product are numerous. It is ideal for training machine learning models for image recognition, improving computer vision algorithms, and enhancing AI applications in various industries such as retail, healthcare, and transportation. The diversity of the dataset also means it can be used for more niche applications, such as training AI to recognize specific objects or scenes.
This data product fits into Wirestock's broader data offering as a key resource for AI/ML training. Wirestock is a platform for creators to sell their work, and this dataset is a collection of that work. It represents the breadth and depth of content available on Wirestock, making it a valuable resource for any company working with AI/ML.
The core benefits of this dataset are its volume, diversity, and quality. With 4.5 million files, it provides a vast resource for AI training. The diversity of the dataset, spanning 20 categories, ensures a wide range of images for training purposes. The quality of the images is also high, as they are sourced from creators selling their work on Wirestock.
In terms of how the data is collected, creators upload their work to Wirestock, where it is then sold on various marketplaces. This means the data is sourced directly from creators, ensuring a diverse and unique dataset. The data includes both the images themselves and associated metadata, providing additional context for each image.
The different image categories included in this dataset are Animals/Wildlife, The Arts, Backgrounds/Textures, Beauty/Fashion, Buildings/Landmarks, Business/Finance, Celebrities, Education, Emotions, Food Drinks, Holidays, Industrial, Interiors, Nature Parks/Outdoor, People, Religion, Science, Signs/Symbols, Sports/Recreation, Technology, Transportation, Vintage, Healthcare/Medical, Objects, and Miscellaneous. This wide range of categories ensures a diverse dataset that can cater to a variety of AI/ML applications.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Expressions
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
90,000+ photos of 46,000+ women from 141 countries. The dataset includes photos of people's faces. All people presented in the dataset are women. The dataset contains a variety of images capturing individuals from diverse backgrounds and age groups.
Our dataset will diversify your data by adding more photos of women of different ages and ethnic groups, enhancing the quality of your model.
People in the dataset
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2Fd1b31bcda4a90b808473dbe5970bebfb%2FFrame%20108.png?generation=1714148221118707&alt=media" alt="">
The dataset can be utilized for a wide range of tasks, including face recognition, age estimation, image feature extraction, or any problem related to human image analysis.
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F12421376%2F2796accc4d7b47e8e1ac02701f4eac7b%2FFemale%20Images.png?generation=1714147921067232&alt=media" alt="">
The dataset consists of: - files - includes 20 images corresponding to each person in the sample, - .csv file - contains information about the images and people in the dataset
keywords: biometric system, biometric dataset, face recognition database, face recognition dataset, face detection dataset, facial analysis, object detection dataset, deep learning datasets, computer vision datset, human images dataset, human faces dataset, machine learning, image-to-image, verification models, digital photo-identification, women images, females dataset, female selfie, female face recognition