8 datasets found
  1. FaceDetectionDataset

    • kaggle.com
    zip
    Updated Dec 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sunil G (2024). FaceDetectionDataset [Dataset]. https://www.kaggle.com/sunilgo/facedetectiondataset
    Explore at:
    zip(160920 bytes)Available download formats
    Dataset updated
    Dec 13, 2024
    Authors
    Sunil G
    Description

    Dataset

    This dataset was created by Sunil G

    Contents

  2. Faces: Age Detection from Images

    • kaggle.com
    Updated Jan 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Möbius (2024). Faces: Age Detection from Images [Dataset]. https://www.kaggle.com/datasets/arashnic/faces-age-detection-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 26, 2024
    Dataset provided by
    Kaggle
    Authors
    Möbius
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Recently, many applications from biometrics,to entertainment use the information extracted from face images that contain information about age, gender, ethnic background, and emotional state. Automatic age estimation from facial images is one of the popular and challenging tasks that have different fields of applications such as controlling the content of the watched media depending on the customer's age. So facial feature analysis has been a topic of interest mainly due to its applicability and Deep Learning techniques are now making it possible for face analysis to be not just a dream but a reality. This simple practice dataset can get you more acquainted with application of deep learning in age detection. #

    https://media.gettyimages.com/photos/facial-recognition-technology-picture-id1139859279?k=6&m=1139859279&s=612x612&w=0&h=H-i0yAM3A49I_r44424-jACD667nxiKb7bZR52ByOA=" alt="im"> #

    Content

    Indian Movie Face database (IMFDB) is a large unconstrained face database consisting of 34512 images of 100 Indian actors collected from more than 100 videos. All the images are manually selected and cropped from the video frames resulting in a high degree of variability interms of scale, pose, expression, illumination, age, resolution, occlusion, and makeup. IMFDB is the first face database that provides a detailed annotation of every image in terms of age, pose, gender, expression and type of occlusion that may help other face related applications.

    The dataset provided a total of 19906 images.The attributes of data are as follows:

    • ID – Unique ID of image
    • Class – Age bin of person in image

    #

    https://ars.els-cdn.com/content/image/1-s2.0-S0925231215017348-gr1.jpg" alt="face"> image ref : Automatic age estimation based on CNN #

    Acknowledgements

    CVIT focuses on basic and advanced research in image processing, computer vision, computer graphics and machine learning. This center deals with the generation, processing, and understanding of primarily visual data as well as with the techniques and tools required doing so efficiently. The activity of this center overlaps the traditional areas of Computer Vision, Image Processing, Computer Graphics, Pattern Recognition and Machine Learning. CVIT works on both theoretical as well as practical aspects of visual information processing. Center aims to keep the right balance between the cutting edge academic research and impactful applied research.

    Inspiration

    The main task is to predict the age of a person from his or her facial attributes. For simplicity, the problem has been converted to a multiclass problem with classes as Young, Middle and Old.

    Version 2: Faces dataset for regression added

    UTKFace dataset is a large-scale face dataset with long age span (range from 0 to 116 years old). The dataset consists of over 20,000 face images with annotations of age, gender, and ethnicity. The images cover large variation in pose, facial expression, illumination, occlusion, resolution, etc. This dataset could be used on a variety of tasks, e.g., face detection, age estimation, age progression/regression, landmark localization, etc. Some sample images are shown as following: https://susanqq.github.io/UTKFace/icon/samples.png" alt="face2">

    • consists of 20k+ face images in the wild (only single face in one image)
    • provides the correspondingly aligned and cropped faces
    • provides the corresponding landmarks (68 points)
    • images are labelled by age, gender, and ethnicity

    Complete Dataset: https://susanqq.github.io/UTKFace/

    The labels of each face image is embedded in the file name, formated like [age]_[gender]_[race]_[date&time].jpg

    • [age] is an integer from 0 to 116, indicating the age
    • [gender] is either 0 (male) or 1 (female)
    • [race] is an integer from 0 to 4, denoting White, Black, Asian, Indian, and Others (like Hispanic, Latino, Middle Eastern).
    • [date&time] is in the format of yyyymmddHHMMSSFFF, showing the date and time an image was collected to UTKFace

    *If you download and find the data useful your upvote is an explicit feedback for future works*

  3. O

    300-W

    • opendatalab.com
    • datasets.activeloop.ai
    zip
    Updated Jul 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imperial College London (2024). 300-W [Dataset]. https://opendatalab.com/OpenDataLab/300w
    Explore at:
    zip(3235379200 bytes)Available download formats
    Dataset updated
    Jul 1, 2024
    Dataset provided by
    Imperial College London
    License

    https://ibug.doc.ic.ac.uk/resources/300-W/https://ibug.doc.ic.ac.uk/resources/300-W/

    Description

    The 300-W is a face dataset that consists of 300 Indoor and 300 Outdoor in-the-wild images. It covers a large variation of identity, expression, illumination conditions, pose, occlusion and face size. The images were downloaded from google.com by making queries such as “party”, “conference”, “protests”, “football” and “celebrities”. Compared to the rest of in-the-wild datasets, the 300-W database contains a larger percentage of partially-occluded images and covers more expressions than the common “neutral” or “smile”, such as “surprise” or “scream”. Images were annotated with the 68-point mark-up using a semi-automatic methodology. The images of the database were carefully selected so that they represent a characteristic sample of challenging but natural face instances under totally unconstrained conditions. Thus, methods that achieve accurate performance on the 300-W database can demonstrate the same accuracy in most realistic cases. Many images of the database contain more than one annotated faces (293 images with 1 face, 53 images with 2 faces and 53 images with [3, 7] faces). Consequently, the database consists of 600 annotated face instances, but 399 unique images. Finally, there is a large variety of face sizes. Specifically, 49.3% of the faces have size in the range [48.6k, 2.0M] and the overall mean size is 85k (about 292 × 292) pixels.

  4. CatFLW

    • kaggle.com
    zip
    Updated Apr 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    George Martvel (2024). CatFLW [Dataset]. https://www.kaggle.com/datasets/georgemartvel/catflw
    Explore at:
    zip(1446970985 bytes)Available download formats
    Dataset updated
    Apr 16, 2024
    Authors
    George Martvel
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The Cat Facial Landmarks in the Wild (CatFLW) dataset contains 2079 images of cats' faces in various environments and conditions, annotated with 48 facial landmarks and a bounding box on the cat’s face.

    If you use the CatFLW, please cite the dataset paper:

    @article{martvel2023catflw,
     title={Catflw: Cat facial landmarks in the wild dataset},
     author={Martvel, George and Farhat, Nareed and Shimshoni, Ilan and Zamansky, Anna},
     journal={arXiv preprint arXiv:2305.04232},
     year={2023}
    }
    

    You can also check out the landmark detection paper and compare the detection performance on the CatFLW:

    @article{martvel2024automated,
     title={Automated Detection of Cat Facial Landmarks},
     author={Martvel, George and Shimshoni, Ilan and Zamansky, Anna},
     journal={International Journal of Computer Vision},
     pages={1--16},
     year={2024},
     publisher={Springer}
    }
    
  5. Expression in-the-Wild (ExpW) Dataset

    • kaggle.com
    zip
    Updated Jul 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shahzad Abbas (2023). Expression in-the-Wild (ExpW) Dataset [Dataset]. https://www.kaggle.com/datasets/shahzadabbas/expression-in-the-wild-expw-dataset/suggestions
    Explore at:
    zip(8117492353 bytes)Available download formats
    Dataset updated
    Jul 27, 2023
    Authors
    Shahzad Abbas
    Description

    Data Description

    The Expression in-the-Wild (ExpW) Dataset is a comprehensive and diverse collection of facial images carefully curated to capture spontaneous and unscripted facial expressions exhibited by individuals in real-world scenarios. This extensively annotated dataset serves as a valuable resource for advancing research in the fields of computer vision, facial expression analysis, affective computing, and human behavior understanding.

    Key Features:

    1. Real-world Expressions: The ExpW dataset stands apart from traditional lab-controlled datasets as it focuses on capturing facial expressions in real-life environments. This authenticity ensures that the dataset reflects the natural diversity of emotions experienced by individuals in everyday situations, making it highly relevant for real-world applications.

    2. Large and Diverse: Comprising a vast number of images, the ExpW dataset encompasses an extensive range of subjects, ethnicities, ages, and genders. This diversity allows researchers and developers to build more robust and inclusive models for facial expression recognition and emotion analysis.

    3. Annotated Emotions: Each facial image in the dataset is meticulously annotated with corresponding emotion labels, including but not limited to happiness, sadness, anger, surprise, fear, disgust, and neutral expressions. The emotion annotations provide ground truth data for training and validating machine learning algorithms.

    4. Various Pose and Illumination: To account for the varying challenges posed by real-life scenarios, the ExpW dataset includes images captured under different lighting conditions and poses. This variability helps researchers create algorithms that are robust to changes in illumination and head orientation.

    5. Privacy and Ethics: ExpW has been compiled adhering to strict privacy and ethical guidelines, ensuring the subjects' consent and data protection. The dataset maintains a high level of anonymity by excluding any personal information or sensitive details.

    This dataset has been downloaded from the following Public Directory... https://drive.google.com/drive/folders/1SDcI273EPKzzZCPSfYQs4alqjL01Kybq

    Dataset contains 91,793 faces manually labeled with expressions (Figure 1). Each of the face images is annotated as one of the seven basic expression categories: “angry (0)”, “disgust (1)”, “fear (2)”, “happy (3)”, “sad (4)”, “surprise (5)”, or “neutral (6)”.

  6. O

    iBugMask

    • opendatalab.com
    zip
    Updated Mar 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Imperial College London (2023). iBugMask [Dataset]. https://opendatalab.com/OpenDataLab/iBugMask
    Explore at:
    zip(5678248733 bytes)Available download formats
    Dataset updated
    Mar 31, 2023
    Dataset provided by
    Imperial College London
    Description

    iBugMask is an in-the-wild face parsing dataset that contains 1,000 challenging face images and manually annotated labels for 11 semantic classes: background, facial skin, left/right brow, left/right eye, nose, upper/lower lip, inner mouth, and hair. The images are curated from challenging in-the-wild face alignment datasets, including 300W and Menpo. Compared with the existing face parsing datasets, iBugMask contains in-the-wild scenarios such as “party” and “conference”, which include more challenging appearance variations or multiple faces. There is a larger number of profile faces. More expressions other than ”neutral” and ”smile” are also included (e.g. ”surprise” and ”scream”).

  7. DogFLW

    • kaggle.com
    zip
    Updated Jul 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    George Martvel (2025). DogFLW [Dataset]. https://www.kaggle.com/datasets/georgemartvel/dogflw/code
    Explore at:
    zip(1426348794 bytes)Available download formats
    Dataset updated
    Jul 2, 2025
    Authors
    George Martvel
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The Dog Facial Landmarks in the Wild (DogFLW) dataset contains 4335 images of dogs' faces in various environments and conditions, annotated with 46 facial landmarks and a bounding box on the dog’s face.

    If you use the DogFLW, please cite the paper:

    @article{martvel2025dog,
     title={Dog facial landmarks detection and its applications for facial analysis},
     author={Martvel, George and Zamansky, Anna and Pedretti, Giulia and Canori, Chiara and Shimshoni, Ilan and Bremhorst, Annika},
     journal={Scientific Reports},
     volume={15},
     number={1},
     pages={21886},
     year={2025},
     publisher={Nature Publishing Group UK London}
    }
    

    You can also check out the cat landmark detection paper:

    @article{martvel2024automated,
     title={Automated Detection of Cat Facial Landmarks},
     author={Martvel, George and Shimshoni, Ilan and Zamansky, Anna},
     journal={International Journal of Computer Vision},
     pages={1--16},
     year={2024},
     publisher={Springer}
    }
    
  8. Face Synthetics Glasses

    • kaggle.com
    Updated Feb 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mantas (2024). Face Synthetics Glasses [Dataset]. https://www.kaggle.com/datasets/mantasu/face-synthetics-glasses
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 2, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mantas
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    About

    Binary semantic segmentation dataset for glasses. This is just a specific category, i.e., GLASSES or ID=16, of the original Face Synthetics dataset introduced in Fake It Till You Make It: Face analysis in the wild using synthetic data alone.

    Info

    The dataset has the following structure (a total of 14,303 256x256 face-centered images and corresponding masks): bash └── face-synthetics-glasses ├── test │ ├── images <- 1450 (about 10%) of 256x256 test images │ └── masks <- 1450 (about 10%) of 256x256 corresponding masks │ ├── train │ ├── images <- 11372 (about 80%) of 256x256 train images │ └── masks <- 11372 (about 80%) of 256x256 corresponding masks │ └── val ├── images <- 1481 (about 10%) of 256x256 validation images └── masks <- 1481 (about 10%) of 256x256 corresponding masks

    Collection

    The dataset was collected in the following way (full processing script is available in this gist):

    Step 1: The original dataset parts were downloaded from the official links, then unzipped:

    cat dataset_100000.zip.* > dataset_100000.zip
    unzip dataset_100000.zip 
    

    Step 2: Extracted only those image and mask pairs for which glasses exist and split them to train, test, and val directories (files chosen randomly)

    Step 3: Applied Face Crop Plus with the provided landmarks to crop only the face area with a varying face factor from 0.65 to 0.95

    Citation

    @misc{face-synthetics-glasses,
      author = {{Kaggle Contributors}},
      title = {Face Synthetics Glasses},
      year = {2024},
      publisher = {Kaggle},
      journal = {Kaggle datasets},
      howpublished = {\url{https://www.kaggle.com/datasets/mantasu/face-synthetics-glasses}}
    }
    
    @inproceedings{wood2021fake,
     title={Fake it till you make it: face analysis in the wild using synthetic data alone},
     author={Wood, Erroll and Baltru{\v{s}}aitis, Tadas and Hewitt, Charlie and Dziadzio, Sebastian and Cashman, Thomas J and Shotton, Jamie},
     booktitle={Proceedings of the IEEE/CVF international conference on computer vision},
     pages={3681--3691},
     year={2021}
    }
    
  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sunil G (2024). FaceDetectionDataset [Dataset]. https://www.kaggle.com/sunilgo/facedetectiondataset
Organization logo

FaceDetectionDataset

Faces annotated from Labeled Faces in the Wild (LFW)

Explore at:
zip(160920 bytes)Available download formats
Dataset updated
Dec 13, 2024
Authors
Sunil G
Description

Dataset

This dataset was created by Sunil G

Contents

Search
Clear search
Close search
Google apps
Main menu